Eupedia Forums

Thread: How many samples are statistically significant?

  1. #1
    JAK2

    Join Date
    25-11-10
    Location
    LYON
    Age
    72
    Posts
    7

    Y-DNA haplogroup
    G
    MtDNA haplogroup
    H1

    Ethnic group
    EASTERN EUROPE
    Country: France



    How many samples are statistically significant?

    I am a newcomer to this topic and, as a former MD, I am not really convinced that 400 to 2,400 samples can serve as a reference for a true result...
    In a topic as complex as population genetics, what does a result obtained from 2,450 samples mean for a country of 80 million? That is the figure I just read for Germany in the forum's results tables, and many other places and groups have far fewer, down to a few hundred.
    I have read some criticism of attitudes seen as driven by a political agenda, and of the misuse of proportions: a result that concerns only 45% of an ethnic group is treated as a definitive majority for the whole. What about all the other people who don't fit in the box?
    Rapid progress shows that some peremptory results are in fact quite controversial...
    Who can give me more information along this line of questioning?
    Thanks a lot
    Warmest regards

  2. #2
    Maciamo
    Satyavrata
    Join Date
    17-07-02
    Location
    Lothier
    Posts
    5,683


    Ethnic group
    Celto-germanic
    Country: Belgium - Brussels



    Sorry for the very delayed reply. I sometimes miss some threads.

    The number of samples required to obtain statistically significant results depends chiefly on two factors:

    1) the size of the population tested. Obviously the sample size for Luxembourg can be smaller than for Germany, and Germany smaller than China.

    2) the heterogeneity of the population tested. Some modern populations have grown very fast over the last few centuries, while others have grown more steadily over the ages. I pointed out in a thread 3 years ago that in the early 19th century Belgium was twice as populous as the Netherlands, while today the latter has a population 60% bigger than Belgium's. Within Belgium, Wallonia used to be more populous than Flanders a few centuries ago. Flanders is now nearly twice as populous due to much faster growth in the 20th century.

    In 1350, France had a population of 20 million, only about a third of today's. If we deduct all the people with foreign surnames in France (immigration of the last few centuries), we see that the French population has only grown 2.5-fold in roughly 660 years, which is very little. In comparison, in 1350 Britain had a population of roughly 4 million (3 million in England), Poland 2 million and Russia 8 million. These countries' populations have grown approximately 15- to 20-fold. Italy had 10 million and Spain 7 million; each experienced about a 6-fold increase.

    So it's only natural that the genetic diversity should be higher in countries like France, Belgium, Italy and Spain than in northern or eastern Europe. In fact, the size of the historical population since the Middle Ages is fairly well reflected by the diversity of surnames. Italy, France and Belgium have the highest number of surnames per capita in Europe, while Scandinavia, the British Isles and most Slavic countries have among the lowest.

    The second factor is the most important, yet also the most overlooked.

    So what is the minimum sample size necessary to be relevant? In northern and eastern Europe, where the medieval population density was much lower than in the former Roman Empire, I would say that 50 samples per million inhabitants (today's population) already give a pretty good idea. This means 3000 samples for Britain, 2000 for Poland, or 250 for Denmark or Finland. Countries like Ireland clearly have more than enough Y-DNA samples to give a quite accurate picture. For countries like France, Belgium, Italy or Greece, 250 samples per million inhabitants are necessary, and they need to be selected carefully to cover every region, as there are often major disparities even between small adjacent regions (e.g. Cantabria vs the Basque Country, or Crete vs the Peloponnese, or Auvergne vs Rhône-Alpes). In other words, Belgium and Greece would need 2500 samples, France and Italy 15,000 samples.

    Spain and Portugal are a bit different because a large part of the medieval (Muslim and Jewish) population was expelled in the 15th century, and the modern population therefore grew from a smaller portion of the medieval population, which explains why the surname diversity is also lower. I would place them in an intermediate category, along with Germany, and estimate that 100 samples per million inhabitants are representative enough (so 1000 samples for Portugal, 4000 for Spain, and 8000 for Germany).
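
The three per-million rates above amount to a simple multiplication, which can be sketched in Python as follows. This is only an illustration of the rule of thumb, not a statistical method. The tier names and the `required_samples` helper are invented for the example, and the populations (in millions) are round figures back-calculated from the totals quoted in the post (e.g. 3000 samples / 50 per million = 60 million), not census data.

```python
# Rule-of-thumb sampling rates from the post, in samples per million inhabitants.
# The tier names are labels made up for this sketch.
SAMPLES_PER_MILLION = {
    "low": 50,            # northern and eastern Europe
    "intermediate": 100,  # Spain, Portugal, Germany
    "high": 250,          # France, Belgium, Italy, Greece
}

# Assumed populations in millions, rounded so that they reproduce the totals
# given in the post. They are illustrative, not official figures.
COUNTRIES = {
    "Britain": (60, "low"),
    "Poland": (40, "low"),
    "Denmark": (5, "low"),
    "Portugal": (10, "intermediate"),
    "Spain": (40, "intermediate"),
    "Germany": (80, "intermediate"),
    "Belgium": (10, "high"),
    "Greece": (10, "high"),
    "France": (60, "high"),
    "Italy": (60, "high"),
}


def required_samples(population_millions, tier):
    """Return the suggested sample count: population (millions) x rate for the tier."""
    return round(population_millions * SAMPLES_PER_MILLION[tier])


if __name__ == "__main__":
    for country, (population, tier) in COUNTRIES.items():
        print(f"{country}: {required_samples(population, tier)} samples")
```

Because the rate is applied to the present-day population, moving a country from one tier to another changes its target proportionally: an assumed 60 million inhabitants need 3000 samples in the lowest tier but 15,000 in the highest.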
    Last edited by Maciamo; 03-10-11 at 13:14.

  3. #3
    MarTyro
    Regular Member
    Join Date
    06-08-11
    Posts
    39

    Y-DNA haplogroup
    J2

    Country: Austria



    Quote: Originally posted by Maciamo
    minimum sample size necessary to be relevant
    France and Italy 15,000 samples
    8000 for Germany
    4000 for Spain
    3000 samples for Britain
    Belgium and Greece 2500 samples
    2000 for Poland
    1000 samples for Portugal
    250 for Denmark or Finland
    Interesting. I guess France, Italy, and also Germany and Spain are still far from reaching those sizes. I would also add Switzerland and Austria as important Alpine refugia, the Balkans as an important old melting pot, and Hungary/Czechia/Slovakia as indicators of some immigration. The haplogroup definitions and nomenclature also seem important to me: older studies can cause problems. So we should expect some news (subclades, haplogroup enclaves, etc.). A central haplogroup distribution database should be built (maybe as an EU project?), where every scientist and interested researcher can run their own calculations. But is there none?
    Last edited by MarTyro; 05-10-11 at 07:42. Reason: regroup
