• Skip to Main Content
  • Media Center

prb-hero

Mission and Values

About population reference bureau.

PRB is a nonpartisan, not-for-profit research organization focused on improving people’s health and well-being through evidence-based policies and practices. Our staff analyze population data and ensure the research and its applications are understood and used widely by decisionmakers, advocates, and media. Our ability to assess and easily communicate critical issues about topics like gender equality, climate adaptation, equity, and aging makes us a valued partner and resource for those working at all levels and in all areas of the world.

OUR MISSION

PRB harnesses the power of population data to inform and advance solutions to the most urgent challenges facing our world and improve the well-being of people globally.

What’s population data? 

It’s how we measure information about a population—from gender roles to where people live, how many children they have, how long they live—and identify trends.

  • How can cities prepare for increased growth, including internal migration, hastened by climate events?
  • Why is the mortality rate among young people in the United States so much higher than in other wealthy nations?
  • How can countries with rapidly aging populations continue to support women’s advancement if women also provide most of care needs to older adults?

Understanding population data and demographic trends is essential to solving the important problems of today and tomorrow. As we work toward solutions, we must consider how people are affected by these problems and, in turn, how people can affect them.

Our Strategic Plan

OUR ACTIONS

We provide data and communication solutions to advocates, policymakers, and practitioners so they can make informed decisions that affect communities around the world. PRB:

  • Forecasts housing and transportation needs for U.S. localities.
  • Supports civil society organizations in East and West Africa to develop evidence-based materials to advocate for expanded access to health care services.
  • Serves as an intermediary between data users and government agencies to facilitate knowledge-sharing via virtual and in-person forums.
  • Trains and mentors journalists and social media influencers from Nepal to Nigeria to clearly and accurately communicate technical information on topics from sexual and reproductive health and rights to early childhood development to the demographic transition.
  • Communicates technical knowledge to and motivates fresh thinking among practitioners by showcasing and highlighting key data and insights from ongoing and completed work, sharing relevant information, ideas, tools, and resources.

OUR PRINCIPLES

Collaboration.  We believe that the most powerful solutions occur when we collaborate with and learn from one another. PRB works closely with local organizations and partners so community members lead, set priorities, and identify solutions that are grounded in local realities. Internally, we seek to break down silos to identify where we can achieve more together.

Equity. We recognize and honor the fundamental rights and dignity of all individuals. We are committed to promoting equity throughout the organization and in our relationships with partners and funders. We remain committed to doing work that exposes inequities, challenges conventional thinking, and leads to positive outcomes within the organization and for women, children, and families globally. (Read about our commitment to diversity, equity, and inclusion .)

Inclusivity.  The differences each person brings to PRB make us stronger and better. We are committed to sustaining a work environment that respects and supports diverse backgrounds, traditions, heritages, lifestyles, beliefs, values, and experiences. We are also dedicated to equity and inclusivity of data for accurate and ethical population analysis, research translation, and information sharing.

Integrity.  We believe quality information is essential to the creation of sound policies and programs. Our analyses will always be rigorous, objective, nonpartisan, and evidence-based, reflecting our unwavering commitment to accuracy and reliability.

Learning and Innovation.  We strive to be an adaptive learning organization: curious, transparent, and open to new ideas and ways of doing things. We actively seek new subjects of inquiry, work products, and methods for achieving results. We make evidence-based decisions, measure the impact of our work, and apply lessons learned so that PRB and the people who work here can improve.

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • My Account Login
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Open access
  • Published: 29 April 2014

Geographic population structure analysis of worldwide human populations infers their biogeographical origins

  • Eran Elhaik 1 , 2   na1 ,
  • Tatiana Tatarinova 3   na1 ,
  • Dmitri Chebotarev 4 ,
  • Ignazio S. Piras 5 ,
  • Carla Maria Calò 5 ,
  • Antonella De Montis 6 ,
  • Manuela Atzori 6 ,
  • Monica Marini 6 ,
  • Sergio Tofanelli 7 ,
  • Paolo Francalacci 8 ,
  • Luca Pagani 9 ,
  • Chris Tyler-Smith 9 ,
  • Yali Xue 9 ,
  • Francesco Cucca 5 ,
  • Theodore G. Schurr 10 ,
  • Jill B. Gaieski 10 ,
  • Carlalynne Melendez 10 ,
  • Miguel G. Vilar 10 ,
  • Amanda C. Owings 10 ,
  • Rocío Gómez 11 ,
  • Ricardo Fujita 12 ,
  • Fabrício R. Santos 13 ,
  • David Comas 14 ,
  • Oleg Balanovsky 15 , 16 ,
  • Elena Balanovska 16 ,
  • Pierre Zalloua 17 ,
  • Himla Soodyall 18 ,
  • Ramasamy Pitchappan 19 ,
  • ArunKumar GaneshPrasad 19 ,
  • Michael Hammer 20 ,
  • Lisa Matisoo-Smith 21 ,
  • R. Spencer Wells 22 &

The Genographic Consortium

Nature Communications volume  5 , Article number:  3513 ( 2014 ) Cite this article

42k Accesses

94 Citations

377 Altmetric

Metrics details

  • Biogeography
  • Population genetics

A Corrigendum to this article was published on 31 October 2016

This article has been updated

The search for a method that utilizes biological information to predict humans’ place of origin has occupied scientists for millennia. Over the past four decades, scientists have employed genetic data in an effort to achieve this goal but with limited success. While biogeographical algorithms using next-generation sequencing data have achieved an accuracy of 700 km in Europe, they were inaccurate elsewhere. Here we describe the Geographic Population Structure (GPS) algorithm and demonstrate its accuracy with three data sets using 40,000–130,000 SNPs. GPS placed 83% of worldwide individuals in their country of origin. Applied to over 200 Sardinians villagers, GPS placed a quarter of them in their villages and most of the rest within 50 km of their villages. GPS’s accuracy and power to infer the biogeography of worldwide individuals down to their country or, in some cases, village, of origin, underscores the promise of admixture-based methods for biogeography and has ramifications for genetic ancestry testing.

Similar content being viewed by others

research reference population

Phylogenomics and the rise of the angiosperms

research reference population

Genome-wide association studies

research reference population

Network of large pedigrees reveals social practices of Avar communities

Introduction.

The ability to identify the geographic origin of an individual using biological data, poses a formidable challenge in anthropology and genetics because of its complexity and potentially dangerous misinterpretations 1 . For instance, owing to the intrinsic nature of biological variation, it is difficult to say where one population stops and another starts by looking at the spatial distribution of a trait (for example, hair colour 2 ). Darwin acknowledged this problem, stating that ‘it may be doubted whether any character can be named which is distinctive of a race and is constant’ 3 . Yet, questions of biogeography and genetic diversity such as ‘why we are what we are where we are?’ have piqued human curiosity as far back as Herodotus of Halicarnassus, who has been called ‘the first anthropologist’ 4 . However, only in the past decade have researchers begun harnessing high-throughput genetic data to address them.

The work of Cavalli-Sforza and other investigators 5 , 6 , 7 established a strong relationship between genetic and geographic distances in worldwide human populations, with major deviations explicable by admixture or extreme isolation 8 . These observations, stimulated the development of biogeographical methods. Accurate measurements of biogeography began at the cusp of the DNA sequencing era and have since been applied extensively to study genetic diversity and anthropology 9 , 10 , infer origin and ancestry 11 , 12 , map genes 13 , identify disease susceptibility markers 2 and correct for population stratification in disease studies 14 .

Biogeographic applications currently employ principal component (PC)-based methods such as PC analysis (PCA), which was shown to be accurate within 700 km in Europe 11 , and the most recent Spatial Ancestry Analysis (SPA) 12 that explicitly models allele frequencies. Yet, estimated by the percentage of individuals correctly assigned to their country of origin, the accuracy of PCA and SPA is relatively low for Europeans (40±5 and 45±5%, respectively) and much lower for non-Europeans 12 . Such results suggest limitations with these approaches for biogeographical inference 12 , 15 , 16 , coupled with a possible limitation of commonly used genotyping arrays in capturing fine substructure 17 , 18 . As patterns of genetic diversity in human populations or admixture are frequently described as genome-based estimates of several ancestries that sum to 100% (refs 19 , 20 , 21 ), we speculated that an admixture-based approach may yield better results.

To overcome the possible limitations of markers included on commonly used microarrays, we used the GenoChip, a dedicated genotyping array for population genetics 17 , which includes over 100,000 ancestral informative markers (AIMs). These autosomal AIMs were collected from the literature and captured using tools such as AimsFinder, which identifies the smallest number of markers sufficient to differentiate a pair of populations that are genetically distinct. AimsFinder was applied to private and public data sets for multiple populations, many of which were not studied before or searched for AIMs, and the recovered markers were included in the GenoChip.

To overcome the methodological limitations, we developed an admixture-based Geographic Population Structure (GPS) method for predicting the biogeographical origin of worldwide individuals from resident populations. We demonstrated the performances of this method on three data sets and compared its results with those of SPA 12 . GPS outperformed SPA in all analyses, demonstrating the power of admixture-based tools to infer biogeography.

GPS implementation

The GPS method consists of two steps. In the first step, carried out once, we constructed a diverse panel of worldwide populations and analysed them using an unsupervised ADMIXTURE analysis. This analysis yielded allele frequencies for K hypothetical populations whose genotypes can be simulated to form putative ancestral populations. Next, from the data set of worldwide population, we constructed a smaller data set of reference populations that are both genetically diverse and have resided in their current geographical region for at least few centuries. These populations were next analysed in a supervised ADMIXTURE analysis that calculated their admixture proportions in relation to the putative ancestral populations ( Fig. 1 ).

figure 1

Admixture analysis was performed for K =9. For brevity, subpopulations were collapsed. The x axis represents individuals from populations sorted according to their reported ancestries. Each individual is represented by a vertical stacked column of colour-coded admixture proportions that reflects genetic contributions from putative ancestral populations.

Looking at the resulting graph, we found that all populations exhibit a certain amount of admixture, with Puerto Ricans and Bermudians exhibiting the highest diversity and Yoruba the least. We further found distinct substructure among geographically adjacent populations that decreased in similarity with distance, suggesting that populations can be localized based on their admixture patterns. To correlate the admixture patterns with geography, we calculated two distance matrices between all reference populations based on their mean admixture fractions (GEN) and their mean geographic distances (GEO) from each other. Using these distance matrices, we calculated the relationship between GEN and GEO (Equation 1).

In the second step, GPS inferred the geographical coordinates of a sample of unknown origin by performing a supervised ADMIXTURE analysis for that sample with the putative ancestral populations. It then calculated the Euclidean distance between the sample’s admixture proportions and GEN. The shortest distance, representing the test sample’s deviation from its nearest reference population, was subsequently converted into geographical distance using the inferred relationships (Equation 1). The final position of the sample on the map was calculated by a linear combination of vectors, with the origin at the geographic centre of the best matching population weighted by the distances to M nearest reference population and further scaled to fit on a circle with a radius proportional to the geographical distance obtained by Equation 1 (see Calculating the bio-origin of a test sample in the Methods).

Biogeographical prediction for worldwide individuals

We applied GPS to approximately 600 worldwide individuals collected as part of the Genographic Project and the 1000 Genomes Project and genotyped on the GenoChip ( Supplementary Table 1 ). We included the highly heterogeneous populations of Kuwait 22 , Puerto Rico and Bermuda 23 , as well as communities from the same country, such as Peruvians from Lima and indigenous highland Peruvians. We tested the accuracy of GPS predictions using the leave-one-out procedure. The resulting figure bears a notable resemblance to the world’s geographic map ( Fig. 2 ). Individuals from the same geographic regions clustered together, and populations from different countries were largely distinguished. Assignment accuracy was determined for each individual based on whether the predicted geographical coordinates were within the political boundaries of the country and regional locations. GPS correctly assigned 83% of the individuals to their country of origin, and, when applicable, ~66% of them to their regional locations ( Fig. 3 , Supplementary Table 2 ), with high sensitivity (0.75) and specificity (0.99). These results supported the known connection between admixture patterns and geographic origins in worldwide populations 5 , 6 , 7 , 8 .

figure 2

( a ) Small coloured circles with a matching colour to geographical regions represent the 54 reference points used for GPS predictions. Each circle represents a geographical point with longitude and latitude and a certain admixture proportion. The insets provide magnification for dense regions. ( b ) GPS individual assignment based on 54 points. Individual label and colour match their known region/state/country of origin using the following legend: BE (Bermudian), BU (Bulgarian), CHB (Chinese), DA (Danish), EG (Egyptian), FIN (Finnish), GO (Georgian), GR (German), GK (Greek), I-S/N/W/E (India, Southern/Northern/Western/Eastern), IR (Iranian), ID/TSI (Italy: Sardinian/Tuscan), JPT (Japanese), LWK (Kenya: Luhya), KU (Kuwaiti), LE (Lebanese), M-O/B/N/D/T (Madagascar: Antananarivo/Ambilobe/Manakara/Andilambe/Toliara), X-G/H/M (Mexico: Guanajuato/Hidalgo/Morelos), MG (Mongolian), N-S/K/H/T (Namibia: Southeastern/Kaokoveld/Hereroland/Tsumkwe), YRI (Yoruba from West African), P-C/N (Papuan: Papua New Guinea/Bougainville-Nasioi), PH/PEL (Peruvian: Highland/Lima), PR (Puerto Rican), RO (Romanian), CA (Northern Caucasian), R-M/T/A (Russians: Moscow/Tatarÿ/Altaian), S-J/U/S/K/ (RSA: Johannesburg/Underberg/Northern Cape/Free State), IBS (Iberian from Spain & Portugal), PT (Pamiri from Tajikistan), TU (Tunisian), UK (British from United Kingdom), VA (Vanuatu), KHV (Vietnam). Note : occasionally all samples of certain populations (for example, Vietnamese) were predicted to the same spot and thus appear as a single sample.

figure 3

Populations for which regional data were available are marked with an asterisk. The average accuracy per population is shown in red and is calculated across populations given equal weights.

In terms of distances from the point of true origin, GPS placed 50% of the samples within 87 km from the origin with 80 and 90% of them within 645 and 1,015 km from the origin, respectively. GPS further discerned geographically adjacent populations known to exhibit high genetic similarity, such as Greeks and Italians. The prediction accuracy was correlated with both countries’ area ( N =600, r =0.3, Student’s t -test P -value=0.03) and admixture diversity ( N =600, r =−0.34, Student’s t -test P -value=0.01).

Individuals belonging to recently mixed populations, such as Kuwaitis and Bermudians, proved to be the most difficult to correctly predict, because their mixing was temporally brief and insufficient to generate a distinct regional admixture signature. As a result, such individuals are more likely to be placed within their original countries of origin, which is incorrect according to our scoring matric. For example, Kuwaiti individuals whose ancestors come from Saudi Arabia, Iran and other regions of the Arabian Peninsula 22 were predicted to come from these regions rather than their current state.

To test GPS’s accuracy with individuals from populations that were not included in the reference population set, we conducted two analyses. We first repeated the previous analysis using the leave-one-out procedure at the population level. As expected, GPS accuracy decreased with 50% of worldwide individuals predicted to be 450 km away from their true origin. The predicted distance increased to 1,100 and 1,750 km for 80 and 90% of the individuals, respectively ( Fig. 4a ). Because GPS best localizes individuals surrounded by M genetically related populations, populations from island nations (for example, Japan and United Kingdom) or populations whose most related populations were under-represented in our reference population data set (for example, Peru and Russia) were most poorly predicted. Consequently, the median distances to the true origin were much smaller for individuals residing in Europe (250 km), Africa (300 km) and Asia (450 km) due to their being more commonly represented in the reference population data set compared with Native Americans and Oceanians. These results represent the upper limit of GPS’s accuracy when the specific population of the test individual is absent from the reference population data set.

figure 4

Calculated for individuals of the Genographic (left) and the HGDP (right) data sets.

Next, we analysed over 600 individuals from the Human Genome Diversity Panel (HGDP) whose subpopulations and populations reside in countries that are not covered by our reference population data set ( Supplementary Table 1 ). GPS’s accuracy further decreased with 50% of worldwide individuals predicted to be 1,250 km away from their true origin ( Fig. 4b , Supplementary Data 1 ). As before, geographically remote populations were less accurately predicted with a higher error for regions that were poorly represented in the reference population data set. For example, the Brazilian Surui were predicted to be located 4,800 km away from their true origin. This was not surprising because the closest population in the reference population data set resided in Central America. By contrast, remote European populations that reside on islands or along ocean shores and are not surrounded by other populations (for example, Orcadians and French) were predicted to be ~1,200 km from their true origin, due to the higher density of nearby populations in the reference population data set. The results were also affected by populations with a history of recent migrations, such as Bedouins, Druze 24 and the Pakistani Hazara, the latter being suspected of having some Mongolian ancestry 25 . Adding the HGDP populations to our reference population data set yielded similar results to those reported in Fig. 3 . Overall, these results illustrated the dependency of GPS on the density of the reference population data set and indicated that accuracy improves with the inclusion of additional populations residing in geographically distant or isolated regions.

GPS applicability using a thinner marker set

PC-based applications have long aspired to provide accurate results down to the level of an individual’s village. However, due to different factors such as cohort effects 26 , these solutions have been mostly ad hoc . In fact, PC solutions were shown to discern only populations of selected cohorts, such as Italian villagers 27 or Europeans 11 , 12 . When individuals of various ancestries are included in the cohort, the PCs are altered to the point where none of the individuals are correctly predicted to their country of origin or continental regions.

To test the precision of GPS’s predictions given finer regional annotation, we assessed 243 Southeast Asians and Oceanians and 200 Sardinians from 10 villages (4–180 km apart) using subsets of 40,000 and 65,000 GenoChip markers, respectively. We first tested whether admixture frequencies calculated over a smaller set of GenoChip markers provided sufficient accuracy. For this assessment, we carried out a series of admixture analyses in a supervised mode for nine 1000 genomes worldwide populations using smaller sets of markers (95,000, 65,000 and 40,000) and compared the admixture proportions with those obtained using the complete marker set ( Fig. 5 ). We found small differences in the admixture proportions that slowly increased for smaller GenoChip marker sets. The largest observed difference (3%) for the smallest number of markers used in our analyses (40,000) was within the natural variation range of our populations and did not affect the assignment accuracy. We were thus able to supplement the reference population set with the newly tested populations.

figure 5

The mean (left) and maximum (right) absolute difference in individual admixture coefficients are shown.

Fine-scale biogeography down to home island

Next applied to Southeast Asians and Oceanians ( Supplementary Table 3 , Supplementary Fig. 1 ), GPS’s prediction accuracy was stringently estimated as the individual assignment to the region occupied by one’s population or subpopulation. The prediction accuracy for Han Chinese (64%) and Japanese (88%) obtained here using ~40,000 markers was the same as that obtained in the complete data set, as expected ( Fig. 5 ).

GPS’s assignment accuracy for the remaining Southeast Asian and Oceanian populations (87.5%) and subpopulations (77%) ( Fig. 6 ) was higher than that obtained for worldwide populations ( Fig. 3 ). These results reflect GPS’s greatest advantage compared with alternative methods. Unlike PCA and SPA whose accuracy is lost with the addition of samples of various ancestries 12 , GPS predictions increase in accuracy when provided a more comprehensive reference set.

figure 6

Pie charts depicts correct mapping at the subpopulation level (red), population level (black) and incorrect mapping (white).

A few populations stand out in that they are not reliably assigned to their region of origin ( Fig. 6 ). Polynesians and Fijians in particular are not well predicted and incur the highest misclassification rates (47 and 40%, respectively) mainly to Nusa Tenggara and the Moluccas Islands. These results are not surprising given two main issues. First, Polynesian populations, East Polynesian populations in particular, are not well represented in the large databases from which the GenoChip’s ancestry informative markers were ascertained 17 , so a likely ascertainment bias exists for Polynesia. The second issue relates to the complex settlement history of the Oceania region. Interestingly, this aspect of population history is clearly reflected in the results produced.

The component identified in the admixture analysis ( Supplementary Figure 1 ) as representing Oceania (pink) most likely represents the early migrants into the region some 50,000 years ago. This component is dominant in populations from New Guinea and Australia, which were joined together, making up the ancient landmass of Sahul, until approximately 11,000 years ago when they became separated due to rising sea levels. This Oceanic signature is also seen in Island Southeast Asia, such as Nusa Tenggara and the Moluccas, which indicates the likely pathway taken to Sahul. The Remote Oceanic settlement, represented here by Fiji and Polynesia, is much more recent and has been associated with the Neolithic expansion of peoples out of East Asia, through Island Southeast Asia and ultimately through Near Oceania and the rest of the Pacific including Polynesia.

The first people to arrive in Remote Oceania (the region east of the Solomon Islands) did so only about 3,000 years ago and are associated with the expansion of the Lapita cultural complex as far east as Fiji, Samoa and Tonga, on the edge of the Polynesian Triangle. Mitochondrial DNA and Y chromosomal data from Remote Oceanic populations, Polynesians in particular, indicate mixed ancestry 28 . MtDNA suggests primarily Island Southeast Asian ancestry for Remote Oceania, indicated by high frequencies of mtDNA haplogroup B4a1a and descendent lineages, with some Near Oceanic contributions (identified by haplotypes belonging to haplogroups P and Q). Y chromosome studies, however, show a stronger Near Oceanic component in Polynesian ancestry, with some Southeast Asian contribution 29 . Genome-wide studies are consistent with this mixed ancestry for Polynesian, Remote Oceanic and some Near Oceanic populations 30 . Our findings, therefore, represent the heterogeneity of Remote Oceanian populations due to their long history of expansions and settlements, which is reflected by their complex population structure ( Supplementary Fig. 1 ) and GPS predictions ( Fig. 6 ).

Fine-scale biogeography down to home village

The island of Sardinia (24,090 km 2 ) was first settled 14,000 years ago and experienced a complex demographic history that includes low effective sizes due to plagues and wars and scant matrimonial movement, which accentuated stochastic effects. Interestingly, Sardinians have been described both as a genetic isolate with endogamy peaking in the central-southern and mountain areas with little internal mobility 31 and a heterogeneous population when microareas or close single village are considered 32 , 33 , 34 .

Applied to Sardinian villagers ( Supplementary Fig. 2 ), GPS correctly placed a quarter of the Sardinians in their village, as well as half within 15 km and 90% of individuals within 100 km of their homes ( Fig. 7 , Supplementary Fig. 3 ). As expected from the high percentages of matrilocal marriages 35 and residence 36 , 37 common to Sardinia, the locations of females were better predicted than those of males, with 30% placed in their exact village of origin compared with 10% of the males.

figure 7

The mean predicted distances (km) from the village of origin are marked by bold (females) and plain (males) circles.

Our findings revealed the Sardinians to have a genetic microheterogeneous structure affected both by altitude and physical location. The prediction accuracy as the distance to the village of origin ( Supplementary Fig. 3 ) are detailed in Supplementary Table 4 . The correlations between altitude and the distance from the village of origin are shown in Supplementary Table 5 . The average predicted distances from the villages roughly corresponded to Sardinian subregions ( Fig. 7 ). Unsurprisingly, the more precise positioning refers to individuals coming from Ogliastra (east Sardinia), since this area is characterized both by high altitude, high endogamy and relative cultural isolation, whereas populations from the western shores are considered to be more admixed. We found a significantly negative correlation between altitude and the predicted distance to villages for males ( N (coastal)=96, r (coastal)=−0.21, Student’s t -test P -value(coastal)=0.019; N (coastal)=27, r (inland)=−0.38, Student’s t -test P -value(inland)=0.024). The results for females were marginally significant for all villages ( N =126, r =−0.14, Student’s t -test P =0.06) and inland villages ( N =29, r =−0.27, Student’s t -test P =0.08), but not for coastal villages ( N =97, r =−0.1, Student’s t -test P =0.14). These results are expected from the high proportion of endogamy (64.1% in plain, 82.8% in mountains) that are correlated with the rise of altitude 35 . This correlation was particularly high in inland compared with coastal villages. Our results not only fit with the genetic and demographic characteristics of Sardinians but also resolve conflicting findings due to the matrilocal matrimonial structure 36 , 38 , 39 , 40 . Finally, because GPS carries a sample-independent analysis, predictions for worldwide individuals were largely identical to those previously reported ( Fig. 3 ) in both analyses.

Comparing the performances of GPS with SPA

The SPA tool explicitly models the spatial distribution of each SNP by assigning an allele frequency as a continuous function in geographic space 12 . SPA can model the spatial structure over a sphere to predict the spatial structure of worldwide populations and was designed to operate in several modes. In one mode, when the geographic origins of the individuals are known or when the geographic origins of some individuals are known, they can be used as training set. Using the later approach, Yang et al . 12 trained the SPA model on 90% of the individuals to predict the locations of the remaining 10%. SPA was reported to predict the geographic origins of individuals of mixed ancestry, which cannot be done with PCA 12 .

When analysing a Europeans-only data set, SPA was successful in assigning close to 50% of the individuals to their correct country of origin. However, when worldwide individuals were analysed, SPA distorted the distances between continents and failed to assign even a single individual to his home country ( Fig. 8a ), with Melanesians being misclassified as Indians 12 being the most obvious example.

figure 8

The mean longitude and latitude for each population were calculated by averaging individual spatial assignments ( N =596). After assigning populations to continental regions, the mean and s.d. were calculated based on the predicted coordinates for each region. Dashed lines mark s.d. ( a ) SPA prediction accuracy for continental regions obtained from Yang et al . 12 results (their supplementary Table 1 12 ). The mean coordinates are marked with a triangle (expected) and square (Predicted by SPA). ( b ) Comparing the results for worldwide populations analysed here for SPA (square), GPS (circle) and for the real coordinates (triangle).

We compared the accuracy of GPS with that of SPA by providing SPA with favourable conditions to its operation. We used the more rigorous application involving a training data set and ran SPA in two steps, as described by Yang et al . 12 . First, we provided SPA with the genotype file of worldwide populations (596 individuals, 127,361 SNPs) with their complete geographic locations ( Supplementary Table 2 ) without any missing data. When executed, SPA produced the model file that would be later used to predict the geographical locations. Next, we provided SPA with the model file and the same genotype file. Because of the absence of missing geographical coordinates, SPA was expected to yield geographic coordinates that closely resemble those it received in the first step. However, SPA failed to assign 98% of individuals to their countries and placed most individuals in oceans or in the wrong continental region ( Fig. 8b ). By comparison, GPS was used in the leave-one-out individual mode, in which the geographic coordinates of the populations in the reference population data set were recalculated without the test individual. GPS accurately assigned nearly all individuals to their continental regions, countries and regional locations with a high degree of accuracy.

We tested SPA with four additional data sets and calculated the assignment accuracy for each one. When providing SPA with the combined data set of worldwide individuals and Southeast Asian and Oceanian individuals (~40,000 SNPs) with their complete geographical coordinates ( Supplementary Table 3 ), we obtained a similar assignment accuracy of 2% for the worldwide individuals, although with different coordinates and an assignment accuracy of 1.5% for the remaining individuals. When testing only Southeast Asian and Oceanian individuals, the assignment accuracy was 4.8%. Unfortunately, we were unable to estimate the prediction accuracy for about 20% of these samples because SPA’s results (e.g., Latitude=−2, Longitude=256) exceeded those of a three-dimensional sphere. Finally, we calculated the assignment accuracy for worldwide individual and Sardinian individuals (~65,000 SNPs), again by providing complete geographical data ( Supplementary Table 4 ). SPA coordinates for worldwide individuals varied from our previous analyses, being accurate for three individuals (0.5%) but completely inaccurate (0%) for Sardinians whether they were tested with the worldwide individuals or separately.

We suspect that the inaccuracy of SPA predictions in the tested mode of operation results from the predictions for test individuals being affected by other individuals in the cohort. As such, it suffers from the same limitations as PCA when analysing a diverse cohort. Even if a single individual from a different continent is included in the data set, SPA’s accuracy drops to 0%. In other words, for SPA to correctly assign every other European individual to his country, the individuals need to be a priori confirmed as Europeans, which makes SPA impractical.

A comparison of the runtime and CPU timings was done on a Linux machine (x86 64) with an Intel(R) Xeon(R) E5430 processor 2.66 GHz CPU and 8 GB memory. The SPA runtime (wall time) was well over 3 h, compared with 6 min for GPS, including the initial step of calculating admixture proportions using ADMIXTURE.

We present a solution to one of the most challenging problems of biogeography: localizing individuals based on their genetic data. Our solution consists of analysing populations genotyped over a relatively small set of AIMs and applying an admixture-based GPS method to predict their origin. We have shown that our approach can predict the geographical origin of worldwide individuals from single resident populations down to the level of island and home village and is more accurate than SPA.

Some of the limitations of SPA and PCA for inferring ancestral origin have been previously noted 12 , 16 . A major limitation of these methods for biogeography is their specificity to relatively homogeneous populations, such as Europeans, and their susceptibility to biases caused by populations of different ancestry, as is the case with real-life data. SPA’s assignment accuracy ranged between 0 and 4.8% with predictions varying over different runs for the same individuals. By averaging over all data sets, we estimated SPA’s assignment accuracy to be 1.5%. These results contradict those reported by Yang et al . 12 for European populations, whose estimation is based on a European-only data set. However, when worldwide populations were used, the results of Yang et al . 12 ( Fig. 8a ) are in agreement with ours ( Fig. 8b ).

By contrast, GPS is a sample-independent method that relies on a fixed set of reference populations to predict the individual’s geographical origin irrespective of the tested cohort. We have shown that GPS successfully localized 83% of worldwide individuals to their country of origin and that its accuracy increased with the addition of more localized and well-annotated populations such as Southeast Asians, Oceanians and Sardinians. The advantage of using reference populations becomes apparent when analysing migratory populations that have relocated yet maintained endogamy. Such populations present a formidable concern to biogeographic methods that rely on the relationships between geography and genetics 5 , 6 , 7 , 8 . In theory, SPA would yield biased results if the immigrant population with its current geographical location would be used for training or be included in the test cohort with populations located nearby its modern-day region. Unfortunately, the low accuracy of SPA prevents us from demonstrating this effect, although it was noted for a similar approach, PCA 12 , 16 .

GPS addresses this concern by using a reference population set that excludes such populations and by adopting a sample-independent approach. The success of this approach was demonstrated with the Kuwaitis, who comprise an amalgam of populations that were predicted to their former origin ( Fig. 2 ). We have further demonstrated the accuracy of our approach when using only 40,000 markers, making GPS applicable to genetic data genotyped on the most common microarrays. Overall, GPS’s accuracy, high sensitivity and specificity, along with its memory efficiency and high speed, make it a powerful tool for biogeography and related scientific fields.

Given these successes, GPS has other potential applications. For example, in genealogical research, it could help adoptees find their home region, while, in forensic research, it could improve the assignment of ethnic (geographic) ancestry to DNA evidence. Although common wisdom in genetics is that ‘more is better,’ we demonstrate that only tens of thousands of AIMs are sufficient to accurately infer biogeography down to the home village, provided that the appropriate samples are available in the reference population data set. We emphasize that GPS may not necessarily yield the most recent region of residency for populations that experienced recent admixture or have recently migrated and maintained a certain degree of endogamy, but rather a historical residency. Therefore, the region of origin can be intuitively interpreted as where the last major admixture took place at the population level since the establishment of the reference populations in their provided geographical location.

We envision that, with time, biogeographical applications will become enhanced for more worldwide communities due to the addition of populations to the reference panel. Therefore, our results should be considered a lower bound to the full potential of GPS for biogeography. We hope that our study will promote new thinking about how population size, genetic diversity and environment have shaped human population structure.

Sample collection and genotyping

Genographic sample collection was conducted according to the ethical protocol of The Genographic Project ( https://genographic.nationalgeographic.com/wp-content/uploads/2012/07/Geno2.0_Ethical-Framework.pdf ), with oversight provided by the University of Pennsylvania and regional IRBs (specified in the original reports from which data analyzed in this study were taken). IRBs were obtained for new collections in Italy, UK, Denmark, Greece, Germany and Romania (sample and data collection were undertaken with approval from the IRB, Comitè Ètic d’Investigació Clínica—Institut Municipal d’Assistència Sanitària (CEIC-IMAS) in Barcelona (2006/2600/I)); Peru (sample and data collection were undertaken with approval from the local IRB at Universidad San Martin de Porres, Lima, Peru; Federal Wide Assurance (FWA) for International Protection of Human Subject 0001532; US Health and Human Services (HHS) International Review Board IRB0000325); Puerto Rico (sample and data collection were undertaken with approval from the University of Pennsylvania IRB #8 and the support of Liga Guakia Taina-Ke); Mexico (sample and data collection were undertaken with approval from the University of Pennsylvania IRB #8, the Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), and the Comision Nacional para el Desarrollo de los Pueblas Indigenas (CDI)); Egypt, Iran, Kuwait, Lebanon and Tunisia (the sample and data collection protocol were originally approved by the IRB committee of the Lebanese American University); North Eurasia (North Eurasia sample and data collection were undertaken with approval from the Ethical Committee of the Research Centre for Medical Genetics RAMS and the Academic Council of the same Research Centre); India (the sample and data collection protocol were originally approved by IRB of Madurai Kamaraj University, Madurai, India). Approval for further sampling, studies and collaborations have been obtained from IRB of Chettinad Academy of Research & Education, Kelampakka, India. Thirteen of the 65 samples studied from Tamil Nadu have already been investigated for NRY chromosomes 41 ( Supplementary Table S3 41 )). In South Africa, sample and data collection was undertaken with approval from the Human Research Ethics Committee (University of the Witwatersrand, South Africa, Protocol Number M120151) under the auspices of grants from the South African Medical Research Council to HS. Data for three Italian samples were undertaken with approval of the Ethics Committee for Clinical trials of the University of Pisa (N. 3803). In Oceanian, the Nasioi samples were obtained from the HGDP-CEPH repository, whereas the Papuan and Vanuatu samples were previously published 42 , 43 . All participants provided written informed consent for the use of their DNA in genetic studies. Samples were collected under the same criteria requiring unrelated individuals with four grandparents from their population affiliation and geographic region of origin ( Supplementary Table 3 ). We also genotyped nine populations from the 1000 Genomes Project, including Mexican-Americans, African-Americans, Peruvians from Lima, Finns, Yoruba, Luhya from Kenya, Han Chinese, Japanese and Kinh from Vietnam.

Overall, we sampled 615 unrelated individuals representing 98 worldwide populations and subpopulations with ~15 samples per population. Samples were genotyped on the GenoChip array, an Illumina HD iSelect genotyping bead array dedicated solely for genetic anthropology and lacking medically relevant markers. The GenoChip includes nearly 150,000 highly informative Y-chromosomal, mitochondrial, autosomal and X-chromosomal markers, of which only autosomal markers (~130,000) were used.

We obtained the HGDP data set available at ftp://ftp.cephb.fr/hgdp supp10/ 44 and the geographical coordinates for each sample 45 . From the 828 samples reported to include no outliers using the filtering procedure used by Patterson and colleagues 44 , we excluded 201 samples for which no clear geographical origin was described or that their populations were already included in our reference population panel ( Supplementary Data 1 ). Because of the size of China and the high representation of Chinese populations in the HGDP data set, we excluded only the Han Chinese.

Assessing data quality

Data quality was achieved by applying two criteria to the SNP data. The first was low missingness rate (<5%), calculated as the average number of null genotypes over all samples in a population. In addition, individuals that exhibit distinct admixture proportions compared with samples of the same populations were considered outliers and omitted. Overall, we omitted 2,423 SNPs and seven Genographic samples from the analysis.

Generating putative ancestral populations

To infer the putative ancestral populations, we applied ADMIXTURE 46 in an unsupervised mode to the filtered data set. This analysis uses a maximum likelihood approach to determine the admixture proportions of the individuals in question assuming they emerged from K hypothetical populations. We speculated that our method will be the most accurate when populations have uniform admixture assignments. In choosing the value of K that seemed to best satisfy this condition, we experimented with different K s ranging from 6 to 12. We identified a substructure at K =10 in which populations appeared homogeneous in their admixture composition. Higher values of K yielded noise that appeared as ancestry shared by very few individuals within the same populations. ADMIXTURE outputs the speculated allele frequencies of each SNP for each hypothetical population.

Using these data, we simulated 15 samples for each hypothetical population and plotted them in a PCA analysis with the Genographic populations. We observed that two hypothetical populations were markedly close to one another, suggesting they share the same ancestry and eliminated one of them to avoid redundancy. The remaining nine populations were considered the putative ancestral populations and were used in all further analyses.

Creating a reference population data set

To infer the geographical coordinates (latitude and longitude) of an individual given his K admixture frequencies, GPS requires a reference population set of N populations with both K admixture frequencies and two geographical coordinates (longitude and latitude). We omitted four populations for which we had no clear geographical data, were recently admixed populations or had a recent migratory history and did not maintain strict endogamy (African and Mexican-Americans, Brahmin Indians and Romanian Gypsies), as they were reported to deviate from the established relationship between genetic and geographic distances 7 and cannot be expected to be predicted to their present day origin correctly. The final data set consisted of 596 unrelated individuals representing 94 worldwide populations and subpopulations with an average of 17 individuals per population and at least two individuals per subpopulation ( Supplementary Table 3 ). These populations were considered hereafter as reference populations, since their admixture frequencies were calculated by applying ADMIXTURE in a supervised mode with the nine putative ancestral populations ( Fig. 1 ).

Although few populations included detailed annotation for subpopulations within the parental populations (for example, Tuscany and Sardinian Italians), such annotation was unavailable for most populations, even though they exhibited fragmented subpopulation structure. This heterogeneity is expected given the young age of some of the populations. For example, using an internal data set, we were able to verify that the observed substructure among Germans was due to individuals from both West and East Germany.

A combination of several criteria was employed to determine the number of subpopulations present within the study populations. Let N a denote the number of samples per population a ; if N a was less than three, the population was left unchanged. For other populations, we used k -means clustering routine in R . Let X ij be the admixture proportion of individual i in component j . For each population, we ran k -means clustering for k ∈ [2,4], using N a × 9 matrix of admixture proportions ( X ij ) as input. At each iteration, we calculated the ratio of the sum of squares between groups and the total sum of squares. If this ratio was >0.9, then we accepted the k -component model. Since k -means clustering cannot be implemented for k =1, to decide between two clusters or a possible single cluster, we also calculated Kullback-Leibler distance (KLD) between the k =2 and k =1 models. If the KLD <0.1 and the ratio of the sum of squares between groups and the total sum of squares for two-component model is above 0.9, then the k =1 model was selected because, in such cases, there are no subgroups in the population. Of the 216 subclusters detected, we excluded all the subclusters that contained only one individual. Overall, we included 146 subpopulations in our reference population set and determined the Mean admixture coefficients for each subpopulation.

Calculating the relationship between admixture and geography

figure 9

A loess distribution fitting is shown in red line with blue bar marking the limit of the linear fitting.

Using linear regression and filtering out geographic distances above 4,000 miles, we determined intercept (β) and slope (α) for the relationship between genetic (Δ GEN ) and geographic distances (Δ GEO ):

The relationship between genetics and geography obtained using Equation 1 varied across continents. Although this may be a true effect, it is more likely to be an artefact due to low sample density for non-European populations. Because we analyze geographically local population structure, we applied the intercept and slope calculated for Europeans as β=38.7 and α=2,523 for non-European populations for which sampling density was lower and thus local regression is less reliable. This regression equation should be refined with the addition of new population samples.

Calculating the biogeographical origin of a test sample

Intuitively, the predicted position of a test sample can be thought of as being determined by M nearest reference populations, each ‘pulling’ in its direction with a strength proportional to its genetic distance from the test sample. The shorter the genetic distance to the reference population, the smaller is the assumed genetic difference and the higher is the pull. We thus ‘anchored’ the centre at the nearest reference population and represented the influence of the remaining M -1 nearest reference populations using a linear combination of two-dimensional vectors with the length of each vector inversely proportional to the genetic distance between the test sample and the reference populations. The direction of these vectors is calculated using the difference between the coordinate of the best matching reference population to those of all other M -1 nearest reference populations. Therefore, GPS best predicts populations that are geographically surrounded by the reference populations.

Estimating the accuracy of GPS

Different measures of assignment accuracy have been proposed in published studies. For example, Novembre et al . 11 estimated the distance from the country of origins, whereas Yang et al . 12 used a more stringent criterion of defining success as assigning individuals to their country of origin. To compare GPS’s accuracy to those of SPA 12 , we adopted the more stringent criteria of Yang et al . 12 . Assignment accuracy for subpopulations was based on the political and municipal boundaries of the regional locations.

To estimate GPS’s assignment accuracy, we utilized the ‘leave-one-out’ approach at the individual level. In brief, we excluded each reference individual from the data set, recalculated the mean admixture proportions of its reference population, predicted its biogeography, tested whether it is within the geographic regions of the reported origin and then computed the mean accuracy per population. More specifically, we assumed that our individual is the j th sample from the i th population that consists of n i individuals. For all populations, excluding the individual in question, the average admixture proportion and geographical coordinates were calculated as:

where θ m,s is the parameter vector for the s th individual from the m th population, and n m is the size of the m th population. For the i th population the adjusted average will be

Last, we computed the best-matching population for each ‘left-out’ individual and calculated the proportion of individuals that are mapped to the correct continental region, country and, if applicable, regional locations for each population. In a similar manner, we utilized the ‘leave-one-out’ approach at the population level. That is, we excluded one population at a time from the reference population set, predicting the biogeography of the samples of that population, calculating their predicted distance from the true origin and computing the mean accuracy per continental populations and all samples.

To calculate GPS accuracy for the HGDP samples, we followed the procedure described in the above section and calculated the distances from the predicted to the true origin of each individual. To evaluate the sensitivity and specificity per population, we counted the number of samples of the population in the country of interest predicted to their true country as true positives, the number of samples from other countries assigned to the country of interest as false positives, the number of samples from the country of interest assigned to other countries as false negatives and finally the number of samples from other countries that were not assigned to the country of interest as true negatives.

Applying GPS to Southeast Asia and Oceania populations

A data set of 243 individuals genotyped over 350,000 markers was obtained with permission from Reich et al . 48 Admixture proportions of individuals were calculated by applying ADMIXTURE in a supervised mode with the nine putative ancestral populations on the ~40,000 autosomal markers that overlapped with the GenoChip markers ( Supplementary Figure 1 ).

The mean admixture proportions of these populations and their geographical coordinates were added to our reference population data set. In addition, we analysed 88 Han Chinese (CHB) and 87 Japanese (JPT) populations from this data set, even though they were already included in the worldwide data set and calculated their admixture in a similar manner. The admixture proportions of the CHB and JPT were similar to those reported in Fig. 1 .

Applying GPS to Sardinian populations

Genotype data of 290 Sardinian individuals from 28 villages genotyped on an Affymetrix array with nearly 700,000 markers was obtained with permission from Piras et al . 40 Each individual had four grandparents living in the same village. Genotype data were from four subregions: Ogliastra, Trexenta, Sulcis and Campidano. In particular, Ogliastra is a mountainous area characterized as a genetic isolate differentiated from the rest of the island and constituted by different villages with high endogamy, low immigration and high genetic differentiation. Conflicting results from studies that have investigated the internal genetic structure of different macroareas 38 , 39 , 40 suggest that the internal heterogeneity among the macroareas is limited to particular areas.

After filtering out villages with insufficient data, ten villages with 249 individuals remained ( Supplementary Table 4 ). We obtained admixture proportions for the remaining individuals by applying ADMIXTURE in a supervised mode with the nine putative ancestral populations on the ~65,000 autosomal markers that overlapped GenoChip’s markers ( Supplementary Fig. 2 ). The mean admixture frequencies of these ten Sardinian populations and their geographical coordinates were added to our reference population data set.

GPS availability

Data, GPS R code and GPS on-line calculator are available on http://chcb.saban-chla.usc.edu/gps/ . GPS code can be found in the Supplementary Note . SPA’s code for Linux was obtained from the author’s website: http://genetics.cs.ucla.edu/spa/binary/linux.zip .

Additional information

How to cite this article: Elhaik, E. et al . Geographic population structure analysis of worldwide human populations infers their biogeographical origins. Nat. Commun. 5:3513 doi: 10.1038/ncomms4513 (2014).

Change history

31 october 2016.

A correction has been published and is appended to both the HTML and PDF versions of this paper. The error has not been fixed in the paper.

Tishkoff, S. A. & Kidd, K. K. Implications of biogeography of human populations for ‘race’ and medicine. Nat. Genet. 36 , S21–S27 (2004).

Article   CAS   Google Scholar  

Harcourt, A. H. Human Biogeography University of California Press (2012).

Darwin, C. The Descent of Man and Selection in Relation to Sex John Murray London (1871).

Rowe, J. H. The Renaissance Foundations of Anthropology. American Anthropologist 67 , 1–20 (1965).

Article   Google Scholar  

Cavalli-Sforza, L. L. L., Menozzi, P. & Piazza, A. The History and Geography of Human Genes Princeton university press (1994).

Eller, E. Population substructure and isolation by distance in three continental regions. Am. J. Phys. Anthropol. 108 , 147–159 (1999).

Relethford, J. H. Global analysis of regional differences in craniometric diversity and population substructure. Hum. Biol. 73 , 629–636 (2001).

Ramachandran, S. et al. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc. Natl Acad. Sci. USA 102 , 15942–15947 (2005).

Article   CAS   ADS   Google Scholar  

Reich, D., Thangaraj, K., Patterson, N., Price, A. L. & Singh, L. Reconstructing Indian population history. Nature 461 , 489–494 (2009).

Miller, W. et al. Genetic diversity and population structure of the endangered marsupial Sarcophilus harrisii (Tasmanian devil). Proc. Natl Acad. Sci. USA 108 , 12348–12353 (2011).

Novembre, J. et al. Genes mirror geography within Europe. Nature 456 , 98–101 (2008).

Yang, W. Y., Novembre, J., Eskin, E. & Halperin, E. A model-based approach for analysis of spatial structure in genetic data. Nat. Genet. 44 , 725–731 (2012).

Shriver, M. D. et al. Skin pigmentation, biogeographical ancestry and admixture mapping. Hum. Genet. 112 , 387–399 (2003).

PubMed   Google Scholar  

Nelson, M. R. et al. The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research. Am. J. Hum. Genet. 83 , 347–358 (2008).

Frichot, E., Schoville, S. D., Bouchard, G. & Francois, O. Correcting principal component maps for effects of spatial autocorrelation in population genetic data. Frontiers in Genet. 3 , 254 (2012).

Elhaik, E. The missing link of jewish european ancestry: contrasting the Rhineland and the Khazarian hypotheses. Genome Biol. Evol. 5 , 61–74 (2013).

Elhaik, E. et al. The GenoChip: a new tool for genetic anthropology. Genome Biol. Evol. 5 , 1021–1031 (2013).

Albrechtsen, A., Nielsen, F. C. & Nielsen, R. Ascertainment biases in SNP chips affect measures of population divergence. Mol. Biol. Evol. 27 , 2534–2547 (2010).

Li, J. Z. et al. Worldwide human relationships inferred from genome-wide patterns of variation. Science 319 , 1100–1104 (2008).

Schlebusch, C. M. et al. Genomic variation in seven Khoe-San groups reveals adaptation and complex African history. Science 338 , 374–379 (2012).

Wollstein, A. et al. Demographic history of Oceania inferred from genome-wide data. Curr. Biol. 20 , 1983–1992 (2010).

Alsmadi, O. et al. Genetic substructure of Kuwaiti population reveals migration history. PLoS One 8 , e74913 (2013).

Price, A. L. et al. A genomewide admixture map for Latino populations. Am. J. Hum. Genet. 80 , 1024–1036 (2007).

Shlush, L. I. et al. The Druze: a population genetic refugium of the Near East. PLoS One 3 , e2105 (2008).

Article   ADS   Google Scholar  

Rosenberg, N. A. et al. Genetic structure of human populations. Science 298 , 2381–2385 (2002).

McVean, G. A genealogical interpretation of principal components analysis. PLoS Genet. 5 , e1000686 (2009).

O’Dushlaine, C. et al. Genes predict village of origin in rural Europe. Eur. J. Hum. Genet. 18 , 1269–1270 (2010).

Kayser, M. et al. Melanesian and Asian origins of Polynesians: mtDNA and Y chromosome gradients across the Pacific. Mol. Biol. Evol. 23 , 2234–2244 (2006).

Kayser, M. et al. Melanesian origin of Polynesian Y chromosomes. Curr. Biol. 10 , 1237–1246 (2000).

Kayser, M. et al. Genome-wide analysis indicates more Asian than Melanesian ancestry of Polynesians. Am. J. Hum. Genet. 82 , 194–198 (2008).

Calò, C., Melis, A., Vona, G. & Piras, I. Sardinian population (Italy): a genetic review. Int.l J. Mod. Anthropol. 1 , 39–64 (2008).

Google Scholar  

Angius, A. et al. Not all isolates are equal: linkage disequilibrium analysis on Xq13.3 reveals different patterns in Sardinian sub-populations. Hum. Genet. 111 , 9–15 (2002).

Cappello, N. et al. Genetic analysis of Sardinia: I. data on 12 polymorphisms in 21 linguistic domains. Ann. Hum. Genet. 60 , 125–141 (1996).

Pistis, G. et al. High differentiation among eight villages in a secluded area of Sardinia revealed by genome-wide high density snps analysis. PLoS One 4 , e4654 (2009).

Sanna, E., Iovine, M. C. & Floris, G. Evolution of marital structure in 20 Sardinian villages from 1800 to 1974. Anthropol. Anz. 62 , 169–184 (2004).

Oppo, A. In Storia Della Famiglia Italiana Barbagli e M., Kertzer (a cura di) D. edBologna (1992).

Murru Corriga, G. Il cognomen della madre. Tendenze matrilineari nella parentela in Sardegna (XVI-XVIII sec.). Antropologia Contemporanea 18 , 47–66 (1995).

Contu, D. et al. Y-chromosome based evidence for pre-neolithic origin of the genetically homogeneous but diverse Sardinian population: inference for association scans. PLoS One 3 , e1430 (2008).

Lampis, R. et al. The inter-regional distribution of HLA class II haplotypes indicates the suitability of the Sardinian population for case-control association studies in complex diseases. Hum. Mol. Genet. 9 , 2959–2965 (2000).

Piras, I. S. et al. Genome-wide scan with nearly 700,000 SNPs in two Sardinian sub-populations suggests some regions as candidate targets for positive selection. Eur. J. Hum. Genet. 20 , 1155–1161 (2012).

ArunKumar, G. et al. Population Differentiation of Southern Indian Male Lineages Correlates with Agricultural Expansions Predating the Caste System. PLoS One 7 , e50269 (2012).

Hammer, M. F. A recent insertion of an alu element on the Y chromosome is a useful marker for human population studies. Mol. Biol. Evol. 11 , 749–761 (1994).

CAS   PubMed   Google Scholar  

Hammer, M. F. et al. Out of Africa and back again: nested cladistic analysis of human Y chromosome variation. Mol. Biol. Evol. 15 , 427–441 (1998).

Patterson, N. J. et al. Ancient Admixture in Human History. Genetics 192 , (3): 1065–1093 (2012).

Rosenberg, N. A. et al. Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet. 1 , e70 (2005).

Alexander, D. H. & Lange, K. Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics 12 , 246 (2011).

Gellert, W., Gottwald, S., Hellwich, M., Kästner, H. & Küstner, H. The VNR Concise Encyclopedia of Mathematics , 2nd edn, ch. 12 (Van Nostrand Reinhold: New York, 1989.

Reich, D. et al. Denisova admixture and the first modern human dispersals into Southeast Asia and Oceania. Am. J. Hum. Genet. 89 , 516–528 (2011).

Download references

Acknowledgements

E.E is supported in part by Genographic grant GP 01‐12. L.P, C.T.S and Y.X were supported by The Wellcome Trust (098051). O.B. was supported in part by Presidium RAS (MCB programme) and RFBR (13-04-01711). T.T. was supported by grants from The National Institute for General Medical Studies (GM068968), and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (HD070996). S.T. is supported by a PRIN2009 grant. The Genographic Project is supported by the National Geographic Society IBM and the Waitt Foundation. We are grateful to all Genographic participants who contributed their DNA samples for this study.

Author information

Eran Elhaik, Tatiana Tatarinova and Colin Renfrew: These authors contributed equally to this work

Authors and Affiliations

Department of Animal and Plant Sciences, University of Sheffield, Western Bank, S10 2TN, Sheffield, UK

  • Eran Elhaik

Department of Mental Health, Johns Hopkins University Bloomberg School of Public Health, 615 N. Wolfe Street, Baltimore, 21205, Maryland, USA

Department of Pediatrics, Keck School of Medicine and Children’s Hospital Los Angeles, University of Southern California, 4650 Sunset Blvd, Los Angeles, 90027, California, USA

Tatiana Tatarinova

T.T. Chang Genetic Resources Center, International Rice Research Institute, Los Baños, Laguna, Philippines

Dmitri Chebotarev

Department of Sciences of Life and Environment, University of Cagliari, SS 554, Monserrato, 09042, Italy

Ignazio S. Piras, Carla Maria Calò & Francesco Cucca

Research Laboratories, bcs Biotech S.r.l., Viale Monastir 112, Cagliari, 09122, Italy

Antonella De Montis, Manuela Atzori & Monica Marini

Department of Biology, University of Pisa, Via Ghini 13, Pisa, 56126, Italy

Sergio Tofanelli

Department of Science of Nature and Territory, University of Sassari, Località Piandanna, 07100, Italy

Paolo Francalacci

The Wellcome Trust Sanger Institute, Hinxton, CB10 1SA, UK

Luca Pagani, Chris Tyler-Smith & Yali Xue

Department of Anthropology, University of Pennsylvania, Philadelphia, 19104, Pennsylvania, USA

Theodore G. Schurr, Jill B. Gaieski, Carlalynne Melendez, Miguel G. Vilar & Amanda C. Owings

Departamento de Toxicología, Cinvestav, San Pedro Zacatenco, CP 07360, Mexico

Rocío Gómez

Instituto de Genética y Biología Molecular, University of San Martin de Porres, Lima, Peru

Ricardo Fujita, Oscar Acosta & Jose Raul Sandoval

Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, CEP, 31270-901, Brazil

Fabrício R. Santos & Daniela R. Lacerda

Institut de Biologia Evolutiva (CSIC-UPF), Departament de Ciences de la Salut i de la Vida, Universitat Pompeu Fabra, Barcelona, 08003, Spain

David Comas, Jaume Bertranpetit & Begoña Martínez-Cruz

Vavilov Institute for General Genetics, Moscow, 119991, Russia

Oleg Balanovsky

Research Centre for Medical Genetics, Moscow, 115478, Russia

Oleg Balanovsky & Elena Balanovska

The Lebanese American University, Chouran, 1102 2801, Beirut, Lebanon

Pierre Zalloua & Marc Haber

National Health Laboratory Service, Sandringham 2131, Johannesburg, South Africa

Himla Soodyall

The Genographic Laboratory, School of Biological Sciences, Madurai Kamaraj University, Madurai, 625 021, Tamil Nadu, India

Ramasamy Pitchappan, ArunKumar GaneshPrasad & Arun Varatharajan Santhakumari

Department of ecology and evolutionary biology, University of Arizona, Tucson, 85721, Arizona, USA

Michael Hammer, Matthew E. Kaplan & Nirav C. Merchant

Department of Anatomy, University of Otago, Dunedin, 9054, New Zealand

Lisa Matisoo-Smith & Andrew C. Clarke

National Geographic Society, Washington, 20036, District of Columbia, USA

R. Spencer Wells & David F. Soria Hernanz

Applied Biosystems, Foster City, California 94494, USA.,

Syama Adhikarla & Janet S. Ziegle

The Australian Centre for Ancient DNA, School of Earth and Environmental Sciences, University of Adelaide, Adelaide, South Australia 5005, Australia,

Christina J. Adler, Alan Cooper, Clio S. I. Der Sarkissian & Wolfgang Haak

Department of Genetics, School of Molecular Sciences, La Trobe University, Melbourne, Victoria 3086, Australia,

Li Jin & John R. Mitchell

Department of Pathology, Fudan University, Shanghai 200433, China,

Hui Li & Shilin Li

IBM, Somers, New York 10589, USA,

Laxmi Parida, Daniel E. Platt & Pandikumar Swamikrishnan

Institut Pasteur, Unit of Evolutionary Genetics, 75015 Paris, France,

Lluis Quintana-Murci & Ajay K. Royyuru

McDonald Institute for Archaeological Research, University of Cambridge, Cambridge, CB2 3ER, UK,

Colin Renfrew

You can also search for this author in PubMed   Google Scholar

  • Oscar Acosta
  • , Syama Adhikarla
  • , Christina J. Adler
  • , Jaume Bertranpetit
  • , Andrew C. Clarke
  • , Alan Cooper
  • , Clio S. I. Der Sarkissian
  • , Wolfgang Haak
  • , Marc Haber
  • , Matthew E. Kaplan
  • , Shilin Li
  • , Begoña Martínez-Cruz
  • , Nirav C. Merchant
  • , John R. Mitchell
  • , Laxmi Parida
  • , Daniel E. Platt
  • , Lluis Quintana-Murci
  • , Colin Renfrew
  • , Daniela R. Lacerda
  • , Ajay K. Royyuru
  • , Jose Raul Sandoval
  • , Arun Varatharajan Santhakumari
  • , David F. Soria Hernanz
  • , Pandikumar Swamikrishnan
  •  & Janet S. Ziegle

Contributions

E.E. and T.T. conceived and conducted the experiments, E.E. and T.T. designed and carried out the data analysis and cowrote the paper together with L.M.S, I.L.S, C.M.C, A.D.M, M.A, M.M., and T.G.S. All other coauthors were involved in sample and data collection and provided helpful feedback for the paper.

Corresponding author

Correspondence to Eran Elhaik .

Ethics declarations

Competing interests.

The authors declare no competing financial interests.

Lists of participants and their affiliations appear at the end of the paper.

Supplementary information

Supplementary figures, tables, note and reference.

Supplementary Figures 1-3, Supplementary Tables 1-5, Supplementary Note 1 and Supplementary References (PDF 516 kb)

Supplementary Data 1

Predicted population and distance from true origin for 627 HGDP samples. (XLSX 32 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/

Reprints and permissions

About this article

Cite this article.

Elhaik, E., Tatarinova, T., Chebotarev, D. et al. Geographic population structure analysis of worldwide human populations infers their biogeographical origins. Nat Commun 5 , 3513 (2014). https://doi.org/10.1038/ncomms4513

Download citation

Received : 17 April 2013

Accepted : 26 February 2014

Published : 29 April 2014

DOI : https://doi.org/10.1038/ncomms4513

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

This article is cited by

Multivariate statistical approach and machine learning for the evaluation of biogeographical ancestry inference in the forensic field.

  • Eugenio Alladio
  • Brando Poggiali
  • Elena Pilli

Scientific Reports (2022)

The impact of cross-kingdom molecular forensics on genetic privacy

  • Sofia Ahsanuddin
  • Christopher E. Mason

Microbiome (2021)

The story of the lost twins: decoding the genetic identities of the Kumhar and Kurcha populations from the Indian subcontinent

  • Ranajit Das
  • Vladimir A. Ivanisenko
  • Priyanka Upadhyai

BMC Genetics (2020)

Genetic ancestry, skin pigmentation, and the risk of cutaneous squamous cell carcinoma in Hispanic/Latino and non-Hispanic white populations

  • Eric Jorgenson
  • Hélène Choquet
  • Maryam M. Asgari

Communications Biology (2020)

Differentiation of Hispanic biogeographic ancestry with 80 ancestry informative markers

  • Casandra H. Setser
  • John V. Planz
  • Deanna S. Cross

Scientific Reports (2020)

By submitting a comment you agree to abide by our Terms and Community Guidelines . If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

research reference population

Establishing a Reference Population

  • First Online: 26 October 2013

Cite this chapter

research reference population

  • Isabelle Séguy 5 &
  • Luc Buchet 6  

Part of the book series: INED Population Studies ((INPS,volume 2))

580 Accesses

1 Citations

The grounding of our approach must be broad and stable, so that future research can build upon commonly recognized components. To that end, our work must comply with a few essential rules that are outlined below.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
  • Durable hardcover edition

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Our warmest thanks to Claude Masset for making all the data available to us.

A 3-year period was also tried, but this still exhibited artificial irregularities (in annual age classes). However, after placing the individuals in 5-year age classes, the results obtained were close to those given above.

16 women and 42 men.

Yann Ardagna presents a comprehensive inventory in his thesis (defended 2004, Marseille): “The conservation of biological archives and associated documents in biological anthropology. Applications for French and Hungarian anthropological collections”.

Anthropological study: Luc Buchet and Marième Bouali (October 2006). This series will be called the “Museum collection” when it is distinguished from those of Ferraz de Macedo and the University of Coimbra.

We wish to express our warmest gratitude to Maria da Graça Ramalhinho, director of the Museu Nacional de História Natural (MNHN) and Hugo Cardoso, anthropologist at the Museu Bocage, MNHN, for their help and hospitality during our stay in Lisbon.

This later obliteration of female sutures was noted by Mihály Lenhossék in 1917 (cited by Abdelhamid Grait in his biological and medical science dissertation: “Determination of age at death from the postcranial skeleton”, University of Lyon I, 2006, http://anthropologie-et-palaeopathologie.univ-lyon1.fr ).

Since Ferraz de Macedo’s collection has been totally destroyed, the sutures can no longer be re-interpreted to determine with certainty whether a specific male suture closure pattern exists.

We looked for a distribution presenting a ratio between successive coefficients very similar to that observed for women (i.e. ≈ 0.62).

Source for the 1890 census: Censo da População do Reino de Portugal no 1 de Dezembro de 1890. Volume II. Lisboa, Direcção da Estatística Geral e Comércio, 1896 (INED shelfmark: S2Q 1890/2). The age pyramid, established for 1 December 1890, was extrapolated back to 1 July 1889, the nearest date to the death of the individuals under study.

Source for the 1889 death records: Movimento da População. Terceiro ano 1889–1890. Lisboa, Ministério das Finanças, Direcção Geral da Estatística, 1892 (INED shelfmark: S3 Q 1889–1890).

This is true for late nineteenth-century Lisbon, but is also more generally applicable to pre-industrial populations. For example, the proportions of deaths before and after age 50 in France in 1770–1779 were 35 % and 65 % (based on Blayo 1975 ).

In the original Masset collection, 59 % of all women and 65 % of men were aged under 50.

Number of men per 100 women.

The joint between the sphenoid and occipital bones in the skull base.

They can always be brought back in later, either with the 20–29 age group, as anthropologists do, or with younger groups, as demographers do (15–19 age group).

The chi-squared test comparing the two distributions is non-significant at 2.5 % level for both sexes combined, at 1 % for men and at 1.2 % for women. The observed mortality distribution for Lisbon does not significantly differ, therefore, from that defined by the pre-industrial mortality standard. Female mortality in Lisbon between ages 25 and 45 is lower than that in the Pre-industrial Standard. In the light of the remarks we have made concerning the male sample, this under-estimation of female mortality is highly plausible given the biases in late nineteenth-century Lisbon’s statistical data (erroneous age declarations, under-recording of certain population categories in censuses, inaccuracies in 10-year age groups for counting annual deaths).

At higher ages, women are more numerous than men.

The multiplier α = 521/473.

Each suture segment is graded from 0 to 4. Divisions of 10th or even 50th may be used to calculate the coefficient. For the sake of convenience, we have multiplied these values by 10.

Although it is essential to adapt a reference population to the pre-industrial model (P Lisbon1889 and below, P Maubuisson and P Antibes1890 ) when using the probability vector method, this is no longer the case when using estimation methods based on a constant distribution of stages by age group (see Chaps. 12 and 13 ). Nevertheless, careful thought must be given to the way reference collections are constituted before attempting to estimate the age at death of a buried population. Whatever methodology is used, the quality of the estimates depends on the representativeness of the biological characteristics observed in the reference collection (of known age and sex).

Emergence is complete when the tooth breaks through the gum.

Our warmest thanks go to all those who have forwarded to us the orthopantomograms necessary for establishing the reference population, friends, colleagues and practitioners, working in the Alpes-Maritimes département . In this last group, the following deserve special mention: Drs Kamilla-André and Terrasson (Cagnes-sur-Mer), Chaussy, Lachaud and R蝐cker (Cannes), Savoye (Le Cannet), Bougues (Marmande), Dossios, Favot, Jasmin, Mahler, Millet and Raybaud (Nice), and Alibert (Sophia Antipolis)

We are very grateful for the invaluable help provided by Eve-Line Boulle and MariÒme Bouali for data entry, Magali Belaigues-Rossard, Nicolas Lannoy and Magali Sucheki for statistical analysis. We also warmly acknowledge Arnaud BringÕ, whose initial results were published in 2005 and 2006, for his close collaboration on this study, (Buchet et al. 2005 , 2006c ).

In all, 715 dental x-rays of children aged 2–18 were analysed. We excluded from the sample the 17 individuals aged 18–20, in line with our biological distinction between juveniles and adults. A certain number of x-rays of children aged 2–17 were withdrawn (21 in all), usually where there had been one or more therapeutic extractions or where the extent of mineralisation diverged too far from the average distribution (recording errors or pathology).

The Roman series (second-early third century AD) came from the Isola Sacra necropolis, 23 km west of Rome. It comprised some 2,000 individuals (of whom 800 children) of both sexes and all ages. The nineteenth-century series belongs to St Thomas’s Anglican Church, Belleville, Ontario. It comprises 1,564 skeletons, of whom 282 children under 15. Of these, 229 were sufficiently well preserved for dental examination.

A team of dentists and biostatisticians (Parner et al. 2001 ) investigated any possible trend over time in the eruption of permanent teeth by analysing two samples of Danish schoolchildren from 1969 to 1982. They observed a slight but statistically significant increase in mean age for both sexes and all teeth (with 95 % CI: 1.5 days per year for boys and 2.6 for girls). However, the interval between the two samples was short and the public health conditions in the years concerned cannot be compared to those of pre-industrial populations.

Excluding the third molar, or wisdom tooth, because of the wide variation in its eruption and mineralisation.

Since mineralisation occurs symmetrically (0.92 > r > 1) for permanent and deciduous teeth, either the left or right jaw can be used.

There are special places for infant graves, but archaeological digs do not always have the time, or luck, to reach them.

Average age at skeletal maturity is indicated both by the fusion of the sphenoid and occipital and acceleration of the process of epiphyseal closure that marks the end of growth in a skeleton. Various authors place the age of maturity between 18 and 20 for the skull and 20 and 25 for the post-cranium (for example, the proximal humeral epiphysis is fused by age 25, like the distal epiphyses of the radius and ulna).

Alduc-Le Bagousse, A. (1988). Estimation de l’âge dentaire des non-adultes: maturation dentaire et croissance osseuse. Données comparatives de deux nécropoles bas-normandes. In L. Buchet (Ed.), Anthropologie et histoire ou anthropologie historique? (coll. Notes et monographies techniques, Vol. 24, pp. 81–95). Paris: CNRS.

Google Scholar  

Ardagna, Y. (2004). “La conservation des archives biologiques et des documents associés en anthropologie biologique. Applications à des collections anthropologiques françaises et hongroises”. PhD thesis from Université de la Méditerranée-Aix-Marseille II, Faculté de médecine, and the University of Szeged.

Baud, C. A., & Gossi, M. (1980). “Degree of mineralization of bone tissue as determined by quantitative micro-radiography: Effect of age, sex and pathological conditions”. In Proceedings of fourth international conference on bone measurement (pp. 345–351).

Blaizot, F., Alix, G., & Ferber, E. (2003). Le traitement funéraire des enfants décédés avant un an dans l'antiquité : Études de cas. Bulletins et Mémoires de la Société d’anthropologie de Paris, 15 (1–2), 49–77.

Blayo, Y. (1975). La mortalité en France de 1740 à 1829. Population, 30 , 123–142. Démographie historique.

Article   Google Scholar  

Bocquet-Appel, J.-P. (1977b). Perspectives paléodémographiques, thèse d’Anthropologie historique . Paris: Éditions de l’EHESS.

Bocquet-Appel, J.-P., & Masset, C. (1995). L’âge au décès dans les populations inhumées: comparaison de méthodes et de résultats. Antropologia Portuguesa, 13 , 39–48.

Bocquet-Appel, J.-P., & Xavier de Morais, M.-H. (1987). Anthropologie et histoire. Un essai de reconstitution de la variation biologique de la population portugaise au xix e siècle (Fontes documentais portuguesas, Vol. XXII). Paris: Fondation Calouste, Gulbenkian, Centre culturel portugais.

Bocquet-Appel, J.-P., Maia Neto, A., Tavares Da Rocha, A., & Xavier de Morais, H. (1978). Estimation de l’âge au décès des squelettes d’adultes par régressions multiples. Contribuições para o Estudo da Antropologia Portuguesa, 10 (3), 107–167.

Buchet, L., Séguy, I. et al. (2005). “L’âge au décès des enfants: nouvelles approches paléodémographiques. L’Homme et ses images”. Actes du XXV e colloque du Groupe des anthropologistes de langue française (Marseille, July 2001), CD-ROM, UMR 6578, CNRS-Faculté de médecine de Marseille.

Cardoso, H. F. V. (2006). Brief communication: The collection of identified human skeletons housed at the Bocage museum (National Museum of Natural History). Lisbon, Portugal. American Journal of Physical Anthropology, 129 , 173–176.

Fazekas, I. G., & Kósa, F. (1978). Forensic fetal osteology . Budapest: Akadémiai Kiadó.

Guy, H., & Masset, C. (1997). Particularités taphonomiques des os d’enfants. In L. Buchet (Ed.), L’enfant, son corps, son histoire (Actes des 7 es Journées anthropologiques de Valbonne, juin 1994) (pp. 35–43). Antibes: APDCA.

Lalys, L. (2002). “ Biométrie radiologique de l’avant-bras et de la main, appliquée à l’estimation de l’âge des individus immatures ”. PhD thesis, Université de la Méditerranée-Aix-Marseille II, Faculté de médecine.

Masset, C. (1982). “Estimation de l’âge au décès par les sutures crâniennes”. PhD thesis, université Paris Diderot Paris 7.

Moorrees, C. F. A., Fanning, E. A., & Hunt, E. E., Jr. (1963a). Formation and resorption of three deciduous teeth in children. American Journal of Physical Anthropology, 21 , 205–213.

Moorrees, C. F. A., Fanning, E. A., & Hunt, E. E., Jr. (1963b). Age variation of formation stages for ten permanent teeth. Journal of Dental Research, 42 (6), 1490–1502.

Parner, E. T., Heidmann, J. M., Væth, M., & Poulsen, S. (2001). A longitudinal study of time trends in the eruption of permanent teeth in Danish children. Archives of Oral Biology, 46 (5), 425–431.

Saunders, S. R., Hoppa, R. D., Macchiarelli, M., & Bondioli, L. (2000). Investigating variability in human dental development in the past. Anthropologie, 38 (1), 101–107.

Séguy, I., Couvert, N., Belaigues-Rossard, M., & Bringé, A. (2006b). Des tables-types de mortalité pour les populations pré-industrielles. In L. Buchet, C. Dauphin, & I. Séguy (Eds.), La paléodémographie. Mémoire d’os, mémoire d’hommes (Actes des 8 es Journées anthropologiques de Valbonne, juin 2003) (pp. 303–321). Antibes: APDCA.

Buchet, L., Séguy, I., Suchecki, M., & Belaigues-Rossard, M. (2006c). Estimation de l’âge au décès des immatures par le degré de minéralisation. In L. Buchet, C. Dauphin, & I. Séguy (Eds.), La paléodémographie. Mémoire d’os, mémoire d’hommes (Actes des 8 es Journées anthropologiques de Valbonne, juin 2003) (pp. 289–296). Antibes: Éditions APDCA.

Sundick, R. I. (1978). Human skeletal growth and age determination. Homo, 29 (4), 228–249.

Ubelaker, D. H. (1989). The estimation of age at death from immature human bone. In M. Y. İşcan (Ed.), Age markers in the human skeleton (pp. 55–70). Springfield: Charles C. Thomas.

Download references

Author information

Authors and affiliations.

Unité Histoire et Population, INED, Paris, France

Isabelle Séguy

CEPAM, UMR 7264 (CNRS, UNS), Nice, France

You can also search for this author in PubMed   Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this chapter

Séguy, I., Buchet, L. (2013). Establishing a Reference Population. In: Handbook of Palaeodemography. INED Population Studies, vol 2. Springer, Cham. https://doi.org/10.1007/978-3-319-01553-8_4

Download citation

DOI : https://doi.org/10.1007/978-3-319-01553-8_4

Published : 26 October 2013

Publisher Name : Springer, Cham

Print ISBN : 978-3-319-01552-1

Online ISBN : 978-3-319-01553-8

eBook Packages : Humanities, Social Sciences and Law Social Sciences (R0)

Share this chapter

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

  • Foundations
  • Write Paper

Search form

  • Experiments
  • Anthropology
  • Self-Esteem
  • Social Anxiety

research reference population

Research Population

All research questions address issues that are of great relevance to important groups of individuals known as a research population.

This article is a part of the guide:

  • Non-Probability Sampling
  • Convenience Sampling
  • Random Sampling
  • Stratified Sampling
  • Systematic Sampling

Browse Full Outline

  • 1 What is Sampling?
  • 2.1 Sample Group
  • 2.2 Research Population
  • 2.3 Sample Size
  • 2.4 Randomization
  • 3.1 Statistical Sampling
  • 3.2 Sampling Distribution
  • 3.3.1 Random Sampling Error
  • 4.1 Random Sampling
  • 4.2 Stratified Sampling
  • 4.3 Systematic Sampling
  • 4.4 Cluster Sampling
  • 4.5 Disproportional Sampling
  • 5.1 Convenience Sampling
  • 5.2 Sequential Sampling
  • 5.3 Quota Sampling
  • 5.4 Judgmental Sampling
  • 5.5 Snowball Sampling

A research population is generally a large collection of individuals or objects that is the main focus of a scientific query. It is for the benefit of the population that researches are done. However, due to the large sizes of populations, researchers often cannot test every individual in the population because it is too expensive and time-consuming. This is the reason why researchers rely on sampling techniques .

A research population is also known as a well-defined collection of individuals or objects known to have similar characteristics. All individuals or objects within a certain population usually have a common, binding characteristic or trait.

Usually, the description of the population and the common binding characteristic of its members are the same. "Government officials" is a well-defined group of individuals which can be considered as a population and all the members of this population are indeed officials of the government.

research reference population

Relationship of Sample and Population in Research

A sample is simply a subset of the population. The concept of sample arises from the inability of the researchers to test all the individuals in a given population. The sample must be representative of the population from which it was drawn and it must have good size to warrant statistical analysis.

The main function of the sample is to allow the researchers to conduct the study to individuals from the population so that the results of their study can be used to derive conclusions that will apply to the entire population. It is much like a give-and-take process. The population “gives” the sample, and then it “takes” conclusions from the results obtained from the sample.

research reference population

Two Types of Population in Research

Target population.

Target population refers to the ENTIRE group of individuals or objects to which researchers are interested in generalizing the conclusions. The target population usually has varying characteristics and it is also known as the theoretical population.

Accessible Population

The accessible population is the population in research to which the researchers can apply their conclusions. This population is a subset of the target population and is also known as the study population. It is from the accessible population that researchers draw their samples.

  • Psychology 101
  • Flags and Countries
  • Capitals and Countries

Explorable.com (Nov 15, 2009). Research Population. Retrieved May 13, 2024 from Explorable.com: https://explorable.com/research-population

You Are Allowed To Copy The Text

The text in this article is licensed under the Creative Commons-License Attribution 4.0 International (CC BY 4.0) .

This means you're free to copy, share and adapt any parts (or all) of the text in the article, as long as you give appropriate credit and provide a link/reference to this page.

That is it. You don't need our permission to copy the article; just include a link/reference back to this page. You can use it freely (with some kind of link), and we're also okay with people reprinting in publications like books, blogs, newsletters, course-material, papers, wikipedia and presentations (with clear attribution).

Want to stay up to date? Follow us!

Save this course for later.

Don't have time for it all now? No problem, save it as a course and come back to it later.

Footer bottom

  • Privacy Policy

research reference population

  • Subscribe to our RSS Feed
  • Like us on Facebook
  • Follow us on Twitter

research reference population

The 2021 World Population Data Sheet provides the latest population, health, and environment indicators for more than 200 countries and territories, each carefully researched by PRB’s expert team of demographers and analysts. Read more about the 2021 Data Sheet: Press Release

As of August 2021, we don’t have enough information to fully understand COVID-19’s global demographic impact. The pandemic is likely the cause of an increase in crude death rates in some countries around the world and a dip in life expectancy in the United States. Its impact on fertility trends has generated speculation and concern globally, but reliable data on how COVID-19 is influencing fertility are limited and continue to change as the pandemic unfolds.

Globally, the total fertility rate (TFR)—or average lifetime births per woman—has declined from 3.2 in 1990 to 2.3 in 2020. However, large variations exist, ranging from 4.7 in sub-Saharan Africa to 1.3 in East Asia and Southern Europe. Key pre-pandemic trends include a significant trend toward delayed childbearing across high-income countries, while the proportion of births to adolescent mothers ages 15-19 remains substantial in many low-income countries.

Toshiko Kaneda

Toshiko Kaneda joined PRB in 2004. Kaneda has 20 years of experience conducting research and demographic analysis. She has written numerous policy publications and peer-reviewed articles on noncommunicable diseases, population aging, health care access, and reproductive health and family planning. Kaneda directs data analysis for the World Population Data Sheet and provides technical guidance on demographic and statistical methods within PRB, as well as to external partners. She also directs PRB’s U.S. Policy Communications Training Program, which trains U.S. doctoral students to translate scientific research findings for non-scientific audiences. The program is supported by the National Institutes of Health. Prior to joining PRB, Kaneda was a Bernard Berelson Fellow at the Population Council. She holds a Ph.D. in sociology from the University of North Carolina at Chapel Hill, where she was also a predoctoral trainee at the Carolina Population Center.

research reference population

Charlotte Greenbaum

Charlotte Greenbaum joined International Programs as a policy analyst in 2017. Her focus is on reproductive health, family planning, and demography, with a particular interest in equity and reproductive health among youth. Greenbaum previously worked with the nongovernmental organization Tostan in Senegal as a volunteer assistant to the Reinforcement of Parental Practices project. She also worked as an intern at the Clinton Health Access Initiative in Lesotho. Greenbaum holds a master of science in global health and population from the Harvard T.H. Chan School of Public Health and a bachelor’s degree in human biology and French from Stanford University, with a concentration in global disparities in women’s health. She is proficient in French.

research reference population

Carl Haub has been associated with PRB since 1979. Despite his retirement 10 years ago, he still kindly provides his expertise to the annual World Population Data Sheet. While at PRB, Haub worked on projects in the United States and abroad, including Belarus, Germany, Honduras, India, Jamaica, Seychelles, and Vietnam. Much of his time at PRB was devoted to answering questions from journalists and reporters on demographic issues. He provided briefings to journalists at newspapers throughout the United States and Europe and gave lectures at meetings and press conferences. Haub holds a bachelor’s degree in government and a master’s degree in demography from Georgetown University.

research reference population

Population Reference Bureau, 1875 Connecticut Avenue, N.W. Suite 520, Washington, D.C. 20009 Phone: 800-877-9881, Email: [email protected]

© 2021 Population Reference Bureau. All Rights Reserved. Privacy Policy

National Academies Press: OpenBook

Population Growth and Economic Development: Policy Questions (1986)

Chapter: references.

Below is the uncorrected machine-read text of this chapter, intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text of each book. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

References Ahlburg, D.A. (1985) The Impact of Population Growth on Economic Growth in Developing Nations: The Evidence from Macroeconomic-Demographic Models. Background paper prepared for the Worldng Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Allan, W. (1965) The African Husbandman. London: Oliver and Boyd. Arthur, W.B., and G. McNicoll (1978) Samuelson, population "d intergenerational transfers. International Economic Review 19(1) 241-246. Ashton, B., K. Hill, A. Piazza, and R. Zeitz (1984) Famine in China, 1958-61. Population and Dc~clopmcut Review 10(4):613~646. Bale, M., and R. Duncan (1983) Food prospects in the developing countnes: a qualified optimistic view. American Economic Review 73:244~248. Barlow, R. (1967) The econonuc effects of malaria eradication. American Economic Review 57:13~157. Barlow, R. (1979) Heals and economic development: a theoretical and empirical review. Research in Human Capital and Dc~clopmcnt 1:4~75. Barnett, H.J., G.hl. van Muiswinkel, M. Schechter, and J.J. Myers (1984) The global trends in nonfuel minerals. Pp. 31~338 in J.L. Simon and H. Kahn, eds., Thc Resourceful Earth: A Rcsponsc to Global '000. New York: Basil Blaclcwell. Baumol, W., and W. Oates (1984) Long-mn trends in environmental quality. ~ J.L. Simon and H. Kahn, eds., Thc Rcsourec~ul Earth: A Rcsponsc to Global 9000. New York: Basil Blackwell. Beckford, G.L. (1984) Induced innovation model of agricultural development comment. Pp. 75~1 in C.K. Eicher and J. Staatz, eds., Agricultural Dc~clopmcnt in the Third World. Baltimore, Md.: Johns Hopkins University Press. Beckmann, P. (1984) Solar energy and over "al~rnadve" energy. In J.L. Simon and H. Kahn, eds., Thc Rcsoureefu1 Earth: A Rcsponsc to Global 9000. New York: Basil Blackwell. Behrman, J.R., and N. Birdsall (1983) The qualifier of schooling: quantity alone is misleading. American Economic Rcvu~w 73:928-946. 94

REFERENCES 95 Behrman, J.R., and N. Birdsall (19853 The Reward for Choosing Well the Timing of One's Birth: Cohort Effects and Earnings Functions for Brazilian Males. World Bank, Washington, D.C. Berelson, B. (197S) The Great Debate on Population Policy: An Instructive Entertainmens. New York: The Population Council. Berelson, B., and J. Lieberson (1979) Government efforts to influence fertility: the ethical issues. Population and Dc~clopmcnt Review 5(4):581-613. Binswanger, H.P., and P.L. Pingali (1984) The Evolution of Farming Systems and Agricultural Technology in Sub-Saharan Africa. Discussion Paper ARU-23. World Bank, Washington, D.C. Birdsall, N. (1977) Analytical approaches to the relationship of population grown and development. Population and Dcvelopmcat Reavow 3(1-2):63-102. Birdsall, N. (1980) The cost of siblings: child schooling in urban Colombia. Research in Population Economics 2:115-150. Boserup, E. (196S) The Conditions of Agricultural Growth. Chicago: Aldine Publishing Co. Boserup, E. (1981) Population and Technological Change. Chicago: University of Chicago Press. Boulier, B.L. (1985) Evaluating Unmet Needs for Contraception: Estimates for Thin~r-Six Developing Countries. Working Riper No. 678. World Bank, Washington, D.C. Brackett, J.W. (1978) Family planning in four Latin American countries knowledge, use and unmet needs: some findings from the World Ferdliq Survey. International Family Planning Perspectives 4(4):11~123. Breman, H., and C.T. de Wit (1983) Rangeland productivity and e~cploitabon in the Sahel. Scenic =1 :1341-1 347. Brown, L.R. (19813 World population grown, soil erosion, and food security. Sconce 214:995-1001. Brown, L.R. (1984) Securing food supplies. Pp. 175-193 in L.R. Brown et al., eds., Sync of the World. New York: W.W. Norton & Co. Bulatao, R. (1979) On the Nature of the Transition in He Value of Children. Paper No. SEA. I he East-West Population Institute, Honolulu, Hawaii. Bulatao, R. (1984) Reducing Fertility in Developing Counties: A Review of Determinants and Policy Levers. Working Paper No. 680. World Bank, Washington, D.C. Bureau of the Census (1983) World Population 1983~ccent Demographic Estimates for the Countries and Regions of the World. Washington, D.C.: U.S. Department of Commerce. Cam, M. (1983) Fertility as an adjustment to risk. Population and Dcvelopmcat Review 9(4):688-702. Caldwell, J.C., P.H. Reddy, and P. Caldwell (1982) Demographic change in rural south India. Population and Dc~clopmcat Review 8:68~727. Chandler, W. (1984) Recycling materials. Pp. 95-114 in L.R. Brown et al., eds., State of the World. New York: W.W. Norton & Co. Chenery, H., and M. Syrquin (1975) Patterns of Dc~clopmcnt, 195~1970. New York: O:ctord Universitr Press. Clark, C. (1978) Mathematical Biocconomics: The Optimal Management of Renewable Rcsourecs. New York: Wiley. Clawson, M. (1982) Private forests. Pp. 283-292 in P.R. Pormey and R.B. Hass, eds., Current leases u: Natural Rcsourecs Policy. Baltimore, Md.: Johns Hopkins University Pats. Coale, A.J. (1978) Population growth and economic development the case of Me.,uco. Foreign Affairs 56(2):415~29. Coale, A.J. (1984) Rapid Population Change in China, 1952-1982. Washington, D.C: National Acadeny Press.

~6 REFERENCES Coale, A.J., and E.M. Hoover (1958) Population Grown and Economic Development In Low-lncomc Countries. Pnnceton, N.J.: Princeton University Press. Cochrane, S.H. (1983) Development Consequences of Rapid Population Growth: A Review from the Perspective of Sub~ahara Afnca. Background paper for Report on Population Strategies for Sub Saharan Africa. World Bank, Washington, D.C. Commoner, B., M. Carr, and P.J. S~mler (1971) The causes of pollution. Environment 1 3(3):2-19. Corsa, L., Jr., and D. Oakley (1971) Consequences of population growth for heals services in less developed countries an initial appraisal. Pp. 368 402 in National Academy of Sciences, Rapid Population Growth: Conseq~caces and Policy Implications, Vol. II. Baltimore, Aid.: Johns Hopldas University Press. Council on En~n~nmental Quality and U.S. Department of State (1980) The Global 9000 Report to the President. Washington, D.C.: U.S. Government Pnndag Office. Crosson, PR. (1982) Agncultuml land. Pp. 253-282 in P.R. Portney and R.B. Hass, eds., Current Issues In Natural Rcsourecs Policy. Baltimore, Md.: Johns Hopkins University Press. Crosson, P.R. (1983) Soil Erosion in Developing Countnes: Amounts, Consequences, and Policies. Working Paper No. 21. Center for Resource Policy Studies, School of Natural Resources, University of Wisconsin, Madison. Crosson, P.R. (1984) National Costs of Erosion Effects on Productivity. Unpublished paper. Resources for the Future, Washington, I).C. Dasgupta, P. (1985) The Ethical Foundations of Population Policy. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Dasgupta, P., and G.M. Heal (1979) Economic Theory and ExhaustibkRcsourecs. Cambndge, England: Cambndge University Press. David, P. (1975) Technical Choice, I - ovation and Economic Growth. Cambndge, England: Cambridge University Press. Deardorff, A.V. (1985) Trade and Capital Mobility in a World of Diverging Populations. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Deevey, E.S., D. Rice, P. Rice, H.H. Vaughan, M. Brenner, and M.S. Flannel (1979) Mayan urbanism: impact on a tropical karst environment. Science 2Q6(19):19~306. de Janvsy, E. (1984) The political economy of rural development in Latin Amenca: an interpretation. Pp. 82-95 in C.K. Eicher and J. So, eds., Agac~tural Development u' the Third World. Baltimore, Md.: Johns Hopkins University Press. Denison, E.F. (1962) The Sourecs of Economic Growth and the Alternatives Before Us. New York: Committee for Economic Development. Denison, E.F. (1974) According for Unfed Swes Economic Growth. 1979-1969. Washington, D.C.: Brookings Institution. Denison, E.F., and W.K. Chung (1976) How Japan's Economy Grew So Fast. Washington, D.C.: Brookings Institution. Deolalikar, A. (1984) Are There Primary Retunes to Health in Agncultu~al Work? An Econometric Analysis of Agricultural Wages and Farm P=ductiviq in Rural Soup India. Unpublished manuscript. Dq~arOnent of Economics, University of Pennsylvania. Easterlin, R.E., ed. (1980) Population and Economic Change in Developing Countries. Chicago: University of Chicago Press. Economic Commission for Africa (1984) Food production and population in Affica. Pp. 241-266 in United Nations Internadosal Conference on Population, Pop-lion, Rcsourecs, E - irormnent, and Dc~clopmcnt. Population Study No. 90, ST/ESAtSER.A190. New Yorlc: United Nations.

REFERENCES 97 Eicher, C.K. (1984) Facing up to Africa's food crisis. Pp. 452~80 in C.K. Eicher and J. Steatz, eds., Agricultural Dc~clopmcnt In the Third World. Baltimore, Md.: Johns Hopkins University Press. Eicher, C.K., and J. Staatz, eds. (1984) Agricultural Development in the Third World. Baltimore, Md.: Johns Hopkins University Press. Ernst, C., and J. Angst (1983) Birth Order: Its Influenec on Pcrsomlity. New York: Springer-Verlag. Evenson, R.E. (1984a) Benefits and obstacles in developing awropuate agricultural technology. Pp. 348-361 in C.K. Eicher and J. So, eds., Agriculsural Dc~clopmcnt uz the Third World. Baltimore, Md.: Johns Hopkins University Press. Evenson, R.E. (1984b) Population Growth, Infrastructural Development, Technologr and Welfare in Rural North India. Paper prepared for the IUSSP Seminar on Population and Rural Development. New Delhi, India. Fields, G. (1975) Rural-urban migration, urban unemployment and underemployment, and job search activity in LDCs. Journal of Dc~clopmcnt Economics 2:165-187. Firor, J.W., and P.R. Portney (1982) The world climate. Pip. 17~215 in P.R. Porh~ey, ea., Current Issues in Natural Rcsourecs Policy. Washington, D.C.: Resources for the Future. Flavin, C., and S. Pastel (1984) Developing renewable energy. Pp. 136-174 in L.R. Brown et al., eds., State of the World 1984. New York: W.W. Norton & Co. Food and Agnculture Organization (1981 ) Agriculture: Toward 9000. Itome: Food and Agriculture Organization. Food and Agriculture Organization (1983) Land, Food, and Population. FAO Conference Session 22, 5 November 1983, Rome, Italy. Ghatalc, S., and K. Ingersent (1984) Agriculture and Economic Dc~clopmcnt. Baltimore, Md.: Johns Hopkins Universitr Press. Gilbert, A. (1976) The argument for very large cities reconsidered. Urban Studies 13(1):27- 34 . Giovannini, A. (1983) Interest-elasticity of savings in developing countries: the existing evidence. World Dc~clopmcnt 11(7).601~7. Clover, D., and J.L. Simon (1'975) The effects of population density on infras~c~re: the case of road building. Economic Development and Cultural Change 23:453~68. Goeller, H.E., and A. Zucker (1984) Infinite resources: the ultimate strategy. Scorned 223.456~62. Golladay, F., and B. Liese (1980) Heals Issues and Policies in the Developing World. Working Paper No. 412. World Bank, Washington, D.C. Gould, J. (1972) Economic Growth in History: Survey and Analysis. London: Methuen & Company. Gourou, P. (1980) The Tropical{ World: Its Social and Economic Cond lions and Its Future Status, 5th edition. New York: Longman. Greenwood, M.J. (1969) The determinants of labor migration in Egypt. Journal of Regional Scicnec 9:283-290. Greenwood, M.J. (1971) Regression analysis of migration to urban areas of a less-developed country: the case of India. Journal of Regional Science 11:253-262. Greenwood, M.J. (1978) An econometric model of internal migration and regional economic growth in Mexico. Journal of Regional Science 18:17-31. Gregory, P. (1980) An assessment of changes in employment conditions in less developed countries. Economic Dc~clopmcnt and Cul~r~1 Change 28(4):673-700. Habakkok, H.J. (1962) American and British Technology us Tic Nineteenth Century. Cambridge, England: Cambridge University Press.

98 REFERENCES Hammer, 3.S. (1984) Population Growth and Savings in Less Developed Countnes. Background paper prepared for Be 1984 World Dcvelopmcat Report. World Bank, Washington, D.C. Hanushek, E.A. (1981) Throwing money at schools. Journal of Policy Analysis and Management 1:19~1. Harnagton, W., and A.C. Fisher (1982) Endangered species. Pp. 117-148 in P.R. Portney and R.B. Hass, eds., Current issues in Natural Resources Policy. Washington, D.C.: Resources for the Future. Harris, R., and R. Sabot (1982) Urban unemployment in Ll)Cs: toward a more general search model. In R. Sabot, ea., Migration and the L - or Market in Developing Countries. Boulder, Colo.: Westview Press. Harris, R., and M. Todaro (1970) Migration, unemployment, and development: a two sector analysis. American &ononuc Review 60:12~142. Hayami, Y., and V.W. Ruttan (1984) The green revolution: inducement and distribution. Pakistan Development Ncv~cw 23(Spnog):38~3. Haya~ni, Y., and V.W. Ruttan (1985a) Agricultural Development: An latcrnational Perspective. Baltimore, Md.: Johns Hopkins Unive~sibr Press. Hayami, Y., and V.W. Ruttan (1985b) Population Growth and Agncultural Productivity. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Henderson, 3.V. (1984) General equilibrium modeling of systems of cities. In P. Nijkamp and E. Mills, eds., Ifandbooks u' Regional and Urban Economics. New York: North- Holland. Henderson, J.V. (1985) Industrialization and Urbanization: International Experience. Background paper prepared for He Working Group on Population Growth and Econonuc Development, Committee on Population, National Research Council, Washington, D.C. Heyneman, S.P., and W.A. Lo~ley (1983) The effect of primary school quality on academic achievement across twenty-nine high- and low-income countries. American Journal of Sociology 88(6):1 162-1 192. Hinman, C.W. (19843 New crops for arid lands. Scicnec 225:1445-1448. Ingram, G.K. (1980) Land in Perspective: Its Role in He Structure of Cides. Urban and Regional Report 80~9. World Bank, Washington, D.C. Inten~adonal Labour Organisation (1974) Sharing in Dc~clopmcn`. Geneva: International Labour Organis~on. James, J. (1985) Population and Technical Change in the Manufactunog Sector of Developing Countnes. Background paper prepared for the Working Group on Population Grown and Economic Development, Comnuttee on Population, National Research Council, Washington, D.C. Johnson, D.G. (1974) Population, food, and economic adjustment. American Statistician 28(3):8~93. Johnson, D.G. (1985) World Commodity Market S.-ation and Outlook. Washington, D.C.: American Enterprise Institute. Jones, G.W. (1971) Effect of population change on He aazirunent of educational goods in developing countnes. Pp. 315-367 in National Academy of Sciences, Rapid Population Growth: Consequences and Policy Implications. Baltimore, Md.: Johns Hopkins University Press. Jones, G.W. (1975) Population Growth and Education Planning in Dc~clopcng Nations. New York: John Wiley and Sons. Keeley, M.C. (1976) A neoclassical analysis of economic~e~nographic simulation models. Pp. 25 46 in M.C. Keeley, ea., Population, Public Policy, and Economic Development. New Yorlc. Praeger.

REFERENCES 99 Kelley, A.C. (1973) Population growth, the dependency rate, and the pace of economic development. Populatu~n Stairs 27:4J35 414. Kelley, A.C. (1976) Demographic change and the size of the government sector. Southern Economic Journal 43(2):10S~1066. Kelley, A., and J. Williamson (1984) What Drives Third Work! COy Growth? Princeton, N.J.: Princeton University Press. Khan, A.R. (1984) Population Growth and Access to Land: An Asian Perspective. Paper prepared for the IUSSP Seminar on Population, Food, and Rural Development. New Delhi, India. King, E.M. (1985) Consequences of Population Pressure in the Family's Welfare. Background paper prepared for the Working Group on Population Growth and Econonuc Development, Committee on Population, National Research Council, Washington, D.C. Kirchner, J., J. Drake, G. Ledec, R. Goodland, V. Smil, and H.E. Daly (1984) Carrying Capacity, Population Growth and Sustainable Development. Working Paper No. 690. World Bank, Washington, D.C. Knight, J.B., and R. Sabot (1982) Land market discrimination in a poor urban economy. Journal of Dcvelopmcat Studies 19(13:67-87. Koopmans, T.C. (1974) Proof for a case where discounting advances the doomsday. Pp. 117-120 in Symposium on the Economics ~ Exhaustzbic Rcsourecs. Krueger, A. (1978) Alternative trade strategies and employment in LDCs. American Economic Reavow 68:27~274. Kuzoem, S. (1967) Pbpulabon and economic grown. Procec~u~gs of He American Philosophical Society 3(3):17~193. Kuzoets, S. (1976) Demographic aspects of the size distribution of income. Economic Dcvelopmeru and Cultural Change 25(1):1-94. Lam, D. (1985) Distribution Issues in the Relationship Between Population Growth and Economic Development. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Lee, R. (1980) An historical perspective on economic aspects of the population explosion: the case of preindustrial England. Pp. 517-556 in R.E. F~c~rlin' ea., Population and Economic Change in Developing Countries. Chicago: University of Chicago Press. Lefebvre, A. (1977) C:roissmce demographique et progres economique dans lo. pays en developpment de 1960 a 1974 Demographic growth and economic progress in developing countries from 1960 to 1974]. Population Is) 32(6):1287-1293. Leff, N. (1968) Dependency rates and savings rates. American Economic Review 59:886 895. Leibenstein, H. (1971) The impact of population grown on economic welfare~ontraditional elements. Pp. 175-198 in National Acaden~ of Sciences, Rapid Population Growth: Consequences and Policy Implications. Baltimore, Md.: Johns Hopkins University Press. Leo~ef, W., J. Koo, S. Nasar, and I. Sohn (1983) The Fu~rc of Nonfucl Minerals u' Kc U.S. and World Economy. Lexington, Mass.: D.C. Head. Lewis, A. (1954) Development with unlimited supplies of labor. Manchester School of Economics and *acial Stains 20:13~192. Li, C. (1985) Economic reform bongs better life. Beijing Reavow 28(29):1~22. Lion, J. (1983) Civics in the Dcvelopu~g World. New York: O:cfo~ University Press. MacKellar, F.L., and D.R. Vining, Jr. (1985) Nature Resource Scarcity: A Global Summary. Background paper prepared for He Working &mup on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Mason, A. (1985) National Sa~nag Rates and Population Growth: A New Model and New Evidence. Background paper plagued for the Worldug Group on Population Grow and

100 REFERENCES Economic Development, Committee on Population, National Research Council, Washington, D.C. Mazumdar, D. (1984) The rural-urban wage gap migration and the working of urban labor markets. radian Economic Review 18(2):169-198. Mazumdar, D. (1985) Rural-Urban Migration in Developing Countnes. Paper prepared for the Conference on Population Growth, Urbanization, and Urban Policies in the Asia- Pacific Region. East-West Center, Honolulu, Hawaii. McNicoll, &. (1984) Consequences of rapid population growth: overview and assessment. Population and Dc~clopmcnt Review 10(2):177-240. Meadows, D.H., D.L. Meadows, J. Randers, and W.W. Behrens m (1972) Tkc Limits to Growth. New York: Universe Books. Mellor, J., and B.F. Johnston (1984) The world food equation: interrelations among development, employment, and food consumption. Journal of Economic L~raturc 12:531- 574. Mensch, B., H. Leather, and S. Preston (1985) Chin Mortality D`f~crcrstials u: Developing Countries. New York: United Nations. Miller, K.R., J. Furtado, C. de Klemm, J.A. McNeely, N. Muyers, M.E. Soule, and M.C. Trexler (1986) Issues on the presentation of biological diversity. In R. Repetto, ea., Tkc Global Possible. World Resources Stile. New Haven, Conn.: Yale University Press. Mohan, R. (1984) The effect of population grown, the pattern of demand and of technology on the process of urbanization. Journal of Urban Economics 15:125 156. Montgomery, M.A. (1985) The Impacts of Urban Population Grown on Urban Labor Markets and the Costs of Urban Service Delivery: A Review. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Mosley, H. (1983) Will Primary Health Care Reduce Infant and Child Mortality? A Critique of Some Current Strategies, with Special Reference to Africa and Asia. Paper prepared for the IUSSP Seminar on Social Policy, Heald1 Policy, and Mortality Prospects. Paris, France. Mueller, E. (1984) Income, aspirations, and fertility in rural areas of less developed countries. Pp. 121-150 in W.A. Schutjer and C.S. Stokes, eds., Rural Dc~clopmcnt and Fcrdlity. London: Macmillan. Murnane, R.J. (1981) Interpreting the evidence on school effectiveness. Teachers CoUcgc Record 83:1~35. Muscat, R. (1984) Carrying Capacity and Rapid Population Growth: Definition, Cases, and Consequences. Working Paper No. 690. World Bank, Washington, D.C. Myers, N. (1980) Conversion of Tropical Moist Forests. Board on Science and Technology for International Development. Washington, D.C.: National Academy Press. National Academy of Sciences (1971) Rapid Population Growth: Consequences and Policy lrnplications. 2 vols. Baltimore, Md.: Johns Hopldas University Press. National Research Council (1984) Changing Climate: Report clef the Carbon Dioxide Assessment Committee. Board on Atmospheric Sciences and Climates. Washington, D.C.: National Academy Press. Nerlove, M., A. R~, and E. Sadka (1984) Some Welfare Theoretic Enplications of Endogenous Ferdliq. Paper prepared for the Conference on He Economics of He Family. University of Pennsylvania. North, D.l)., and R.P. Thomas (1973) 7.hc J{isc of the Western World: A New Economic ~ story. Cambridge, England: Cambridge University Press. Oberai, A. (1978) Changes in taut Structwc of Employment with Economic Development. Geneva: IntErnabonal Labour Organisabon.

REFERENCES 101 PheIps, E.S. (1962) The view of investment. Queerly Journal of Economics 76:548-567. Phelps, E.S. (1968) Population increase. Canadian Journal of Economics 1(3):497-518. Phelps, E.S. (1980) On the increase of technology. Pp. 12~132 in E.S. Phelps, ea., Studies In Macroeconomic Theory. Vol. a: Redistribution and Growth. New York: Academic Press. Pingali, P.L., and H.P. Binswanger (1984) Population Density and Farmung Systems: The Changing Focus of Innovations and Technical Change. Paper prepared for the IUSSP Seminar on Population and Rural Development. New Delhi, India. Pingali, P.L., and H.P. Binswanger (198S) Population Density and Agricultural Intensification: A Study of the Evolution of Technologies in Tropicad Agriculture. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Portes, A., and L. Benton (1984) Industrial development and labor force absorption. Population and Dc~clopmcut Review 10(4):S89~12. Portney, P.R. (1982) Introduction. Pp. 1-13 in P.R. Portney and R.B. Haas, eds., Current Issues in Natural Unsoured Policy. Baltimore, Md.: Johns Hopkins University Press. Postel, S. (1984) Protecting forests. Pp. 74~94 in L.R. Brown et al., eds., State of the World. New York: W.W. Norton & Co. Potter, J. (1978) Demographic factors and income distribution in Latin America. Pp. 321- 337 in Proceedings, Economic and Demographic Chant: Issues for the 1980s. Helsinki: International Union for the Scientific Study of Population. Preston, S. (1979) Urban growth in developing countries: a demo~anhic It Population and Dcvelopmc~st Reavow 5(2):195-215. sin ~ ~ AN ~ ~. . . O ~__ ~ sacnaropoulos, G. (1981) Returns to education: an updated international comparison. Comparative Education 17(3):321-347. Ran,, R., and T.W. Schultz (1979) Life span, health, savings, and productivity. Economic Dc~clopmcat and Cultural Change 27(3):399421. Rempel, H. (1981) Rural-Urban Labor Migration and Urban Unemployment in Kenya. La~cenburg, Austria: Intemational Institute for Applied Systems Analysis. Renaud, B. (1982) National Urbani--ation Policy u' Developing Countries. New York: Oxford University Press. Revelle, R. (1976) The resources available for agriculture. Scientific American 235(3):16 179. Ridker, R.G. (1976) Population and Dcvelopmcat: The Search for Sckctivc Interventions. Baltimore, Md.: Johns Hopkins University Press. Rodgers, G. (1984) Poverty and Population: Approaches and Evidenec. Geneva: International Labour Organisation. Rosenzweig, M., and K.I. Wolpin (1980) Testing the quantity~uality fertilibr model: the use of twins as a natural experiment. Econometrica 48(1):2~27-240. Rosenzweig, M., H.P. Binswanger, and 3. McIntyre (1984) From Land-Abundance to Land~carcit~r: The Effects of Population Growth on Production Relations in Agranan Economies. Paper prepared for the IUSSP Conference on Population, Food, and Rural l Development. New Delhi, India. Ruttan, V.W., and Hayami, Y. (1984) Induced innovation model of agricultural development. Pp. 59-74 in C.K. Eicher and J. Stow, eds., Agricultural Dc~clopmcnt in the Third World. Baltimore, Md.: Johns Hopkins University Press. Sabolo, Y. (1975) Employment and unemployment, 19601990. International L - our Review 1 12(6):401~17. Schultz, T.P. (1971) Rural-urban migration in Colombia. Pc~iew of Economics and Statistics 53:157-163.

102 REFERENCES Schultz, T.P. (1985) School Expenditures and Enrollment, 1960 1980: The Effects of Income Prices and Population Grown. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Schultz, T.W. (1979) Investment in population quality in low income countnes. Pp. 339- 360 in P.M. Haieser, ea., World Population and Development: Challenges and Prospects. Syracuse, N.Y.: Syracuse University Press. Schutjer, W.A., and C.S. Stokes, eds. (1984) Rural Development and Fertility. London: Macmillan. Sedjo, R.A., and M.C. Clawson (1984) Global forests. Pp. 128-170 in J.L. Simon and H. Kahn, eds., Thc Rcsoureeful Earth: A Response to Global 9000. New York: Basil Blaclcwell. Sen, A. (1981) Poverty and Famines: An Essay on Ent~lcmcnt and Depravation. Oxford, England: Clarendon Press. Simmons, J., and L. Alexander (1978) The determinants of school achievement in developing countnes: a review of the research. Economic Dcvelopmcut and Cultural Change 26(2):341- 357. Simon, J.L. (1975) The positive effect of population growth on agricultural saving in irrigation systems. Reavow of Economics and Statistics 57:71-79. Simon, J.L. (197 7) The Economics of Population Growth. Princeton, N.J.: Princeton University Press. Simon, J.L. (1981) Thc Ultimate Resource. Princeton, N.J.: Princeton T3niversiq Press. Simon, J.L., and R. Gobin (1980) The relationship between population and economic growth in LDOs. Research in Population Economics 2:21S-234. Simon, J.L., and H. Kahn, eds. (198~4) Thc Rcsoureeful Earth: A Response to Global woo. New York: Basil Blaclmell. Simon, J.L., and A.M. Pilarski (1979) The effect of population growth upon the quantity of education children receive. Review of Economics and Statistics 61(4):572-584. Simon, J.L., and G. Steinman (1981) Population grown and Phelps' technical progress model: inte~pretabon and generalization. Pp. 23~W in J.L. Simon and P. Lindert, eds., Rcscarok in Population &onom~cs, Vol. 3. Greenwich, Conn.: JAI Press. Sunon, J.L., and A. Wildavsky (1984) On species loss, the absence of d - , and risks to humanity. pp. 171-183 in J.L. Simon and H. Kahn, eds., Tkc Resourceful Earth: A Response to Global 9000. New York: Basil Blaclmell. Slade, M.E. (1982) Trends in natural resource commodity prices: an analysis of He time domain. Journal of Environmcn~l Economics and Management 9:122-137. Slade, M.E. (1985) Natural Resources, Population Grog, and Economic Well-Being. Background Development, Committee on Population, National Research Council, Washington, D.C. Slicher van Bad, B.H. (19633 Thc Agrarmn History of Western Europe, A.D. 500-1850. London: Edward Arnold. Smil, V. (1984) Thc Bad Earth: Environmental Degradation in China. Armonk, N.Y.: Shy. Smith, V.K., and J.V. Krutilla (1979) Resource and environmental constraints to grown. American Journal of Agricultural Economics 61(3)~395 408. Solow, R. (1956) A contribution to the theory of economic growth. Quarterly Journal of Economics 70:65-90. Squire, L. (1981) Employment Policy in Developing Comics. New York: Oxford University . MASS. Srini~n, T.N. (1985) Population, Food, and Rural Development. Background paper prepared for He Working Group on Population Growth and Economic Development, never prepared for the Worldng Group on Population Growdl and Economic

REFERENCES 103 Committee on Population, National Research Council, Washington, D.C. Srinivasan, T.N., and P.K. Bardhan (1974) Poverty and lucomc Distribution in India. Calcutta: Statistical Publishing Society. Standing, G. (1984) Labour Suwly and Development Policies: A Perspective. Paper prepared for the Conference on Population Growth and Labor Absorption in the Developing World (Rockefeller Foundation). Bellagio, Italy. Starrett, D. (1972) On golden rules, the "biological rate of interest" and competitive inefficiency. Journal of Political Economy 80:27~291. Stern, J., and J. Lewis (1980) Employment Patterns and Income Growth. Working Paper No. 419. World Bank, Washington, D.C. Stiglitz, J.E. (1979) A neoclassical analysis of the economics of natural resources. In V.K. Smith, ea., Scarcity and Growth Reconsidered. Baltimore, Md.: Johns Hopkins University Press. Stolnitz, G.J. (1984) Urbanization and Rural-t - Urban Migration in Relation to LDC Fertility. Fertility Determinants GroupIFutures Group Study for the Agency for International Development. Indiana University, Bloomington. Strauss, J. (1985) Does Better Nutntion Raise Farm Productivity? Unpublished manusenpt. Yale University. Tan, J.P., and M. Haines (1983) Schooling and the Demand for Children: Historical Perspectives. Background paper prepared for the 1984 World Development Report. World Bank, Washington, D.C. Terhune, K.W. (1974) A Review of the Actual and Expected Consequenecs of Family Size. Buffalo, N.Y.: Calspan Corp. Tobin, J. (1967) Life cycle saving and balanced economic growth. Pp. 231-256 in W. Fellner, ea., Ten Economic Studies in the Tradition of Irving Fisher. New York: Wiley Press. Todaro, M. (19693 A model of labor migration and urban unemployment in less developed countnes. American Economic Avow 59:393~23. Rodeo, M. (1980) Internal migration in developing countnes: a surrey. In R. Easterlin, ea., Population and Economic Change in Dc~clop~g Coteries. Chicago: University of Chicago Press. Todaro, M., and J. Stilkind (1981) City Bias and Rural Neglect: The Dilemma of Urban Dcvelopmcat. New York: The Population Council. Trussell, J., and A.R. Pebley (1984) The Potential Impact of Changes in Fertility on Infant, Child, and Maternal Mortality. Unpublished manuscript. Princeton University. United Nations, Population Division (19803 Patterns of Urban and Rural Pop~dation Growth. New York: United Nations. United Nations, Department of Technical Cooperation for Development (1984) Pcport of the International Confcrcncc on Population 1984. New York: United Nations. United Nations (1985) Migration, Population Growth and Employment In Metropolitan Areas of Selected Dcvelopu', Countries. New York: United Nations. U.S. Department of Agriculture, Economic Research Service (1985) China: Outlook and Situation. Report RS 85-5. Washington, D.C.: U.S. ~parunent of Agriculture. Usher, D. (1973) An imputation to the measure of economic growth for changes in life expectancy. In M. Moss, ea., The Measurement of Economic and Social Pcrform~cc. New York: National Bureau of Economic Research. Wattenberg, B.., and K. Zinsmeister (1985) Arc World Population Treads a Problem? Washington, D.C.: American Enterprise Institute. Westoff, C.F. (1978) The unmet need for bird control in five Asian countnes. International Family Planning Perspectives 4(13:~17.

104 REFERENCES Williamson, J.G., and P.H. Lindert (1980) American Inequality: A Macroeconomic History. New York: Academic Press. Willis, R.J. (1985) Externalities and Population. Background paper prepared for the Working Group on Population Growth and Economic Development, Committee on Population, National Research Council, Washington, D.C. Woodwell, G.M., J.E. Hobble, R.A. Houghton, J.M. Melillo, B. Moore, B.J. Peterson, and G.R. Shaver (1983) Global deforestation: contribution to atmospheric carbon dioxide. Science 222:1081-1088. World Bank (1974) Population Policies and Economic Dc~elopmcat. Baltimore, Md.: Johns Hopkins University Press. World Bank (1983a) World Tables. Economic Data, Vol. I. Baltimore, Md.: Johns Hopkins University Press. World Bank (1983b) World Babes. Social Data, Vol. II. Baltimore, Md.: Johns Hopkins University Press. World Bank (1984) World Development Report. Washington, D.C.: World Bank. World Health Organization (1975) Find: Report on the World Health Situation. Official Records of He World Health Organization, N. 225. Geneva: World Health Organization. World Meteorological Organization (1983) Population and Climate. Paper presented at the International Conference on Population. Geneva. Wray, J.I). (1971) Population pressure on families: family size and child spacing. Pp. 403~61 in National Academy of Sciences, Rapid Population Growth: Consequences and Policy Implications. Baltimore, Md.: Johns Hopkins University Press. Yap, L. (1977) The attraction of cities: a review of He migration literature. Journal of Dc~clopmcat Economics 4(3):239-264.

This book addresses nine relevant questions: Will population growth reduce the growth rate of per capita income because it reduces the per capita availability of exhaustible resources? How about for renewable resources? Will population growth aggravate degradation of the natural environment? Does more rapid growth reduce worker output and consumption? Do rapid growth and greater density lead to productivity gains through scale economies and thereby raise per capita income? Will rapid population growth reduce per capita levels of education and health? Will it increase inequality of income distribution? Is it an important source of labor problems and city population absorption? And, finally, do the economic effects of population growth justify government programs to reduce fertility that go beyond the provision of family planning services?

READ FREE ONLINE

Welcome to OpenBook!

You're looking at OpenBook, NAP.edu's online reading room since 1999. Based on feedback from you, our users, we've made some improvements that make it easier than ever to read thousands of publications on our website.

Do you want to take a quick tour of the OpenBook's features?

Show this book's table of contents , where you can jump to any chapter by name.

...or use these buttons to go back to the previous chapter or skip to the next one.

Jump up to the previous page or down to the next one. Also, you can type in a page number and press Enter to go directly to that page in the book.

To search the entire text of this book, type in your search term here and press Enter .

Share a link to this book page on your preferred social network or via email.

View our suggested citation for this chapter.

Ready to take your reading offline? Click here to buy this book in print or download it as a free PDF, if available.

Get Email Updates

Do you enjoy reading reports from the Academies online for free ? Sign up for email notifications and we'll let you know about new publications in your areas of interest when they're released.

Sampling Methods: A guide for researchers

Affiliation.

  • 1 Arizona School of Dentistry & Oral Health A.T. Still University, Mesa, AZ, USA [email protected].
  • PMID: 37553279

Sampling is a critical element of research design. Different methods can be used for sample selection to ensure that members of the study population reflect both the source and target populations, including probability and non-probability sampling. Power and sample size are used to determine the number of subjects needed to answer the research question. Characteristics of individuals included in the sample population should be clearly defined to determine eligibility for study participation and improve power. Sample selection methods differ based on study design. The purpose of this short report is to review common sampling considerations and related errors.

Keywords: research design; sample size; sampling.

Copyright © 2023 The American Dental Hygienists’ Association.

  • Research Design*
  • Sample Size

Introducing The Forrester Reference IT Capability Map

Charles Betz , Research Director

Today, I am happy to announce the publication of the Forrester Reference IT Capability Map . This is a comprehensive statement of the major capabilities used to define and deliver IT and digital systems, both individually and at a portfolio level.

From the report summary: Large digital and IT organizations represent complex ecosystems of processes, people, and platforms. They acquire technology and implement partnerships, composing and configuring solutions into valuable customer and end-user experiences, and support these experiences as applications, products, and services for years or decades. The IT capabilities for this delivery must be adaptive, as technology continues to evolve. Use this technology-, industry-, and maturity-agnostic IT capability map to assess your digital and IT organization and any significant deficits, gaps, or redundancies in your abilities to deliver value for your stakeholders.

Why a framework, and why now?

When companies reach a certain size, and their IT systems have grown accordingly, they tend to reach a watershed moment when they realize that they can no longer manage their IT estate with emails and spreadsheets and Post-it notes. At the larger scale, they may assign an enterprise architect to try to make sense of the space; I held that role for a large bank, wrote a book about the topic, and ultimately found myself helping to start up the IT4IT standard that is now a product of The Open Group (a more systems-level view of the topic that does not currently include a capability model, per se).

At Forrester, we frequently entertain client inquiries on this topic as well, and I have continued to evolve my thinking on this topic through covering enterprise service management, DevOps, product-centric transformation, and enterprise architecture over the past seven years. Clearly, the overall space has evolved as IT operating models have changed , and I felt that an updated view was called for.

So without further ado, here it is. This is the “Level 2” version, with 20 capabilities grouped into seven major areas. We go to a Level 3, with about 75 subcapabilities (3–5 for each Level 2).

research reference population

(Click for a larger view)

Inputs into the diagram include major IT frameworks and the work of various authors. At one point, we had a spreadsheet with hundreds of entries, which we then boiled down through multiple iterations and conversations about what’s important and what is changing. As noted in the report, “These capabilities evolve slowly over time as IT management approaches change and transcend the churn of the year-to-year technology market and application and service lifecycles.” Yet they do change. Products and platforms have entered industry thinking broadly and now need to be recognized.

This represents a kickoff of a major research stream for me on IT management architectures, which my long-term followers and associates will recognize as an interest of mine for about 20 years now. There are three primary taxonomies: capabilities (this work), an ontology, and a reference systems architecture we are tentatively calling the IT control plane. The latter is where we will link in actual markets for this space, including evaluative research such as our Forrester Wave™ work on topics like enterprise service management, strategic portfolio management, enterprise architecture, DevOps, and observability. Finally, all of this work will inform analyses of topics such as vendor M&A, market convergences, the impact of generative AI, and broader topics like technical debt and the rise of increasingly broad IT management platforms.

I could not have done this report without extensive collaboration, so big shout out to: Frederic Giron ,  Bobby Cameron ,  Sam Higgins ,  Joseph Blankenship ,  Andrew Hewitt ,  Aaron Katz ,  Margo Visitacion ,  Julie Mohr ,  Fiona Mark ,  David Mooter ,  Stephanie Balaouras , Carlos Casanova, Brent Ellis,  Matthew Guarini , Melissa Chhay , and Kara Hartig

There will be much more to come on this topic. Please drop me a line if you are interested!

Forrester clients only: I will be presenting a webinar on June 4 at 1 p.m. ET/10 a.m. PT.

  • Architecture & Technology Strategy
  • enterprise architecture

research reference population

Thanks for signing up.

Stay tuned for updates from the Forrester blogs.

Stop Runaway Tech Debt With Our Essential Toolkit

Unmanaged tech debt piles up fast. tackle it head on with our tech debt toolkit, featuring seven management strategies, a red-flag checklist, and a framework for success., the rise of application generation platforms, is your vmware knowledge stuck in 2010 let’s catch you up, get the insights at work newsletter, help us improve.

IMAGES

  1. Description of reference population, study population and sample

    research reference population

  2. 1.4

    research reference population

  3. study population

    research reference population

  4. Sample & Population Statistics: Understanding the Basics

    research reference population

  5. Conceptual diagram showing the steps in the population reference

    research reference population

  6. Population and Sampling

    research reference population

VIDEO

  1. Research Population

  2. Get to know Population Reference Bureau (PRB)

  3. 11 Questions-Solution/Unit-2 Demographic Measures of Ageing/Fundamentals of Ageing/code-445

  4. Writing About Population Research for Non-Scientists

  5. 2013 World Population Data Sheet Webinar

  6. Number of R&D researchers per million people

COMMENTS

  1. What Is the Big Deal About Populations in Research?

    In research, there are 2 kinds of populations: the target population and the accessible population. The accessible population is exactly what it sounds like, the subset of the target population that we can easily get our hands on to conduct our research. While our target population may be Caucasian females with a GFR of 20 or less who are ...

  2. PRB

    Research Division of the Innovation, Technology, and Research Hub in the USAID Bureau for Development, Democracy, and Innovation. View All. ... Population Reference Bureau 1111 19th St. NW Suite 400 Washington, DC 20036. Phone: 800-877-9881. Email: [email protected]. Workstyle Serviced Offices

  3. Defining the study population: who and why?

    A population-based approach was proposed and the sample frame was from the National Cancer Database, which includes more than 40 million historical records from over 1500 treatment sites. This was used to create the study population (women with T1-3N1 breast cancer before chemotherapy) by refining the initial dataset to match the research ...

  4. Statistics without tears: Populations and samples

    A population for a research study may comprise groups of people defined in many different ways, for example, ... random sample from a listing of individuals is to assign a number to each individual and then select certain numbers by reference to random number tables which are published in standard statistical textbooks. Random number can also ...

  5. Defining and Identifying Members of a Research Study Population: CTSA

    The defined population then will become the basis for applying the research results to other relevant populations. Clearly defining a study population early in the research process also helps assure the overall validity of the study results. Many research reports fail to define or describe a study population adequately.

  6. Who and What Is a Population?

    Methods. In this article, I review the current conventional definitions of, and historical debates over, the meaning(s) of "population," trace back the contemporary emphasis on populations as statistical rather than substantive entities to Adolphe Quetelet's powerful astronomical metaphor, conceived in the 1830s, of l'homme moyen (the average man), and argue for an alternative definition ...

  7. Study Population

    Study Population. Reference work entry; pp 6412-6414; Cite this reference work entry; ... Specifically, defining the study population has received great research attention in medical and clinical study (Friedman et al., 2010; Gerrish & Lacey, 2010; Riegelman, 2005). The characteristics of those being studied are defined by inclusion criteria ...

  8. Understanding Population in Scientific Research: A Comprehensive

    The first step in addressing the population in research is to clearly define the target population. This involves specifying the characteristics of the larger group to which the study's findings will be generalized. The target population should be explicitly defined in terms of relevant factors such as demographic characteristics, geographic ...

  9. About

    Our annual World Population Data Sheet has tracked global population data since 1962. Research Translation. ... Population Reference Bureau 1111 19th St. NW Suite 400 Washington, DC 20036. Phone: 800-877-9881. Email: [email protected]. Workstyle Serviced Offices The Address Building, 7th floor

  10. Mission and Values

    about population reference bureau PRB is a nonpartisan, not-for-profit research organization focused on improving people's health and well-being through evidence-based policies and practices. Our staff analyze population data and ensure the research and its applications are understood and used widely by decisionmakers, advocates, and media.

  11. Geographic population structure analysis of worldwide human populations

    We thus 'anchored' the centre at the nearest reference population and represented the influence of the remaining M-1 nearest reference ... Research Laboratories, bcs Biotech S.r.l., Viale ...

  12. Establishing a Reference Population

    1.3 The P Lisbon1889 Reference Population. This artificially reconstituted population P Lisbon1889 will, therefore, be the basis for estimates of age at death for a set of buried adults. Table 4.2 shows its characteristics for men, women and both sexes combined, by stage and 5-year age group.

  13. Differentiating Between Population and Target Population in Research

    Differentiating Between Population and Target Population in Research Studies. June 2022. International Journal of Medical Science and Clinical Research Studies. DOI: 10.47191/ijmscrs/v2-i6-14 ...

  14. Research Fundamentals: Study Design, Population, and Sample Size

    design, population of interest, study setting, recruit ment, and sampling. Study Design. The study design is the use of e vidence-based. procedures, protocols, and guidelines that provide the ...

  15. Population vs. Sample

    A population is the entire group that you want to draw conclusions about.. A sample is the specific group that you will collect data from. The size of the sample is always less than the total size of the population. In research, a population doesn't always refer to people. It can mean a group containing elements of anything you want to study, such as objects, events, organizations, countries ...

  16. Methodology for research I

    This reference population or target population is the group on which the study outcome would be extrapolated. Once this target population is identified, researcher needs to assess whether it is possible to study all the individuals for an outcome. Usually, all cannot be included, so a study population is sampled.

  17. Population, Land Use, and Environment: Research Directions

    Bilsborrow, R.E. 1987 Population pressures and agricultural development in developing countries: A conceptual framework and recent evidence. World Development 15:183-203.. 1994 Population, Development, and Deforestation: Some Recent Evidence. Pp. 117-134 in Proceedings of the United Nations Expert Group Meeting on Population, Environment, and Development.

  18. Research Population

    A research population is generally a large collection of individuals or objects that is the main focus of a scientific query. It is for the benefit of the population that researches are done. However, due to the large sizes of populations, researchers often cannot test every individual in the population because it is too expensive and time ...

  19. Home

    The 2021 World Population Data Sheet is here! Get the latest numbers for more than 200 countries and territories. ... Technical Director, Demographic Research Charlotte Greenbaum. Policy Advisor ... Population Reference Bureau, 1875 Connecticut Avenue, N.W. Suite 520, Washington, D.C. 20009 Phone: 800-877-9881, Email: [email protected]

  20. Population Growth and Economic Development: Policy Questions

    A Critique of Some Current Strategies, with Special Reference to Africa and Asia. Paper prepared for the IUSSP Seminar on Social Policy, Heald1 Policy, and Mortality Prospects. ... J.L., and R. Gobin (1980) The relationship between population and economic growth in LDOs. Research in Population Economics 2:21S-234. Simon, J.L., and H. Kahn, eds ...

  21. Sampling: how to select participants in my research study?

    The essential topics related to the selection of participants for a health research are: 1) whether to work with samples or include the whole reference population in the study (census); 2) the sample basis; 3) the sampling process and 4) the potential effects nonrespondents might have on study results. We will refer to each of these aspects ...

  22. Sampling Methods: A guide for researchers

    Sampling is a critical element of research design. Different methods can be used for sample selection to ensure that members of the study population reflect both the source and target populations, including probability and non-probability sampling. Power and sample size are used to determine the number of subjects needed to answer the research ...

  23. Agriculture

    This example will serve as a valuable reference point for future field studies. ... Xu, Jian, Shunli Sun, Xiaoting Li, Zhiheng Zeng, Chongyang Han, Ting Tang, and Weibin Wu. 2024. "Research on the Population Flow and Mixing Characteristics of Pelleted Vegetable Seeds Based on the Bonded-Particle Model" Agriculture 14, no. 5: ...

  24. Reference range: Which statistical intervals to use?

    7 Conclusions. The objective of a reference range is to contain a pre-specified large content level (100 P) % of the population with γ confidence level, so that a future observation falling outside the reference range is regarded as atypical and considered for further investigation. This procedure should be useful as part of screening programmes, whose aim is to identify subjects at ...

  25. Introducing The Forrester Reference IT Capability Map

    Introducing The Forrester Reference IT Capability Map. Charles Betz, Research Director. May 9 2024. Today, I am happy to announce the publication of the Forrester Reference IT Capability Map. This is a comprehensive statement of the major capabilities used to define and deliver IT and digital systems, both individually and at a portfolio level.

  26. Sample Size and its Importance in Research

    The sample size for a study needs to be estimated at the time the study is proposed; too large a sample is unnecessary and unethical, and too small a sample is unscientific and also unethical. The necessary sample size can be calculated, using statistical software, based on certain assumptions. If no assumptions can be made, then an arbitrary ...