-
Tytuł:
-
Ancestry inference using reference labeled clusters of haplotypes.
-
Autorzy:
-
Wang Y; AncestryDNA, San Francisco, CA, 94107, USA.
Song S; AncestryDNA, San Francisco, CA, 94107, USA.
Schraiber JG; AncestryDNA, San Francisco, CA, 94107, USA.
Sedghifar A; AncestryDNA, San Francisco, CA, 94107, USA.
Byrnes JK; AncestryDNA, San Francisco, CA, 94107, USA.
Turissini DA; AncestryDNA, San Francisco, CA, 94107, USA.
Hong EL; AncestryDNA, San Francisco, CA, 94107, USA.
Ball CA; AncestryDNA, San Francisco, CA, 94107, USA.
Noto K; AncestryDNA, San Francisco, CA, 94107, USA. .
-
Źródło:
-
BMC bioinformatics [BMC Bioinformatics] 2021 Sep 25; Vol. 22 (1), pp. 459. Date of Electronic Publication: 2021 Sep 25.
-
Typ publikacji:
-
Journal Article
-
Język:
-
English
-
Imprint Name(s):
-
Original Publication: [London] : BioMed Central, 2000-
-
MeSH Terms:
-
Genetics, Population*
Genome, Human*
Haplotypes ; Humans ; Polymorphism, Single Nucleotide
-
References:
-
Genetics. 2012 Jun;191(2):607-19. (PMID: 22491189)
Nature. 2010 Oct 28;467(7319):1061-73. (PMID: 20981092)
Science. 2008 Feb 22;319(5866):1100-4. (PMID: 18292342)
Genetics. 2013 Apr;193(4):1233-54. (PMID: 23410830)
Genetics. 2003 Dec;165(4):2213-33. (PMID: 14704198)
PLoS Genet. 2009 Jun;5(6):e1000519. (PMID: 19543370)
Genome Res. 2009 Sep;19(9):1655-64. (PMID: 19648217)
Nat Genet. 2004 May;36(5):512-7. (PMID: 15052271)
BMC Genet. 2020 Apr 7;21(1):40. (PMID: 32264823)
Am J Hum Genet. 2013 Aug 8;93(2):278-88. (PMID: 23910464)
Nat Genet. 2016 Oct;48(10):1279-83. (PMID: 27548312)
Am J Hum Genet. 2007 Nov;81(5):1084-97. (PMID: 17924348)
Nature. 2007 Oct 18;449(7164):851-61. (PMID: 17943122)
PLoS Genet. 2012 Jan;8(1):e1002453. (PMID: 22291602)
Nat Genet. 2016 Nov;48(11):1443-1448. (PMID: 27694958)
Nat Genet. 2006 Aug;38(8):904-9. (PMID: 16862161)
-
Contributed Indexing:
-
Keywords: ARCHes; Ancestry inference; HMM; Haplotype modeling; Local ancestry; RFMix
-
Entry Date(s):
-
Date Created: 20210926 Date Completed: 20210928 Latest Revision: 20210930
-
Update Code:
-
20240105
-
PubMed Central ID:
-
PMC8466715
-
DOI:
-
10.1186/s12859-021-04350-x
-
PMID:
-
34563119
-
Background: We present ARCHes, a fast and accurate haplotype-based approach for inferring an individual's ancestry composition. Our approach works by modeling haplotype diversity from a large, admixed cohort of hundreds of thousands, then annotating those models with population information from reference panels of known ancestry.
Results: The running time of ARCHes does not depend on the size of a reference panel because training and testing are separate processes, and the inferred population-annotated haplotype models can be written to disk and reused to label large test sets in parallel (in our experiments, it averages less than one minute to assign ancestry from 32 populations using 10 CPU). We test ARCHes on public data from the 1000 Genomes Project and the Human Genome Diversity Project (HGDP) as well as simulated examples of known admixture.
Conclusions: Our results demonstrate that ARCHes outperforms RFMix at correctly assigning both global and local ancestry at finer population scales regardless of the amount of population admixture.
(© 2021. The Author(s).)
Zaloguj się, aby uzyskać dostęp do pełnego tekstu.