Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Apr 19;6(1):10.
doi: 10.1186/1748-7188-6-10.

Haplotypes versus genotypes on pedigrees

Affiliations

Haplotypes versus genotypes on pedigrees

Bonnie B Kirkpatrick. Algorithms Mol Biol. .

Abstract

Background: Genome sequencing will soon produce haplotype data for individuals. For pedigrees of related individuals, sequencing appears to be an attractive alternative to genotyping. However, methods for pedigree analysis with haplotype data have not yet been developed, and the computational complexity of such problems has been an open question. Furthermore, it is not clear in which scenarios haplotype data would provide better estimates than genotype data for quantities such as recombination rates.

Results: To answer these questions, a reduction is given from genotype problem instances to haplotype problem instances, and it is shown that solving the haplotype problem yields the solution to the genotype problem, up to constant factors or coefficients. The pedigree analysis problems we will consider are the likelihood, maximum probability haplotype, and minimum recombination haplotype problems.

Conclusions: Two algorithms are introduced: an exponential-time hidden Markov model (HMM) for haplotype data where some individuals are untyped, and a linear-time algorithm for pedigrees having haplotype data for all individuals. Recombination estimates from the general haplotype HMM algorithm are compared to recombination estimates produced by a genotype HMM. Having haplotype data on all individuals produces better estimates. However, having several untyped individuals can drastically reduce the utility of haplotype data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Genotype and Haplotype Pedigrees. Genotyped individuals are shaded, and all the individuals are labeled. Individuals 1, 2, and 5 are the founders, and individual 6 is the grandchild of 1 and 2.
Figure 2
Figure 2
Haplotype Pedigrees. Haplotyped individuals are shaded, and individuals have the same labels. For each of the genotyped individuals, i, from the previous figure, the mapping adds a nuclear family containing five new individuals labeled i0, i1, i2, i3, i4.
Figure 3
Figure 3
Predicting Recombinations for Half-Siblings. This is the average accuracy for predictions from a pedigree with two half-siblings and three parents. Five hundred simulation replicates were performed, and the average accuracy of estimates from the haplotype data is superior to those from genotype data. However, as the number of untyped founders increases, in both cases, the accuracy of estimates from haplotype data drop relative to the accuracy from genotype data. The accuracies of genotype and haplotype estimates appear to converge.
Figure 4
Figure 4
Predicting Recombinations for Three Generations. This figure shows accuracy results from a six-individual, three-generation pedigree. Again, five hundred simulation replicates were performed, and the average accuracy of estimates from the haplotype data is superior to those from genotype data. Once again, as the number of untyped founders increases, the accuracy of estimates from haplotype data drop relative to the accuracy from genotype data. The accuracies of genotype and haplotype estimates appear to converge.

Similar articles

Cited by

References

    1. Coop G, Wen X, Ober C, Pritchard J, Przeworski M. High-Resolution Mapping of Crossovers Reveals Extensive Variation in Fine-Scale Recombination Patterns Among Humans. Science. 2008;319(5868):1395–1398. doi: 10.1126/science.1151851. - DOI - PubMed
    1. MY N, DF L. et al.Meta-analysis of 32 genome-wide linkage studies of schizophrenia. Mol Psychiatry. 2009;14:774–85. doi: 10.1038/mp.2008.135. - DOI - PMC - PubMed
    1. Romero I, Ober C. CFTR mutations and reproductive outcomes in a population isolate. Human Genet. 2008;122:583–588. doi: 10.1007/s00439-007-0432-1. - DOI - PubMed
    1. Geiger D, Meek C, Wexler Y. Speeding up HMM algorithms for genetic linkage analysis via chain reductions of the state space. Bioinformatics. 2009;25(12):i196. doi: 10.1093/bioinformatics/btp224. - DOI - PMC - PubMed
    1. Xiao J, Liu L, Xia L, Jiang T. Efficient Algorithms for Reconstructing Zero-Recombinant Haplotypes on a Pedigree Based on Fast Elimination of Redundant Linear Equations. SIAM Journal on Computing. 2009;38:2198. doi: 10.1137/070687591. - DOI