Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;4(12):1286-94.
doi: 10.1093/gbe/evs100.

A tight link between orthologs and bidirectional best hits in bacterial and archaeal genomes

A tight link between orthologs and bidirectional best hits in bacterial and archaeal genomes

Yuri I Wolf et al. Genome Biol Evol. 2012.

Abstract

Orthologous relationships between genes are routinely inferred from bidirectional best hits (BBH) in pairwise genome comparisons. However, to our knowledge, it has never been quantitatively demonstrated that orthologs form BBH. To test this "BBH-orthology conjecture," we take advantage of the operon organization of bacterial and archaeal genomes and assume that, when two genes in compared genomes are flanked by two BBH show statistically significant sequence similarity to one another, these genes are bona fide orthologs. Under this assumption, we tested whether middle genes in "syntenic orthologous gene triplets" form BBH. We found that this was the case in more than 95% of the syntenic gene triplets in all genome comparisons. A detailed examination of the exceptions to this pattern, including maximum likelihood phylogenetic tree analysis, showed that some of these deviations involved artifacts of genome annotation, whereas very small fractions represented random assignment of the best hit to one of closely related in-paralogs, paralogous displacement in situ, or even less frequent genuine violations of the BBH-orthology conjecture caused by acceleration of evolution in one of the orthologs. We conclude that, at least in prokaryotes, genes for which independent evidence of orthology is available typically form BBH and, conversely, BBH can serve as a strong indication of gene orthology.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—
Fig. 1.—
Schematic of the genome comparison for testing the BBH–orthology conjecture.
F<sc>ig</sc>. 2.—
Fig. 2.—
Dependency of the relationship between the middle genes in syntenic gene triplets on the distance between the compared genomes.
F<sc>ig</sc>. 3.—
Fig. 3.—
Examples of deviations from the predictions of the BBH–orthology conjecture. (A) Haloarcula marismortui–Pyrococcus furiosus, phosphopyruvate hydratase (COG0148). BBH–orthology conjecture violated due to an acceleration of evolution of one of the in situ homologs. (B) Escherichia coli K12–Ralstonia eutropha, NADH:ubiquinone oxidoreductase NuoE (COG1905). BBH–orthology conjecture violated, probably due to an acceleration of evolution of one of the in situ homologs. (C) Escherichia coli K12–Cronobacter turicensis. Outer membrane porin OmpC (COG3203). BBH–orthology conjecture violated, complex evolutionary relationships between multiple paralogs. (D) Escherichia coli K12–Bacillus subtilis, ribosomal protein S14 (COG0199). Compatible with the BBH–orthology conjecture. The in situ homolog is markedly more distant from the query gene in sequence and domain architecture as well as the position in the tree. (E) Escherichia coli K12–R. eutropha, Thiol-disulfide isomerase and thioredoxins (COG0526). Compatible with BBH–orthology conjecture; the best hit and the in situ homolog are closely related in-paralogs in R. eutropha. (F) Escherichia coli K12–R. eutropha, Nitrate reductase alpha subunit (COG5013). Compatible with BBH–orthology conjecture; the best hit and the in situ homolog are closely related in-paralogs in R. eutropha.
F<sc>ig</sc>. 3.—
Fig. 3.—
Examples of deviations from the predictions of the BBH–orthology conjecture. (A) Haloarcula marismortui–Pyrococcus furiosus, phosphopyruvate hydratase (COG0148). BBH–orthology conjecture violated due to an acceleration of evolution of one of the in situ homologs. (B) Escherichia coli K12–Ralstonia eutropha, NADH:ubiquinone oxidoreductase NuoE (COG1905). BBH–orthology conjecture violated, probably due to an acceleration of evolution of one of the in situ homologs. (C) Escherichia coli K12–Cronobacter turicensis. Outer membrane porin OmpC (COG3203). BBH–orthology conjecture violated, complex evolutionary relationships between multiple paralogs. (D) Escherichia coli K12–Bacillus subtilis, ribosomal protein S14 (COG0199). Compatible with the BBH–orthology conjecture. The in situ homolog is markedly more distant from the query gene in sequence and domain architecture as well as the position in the tree. (E) Escherichia coli K12–R. eutropha, Thiol-disulfide isomerase and thioredoxins (COG0526). Compatible with BBH–orthology conjecture; the best hit and the in situ homolog are closely related in-paralogs in R. eutropha. (F) Escherichia coli K12–R. eutropha, Nitrate reductase alpha subunit (COG5013). Compatible with BBH–orthology conjecture; the best hit and the in situ homolog are closely related in-paralogs in R. eutropha.
F<sc>ig</sc>. 3.—
Fig. 3.—
Examples of deviations from the predictions of the BBH–orthology conjecture. (A) Haloarcula marismortui–Pyrococcus furiosus, phosphopyruvate hydratase (COG0148). BBH–orthology conjecture violated due to an acceleration of evolution of one of the in situ homologs. (B) Escherichia coli K12–Ralstonia eutropha, NADH:ubiquinone oxidoreductase NuoE (COG1905). BBH–orthology conjecture violated, probably due to an acceleration of evolution of one of the in situ homologs. (C) Escherichia coli K12–Cronobacter turicensis. Outer membrane porin OmpC (COG3203). BBH–orthology conjecture violated, complex evolutionary relationships between multiple paralogs. (D) Escherichia coli K12–Bacillus subtilis, ribosomal protein S14 (COG0199). Compatible with the BBH–orthology conjecture. The in situ homolog is markedly more distant from the query gene in sequence and domain architecture as well as the position in the tree. (E) Escherichia coli K12–R. eutropha, Thiol-disulfide isomerase and thioredoxins (COG0526). Compatible with BBH–orthology conjecture; the best hit and the in situ homolog are closely related in-paralogs in R. eutropha. (F) Escherichia coli K12–R. eutropha, Nitrate reductase alpha subunit (COG5013). Compatible with BBH–orthology conjecture; the best hit and the in situ homolog are closely related in-paralogs in R. eutropha.

Similar articles

Cited by

References

    1. Altenhoff AM, Dessimoz C. Phylogenetic and functional assessment of orthologs inference projects and methods. PLoS Comput Biol. 2009;5:e1000262. - PMC - PubMed
    1. Altenhoff AM, Studer RA, Robinson-Rechavi M, Dessimoz C. Resolving the ortholog conjecture: orthologs tend to be weakly, but significantly, more similar in function than paralogs. PLoS Comput Biol. 2012;8:e1002514. - PMC - PubMed
    1. Altschul SF, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. - PMC - PubMed
    1. Dewey CN. Positional orthology: putting genomic evolutionary relationships into context. Brief Bioinform. 2011;12:401–412. - PMC - PubMed
    1. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. - PMC - PubMed

Publication types