ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation
- PMID: 28053163
- PMCID: PMC5210634
- DOI: 10.1093/nar/gkw934
ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation
Abstract
The Alignable Tight Genomic Clusters (ATGCs) database is a collection of closely related bacterial and archaeal genomes that provides several tools to aid research into evolutionary processes in the microbial world. Each ATGC is a taxonomy-independent cluster of 2 or more completely sequenced genomes that meet the objective criteria of a high degree of local gene order (synteny) and a small number of synonymous substitutions in the protein-coding genes. As such, each ATGC is suited for analysis of microevolutionary variations within a cohesive group of organisms (e.g. species), whereas the entire collection of ATGCs is useful for macroevolutionary studies. The ATGC database includes many forms of pre-computed data, in particular ATGC-COGs (Clusters of Orthologous Genes), multiple sequence alignments, a set of 'index' orthologs representing the most well-conserved members of each ATGC-COG, the phylogenetic tree of the organisms within each ATGC, etc. Although the ATGC database contains several million proteins from thousands of genomes organized into hundreds of clusters (roughly a 4-fold increase since the last version of the ATGC database), it is now built with completely automated methods and will be regularly updated following new releases of the NCBI RefSeq database. The ATGC database is hosted jointly at the University of Iowa at dmk-brain.ecn.uiowa.edu/ATGC/ and the NCBI at ftp.ncbi.nlm.nih.gov/pub/kristensen/ATGC/atgc_home.html.
Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Figures



Similar articles
-
ATGC: a database of orthologous genes from closely related prokaryotic genomes and a research platform for microevolution of prokaryotes.Nucleic Acids Res. 2009 Jan;37(Database issue):D448-54. doi: 10.1093/nar/gkn684. Epub 2008 Oct 9. Nucleic Acids Res. 2009. PMID: 18845571 Free PMC article.
-
Prokaryotic Virus Orthologous Groups (pVOGs): a resource for comparative genomics and protein family annotation.Nucleic Acids Res. 2017 Jan 4;45(D1):D491-D498. doi: 10.1093/nar/gkw975. Epub 2016 Oct 26. Nucleic Acids Res. 2017. PMID: 27789703 Free PMC article.
-
COG database update 2024.Nucleic Acids Res. 2025 Jan 6;53(D1):D356-D363. doi: 10.1093/nar/gkae983. Nucleic Acids Res. 2025. PMID: 39494517 Free PMC article.
-
DOOR: a prokaryotic operon database for genome analyses and functional inference.Brief Bioinform. 2019 Jul 19;20(4):1568-1577. doi: 10.1093/bib/bbx088. Brief Bioinform. 2019. PMID: 28968679 Review.
-
Microbial genome analysis: the COG approach.Brief Bioinform. 2019 Jul 19;20(4):1063-1070. doi: 10.1093/bib/bbx117. Brief Bioinform. 2019. PMID: 28968633 Free PMC article. Review.
Cited by
-
Purifying and positive selection in the evolution of stop codons.Sci Rep. 2018 Jun 18;8(1):9260. doi: 10.1038/s41598-018-27570-3. Sci Rep. 2018. PMID: 29915293 Free PMC article.
-
Assessment of assumptions underlying models of prokaryotic pangenome evolution.BMC Biol. 2021 Feb 10;19(1):27. doi: 10.1186/s12915-021-00960-2. BMC Biol. 2021. PMID: 33563283 Free PMC article.
-
No Evidence that Selection on Synonymous Codon Usage Affects Patterns of Protein Evolution in Bacteria.Genome Biol Evol. 2024 Feb 1;16(2):evad232. doi: 10.1093/gbe/evad232. Genome Biol Evol. 2024. PMID: 38149940 Free PMC article.
-
In eubacteria, unlike eukaryotes, there is no evidence for selection favouring fail-safe 3' additional stop codons.PLoS Genet. 2019 Sep 17;15(9):e1008386. doi: 10.1371/journal.pgen.1008386. eCollection 2019 Sep. PLoS Genet. 2019. PMID: 31527909 Free PMC article.
-
Impact of C-terminal amino acid composition on protein expression in bacteria.Mol Syst Biol. 2020 May;16(5):e9208. doi: 10.15252/msb.20199208. Mol Syst Biol. 2020. PMID: 32449593 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials