全球非冗余微生物基因集GMGCv1的构建
0. 原文信息
标题:Towards the biogeography of prokaryotic genes
期刊:Nature, December 2021
第一作者:Luis Pedro Coelho
通讯作者:Luis Pedro Coelho
作者单位:Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
doi: 10.1038/s41586-021-04233-4
1. 摘要
Microbial genes encode the majority of the functional repertoire of life on earth. However, despite increasing efforts in metagenomic sequencing of various habitats, little is known about the distribution of genes across the global biosphere, with implications for human and planetary health (对全球生物圈的基因分布情况知之甚少,其对于人体或地球环境卫生健康的影响也是如此。). Here we constructed a non-redundant gene catalogue of 303 million species-level genes (clustered at 95% nucleotide identity) from 13,174 publicly available metagenomes across 14 major habitats and use it to show that most genes are specific to a single habitat. The small fraction of genes found in multiple habitats is enriched in antibiotic-resistance genes and markers for mobile genetic elements. By further clustering these species-level genes into 32