Download
Download pre-built protein databases for each biome. These biome-specific protein databases were constructed using open-access 16S rRNA gene amplicon sequencing data. Overall, 26,293 publicly available samples were analyzed to determine the 20 most abundant bacterial genera in 20 different biomes. A total of 299,356 genomes and 145,662,299 protein sequences from these genera were then compiled into ready-to-use databases.