Objective To process, annotate and analyze the genomic data of the high-temperature-resistant strain Bacillus amyloliticus BA-DES4 using bioinformatics software and databases, and to functionally annotate the strain's characteristics at the gene level. Methods The predicted gene sequences were compared against the Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO), Clusters of Orthologous Groups of proteins (COG), non-redundant protein (NR) and other databases to obtain a gene function annotation table. The protein sequences of the predicted genes were aligned against the COG, KEGG and GO databases with BLASTP to predict gene annotations and functions. Results The genome comprised 4188731 bases with a GC content of 46.18%. It contained 4445 protein-coding genes with a total length of 3696380 bp. Functional annotation of the protein-coding genes showed that the largest share of COG-annotated genes belonged to category G (carbohydrate transport and metabolism), followed by category K (transcription), indicating that the COG-annotated protein-coding genes are mainly involved in basic cellular functions. GO annotation showed that most annotated genes were classified under biological processes. Carbohydrate-active enzymes (CAZy) annotation identified hydrolytic enzymes as the most prevalent category. KEGG annotation showed that carbohydrate metabolism contained the highest number of genes, and that within the environmental information processing category, signal transduction accounted for the largest share. In the high-temperature-resistant Bacillus amyloliticus BA-DES4, 11 genes were found to encode cellulases, including genes encoding β-glucosidase and endoglucanase.
Conclusion The genomic data of Bacillus amyloliticus BA-DES4 were processed, annotated and analyzed to further explore the research potential of the strain, to better investigate the regulatory mechanism of cellulase production in Bacillus amyloliticus BA-DES4, and to provide a theoretical basis for subsequent experiments.
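
The summary statistics reported in the Results can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' pipeline: the function name `gc_content` and the derived quantities (coding density, mean gene length) are illustrative assumptions, and only the three input figures come from the abstract.

```python
def gc_content(seq: str) -> float:
    """Return the GC percentage of a nucleotide sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in ("G", "C"))
    return 100.0 * gc / len(seq)


# Figures reported in the abstract.
GENOME_LENGTH_BP = 4_188_731   # total number of bases
CODING_LENGTH_BP = 3_696_380   # total length of protein-coding genes
N_CODING_GENES = 4_445         # number of protein-coding genes

# Derived quantities (assumed to be of interest; not stated in the source).
coding_density = 100.0 * CODING_LENGTH_BP / GENOME_LENGTH_BP
mean_gene_length = CODING_LENGTH_BP / N_CODING_GENES

if __name__ == "__main__":
    print(f"Coding density: {coding_density:.2f}% of the genome")
    print(f"Mean gene length: {mean_gene_length:.1f} bp")
    # Toy sequence only, to show how gc_content would be applied to a real assembly.
    print(f"GC content of toy sequence: {gc_content('ATGCGCGTTA'):.2f}%")
```

Applied to the full assembled sequence, `gc_content` would be expected to reproduce the reported 46.18%, assuming the same sequence and no ambiguous bases.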

| Family | Number of genera | Number of species | Percentage of total species (%) | Genus | Number of species | Percentage of total species (%) |
|---|---|---|---|---|---|---|
| Amanitaceae | 2 | 11 | 5.26 | Amanita | 10 | 4.78 |
| Mycenaceae | 2 | 12 | 5.74 | Inocybe | 5 | 2.39 |
| Polyporaceae | 8 | 14 | 6.70 | Laccaria | 5 | 2.39 |
| Russulaceae | 3 | 23 | 11.00 | Marasmius | 6 | 2.87 |
| | | | | Mycena | 11 | 5.26 |
| | | | | Pluteus | 5 | 2.39 |
| | | | | Russula | 17 | 8.13 |
| | | | | Trametes | 5 | 2.39 |