Abstract Petroleum contamination presents a significant environmental challenge, contributing to soil and water pollution. Bioremediation provides a sustainable and cost-effective approach. In this study, we isolated and characterized a novel petroleum-degrading strain, Rhodococcus indonesiensis SARSHI1. Whole-genome sequencing of SARSHI1 was conducted using a hybrid sequencing approach, integrating Oxford Nanopore Technologies (ONT) (PromethION) and Illumina (NovaSeq 6000) platforms. The complete genome of SARSHI1 comprises 5.7 Mbp, along with a plasmid of 159,118 bp, encoding a total of 5,150 coding sequences (CDS). The genome consists of 5,695,289 base pairs, with 5,220 identified genes comprising 5,094 protein-coding genes. Additionally, it contains 12 ribosomal RNA (rRNA) genes, 55 transfer RNA (tRNA) genes, one non-coding RNA, one CRISPR array, 56 pseudogenes, and 243 hypothetical proteins. The raw reads obtained were 13,900,477 from Illumina and 2,539,063 from ONT, with processed reads of 13,169,190 and 1,567,736, respectively. Genome assembly achieved 100% completeness, confirming the reconstruction of a fully intact genome without missing sequences. A total of 570 single-copy marker genes were identified, resulting in a coding density of 91.4%. Functional annotation and comparative genomic analysis revealed key genes associated with hydrocarbon degradation, including alkB , ahyA , and almA (Group I) families for long-chain alkane degradation, as well as bph , ben , and xylC clusters for aromatic hydrocarbon degradation under aerobic conditions. Additionally, multiple antibiotic resistance genes, including those conferring resistance to beta-lactams, were identified. Secondary metabolite analysis identified 19 distinct biosynthetic gene clusters (BGCs), encoding variants of known compounds, highlighting the genomic potential for diverse secondary metabolite production. The complete genome sequence has been deposited in GenBank under accession numbers CP180630 (chromosome) and CP180631 (plasmid). The raw sequencing reads have been submitted to the Sequence Read Archive (SRA), NCBI, under accession numbers SRX27520007 (Illumina) and SRX27520006 (ONT).
Date: | 2025-09-26 |
---|
Authors: | Zaman SAU, Sharma K, Nayarisseri A, Khazanehdari KA, Bhuyan R. |
---|
Ref: | Research Square |
---|