Metapcbin: plasmid/chromosome classification for metagenomic contigs using machine learning techniques

dc.contributor.authorNandasiri, C
dc.contributor.authorAlahakoon, S
dc.contributor.authorDassanayake, G
dc.contributor.authorWickramarachchi, A
dc.contributor.authorPerera, I
dc.contributor.editorRathnayake, M
dc.contributor.editorAdhikariwatte, V
dc.contributor.editorHemachandra, K
dc.date.accessioned2022-10-27T09:55:26Z
dc.date.available2022-10-27T09:55:26Z
dc.date.issued2022-07
dc.description.abstractChromosomes and plasmids are the major carriers of genetic material in microorganisms such as bacteria. Separating chromosomal and plasmid DNA from large datasets is important as plasmids and chromosomes affect functions and other environmental adaptations. Bioinformatics methodologies have been developed for plasmid classification with the advancements in sequencing technologies. The usage of normalized short k-mer counts with machine learning models has been popular in the characterization of plasmids and chromosomes. Furthermore, bio-markers from DNA sequences as features have also been studied in plasmid classification. However, both approaches suffer from the trade-off between precision and recall. MetaPCbin is a plasmid detection tool that combines computational and genetic approaches into a hybrid method of plasmid prediction. MetaPCbin uses an artificial neural network that uses k-mer counts as features and a random forest model that uses biomarkers. MetaPCbin evaluates the precision and the recall of the classification of real-world DNA sequences from the RefSeq database and simulated sequences. The results show that it is capable of performing plasmid classification while maintaining high precision and recall compared to the state of the art. MetaPCbin is freely available at: https://github.com/MetaGSC/MetaPCbinen_US
dc.identifier.citationC. Nandasiri, S. Alahakoon, G. Dassanayake, A. Wickramarachchi and I. Perera, "MetaPCbin: Plasmid/Chromosome Classification for Metagenomic Contigs using Machine Learning Techniques," 2022 Moratuwa Engineering Research Conference (MERCon), 2022, pp. 1-6, doi: 10.1109/MERCon55799.2022.9906214.en_US
dc.identifier.conferenceMoratuwa Engineering Research Conference 2022en_US
dc.identifier.departmentEngineering Research Unit, University of Moratuwaen_US
dc.identifier.doi10.1109/MERCon55799.2022.9906214en_US
dc.identifier.emailchamika.17@cse.mrt.ac.lk
dc.identifier.emailsasindu.17@cse.mrt.ac.lk
dc.identifier.emailgayal.17@cse.mrt.ac.lk
dc.identifier.emailanuradha.wickramarachchi@anu.edu.au
dc.identifier.emailindika@cse.mrt.ac.lk
dc.identifier.facultyEngineeringen_US
dc.identifier.placeMoratuwa, Sri Lankaen_US
dc.identifier.proceedingProceedings of Moratuwa Engineering Research Conference 2022en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/19275
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.urihttps://ieeexplore.ieee.org/document/9906214/en_US
dc.subjectPlasmiden_US
dc.subjectChromosomeen_US
dc.subjectMetagenomicsen_US
dc.subjectBioinformaticsen_US
dc.subjectK-meren_US
dc.titleMetapcbin: plasmid/chromosome classification for metagenomic contigs using machine learning techniquesen_US
dc.typeConference-Full-texten_US

Files

Collections