Structured Sparse Regularization based Random Vector Functional Link Networks for DNA N4-methylcytosine sites predictionShow others and affiliations
2024 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 235, article id 121157Article in journal (Refereed) Published
Abstract [en]
As an epigenetic modification that plays an important role in modifying gene function and controlling gene expression during cell development, DNA N4-methylcytosine (4mC) is still lack of researching. It is therefore necessary to accurately predict the 4mC sites to make fully aware of its mechanism and function. In this paper, we propose a novel model which is called Structural Sparse Regularized Random Vector Functional Link Network (SSR-RVFL) for predicting 4mC sites. Compared with other state-of-the-art methods, SSR-RVFL performs better and achieves higher prediction accuracy. There are total six benchmark datasets used in the experiments, namely C.elegans, D.elanogaster, E.coli, A.thaliana G.subterraneus and G.pickeringii. Our model improves the accuracy by 0.42%, 0.45%, 0.48%, 0.91%, 0.66% and 0.7% on these six benchmark datasets respectively, so it can be regarded as a more effective prediction tool. © 2023 The Author(s)
Place, publisher, year, edition, pages
Oxford: Elsevier, 2024. Vol. 235, article id 121157
Keywords [en]
Biological sequence classification, DNA N4-methylcytosine, Group sparse regularization, Machine learning, Random Vector Functional Link Network
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:hh:diva-51569DOI: 10.1016/j.eswa.2023.121157ISI: 001059505300001Scopus ID: 2-s2.0-85168407671OAI: oai:DiVA.org:hh-51569DiVA, id: diva2:1794024
2023-09-042023-09-042025-10-01Bibliographically approved