A deep multiple kernel learning-based higher-order fuzzy inference system for identifying DNA N4-methylcytosine sitesShow others and affiliations
2023 (English)In: Information Sciences, ISSN 0020-0255, E-ISSN 1872-6291, Vol. 630, p. 40-52Article in journal (Refereed) Published
Abstract [en]
N4-methylcytosine (4mC), as a DNA modification, plays a crucial role in epigenetic regulation. However, the existing experimentation methods for accurately identifying 4mC sites are inefficient and highly consumable, making them difficult to implement. Although a variety of new identification methods are continuously being proposed, existing techniques are not yet fully mature. Compared to traditional 4mC site predictors, based on support vector machine or convolutional neural network, we present an alternative computational approach. In this study, we propose a method based on a kernelized higher-order fuzzy inference system (KHFIS) and deep multiple kernel learning, called DMKL-HFIS, to improve the accuracy of 4mC site identification DNA sequences. We use PSTNP to process the benchmark datasets, and then apply KHFIS to obtain multiple fuzzy kernel matrices. A deep neural network is used to fuse multiple fuzzy kernel matrices. Finally, the predicted value is derived from the fused matrix. Our approach was compared with existing mainstream computational methods. On the benchmark datasets (G. subterraneus, D. melanogaster, E. coli, A. thaliana, and C. elegans), the accuracy of our approach exceeded that of a state-of-the-art method by 0.4%, 0.44%, 1.51%, 0.55%, and 0.25%, respectively. Compared to mainstream methods, our approach exhibits a higher level of accuracy and can therefore be considered an effective prediction tool. © 2023 Elsevier Inc. All rights reserved.
Place, publisher, year, edition, pages
Philadelphia, PA: Elsevier, 2023. Vol. 630, p. 40-52
Keywords [en]
DNA N4-methylcytosine, Bioinformatics, Fuzzy inference system, Deep learning, Multiple kernel learning
National Category
Computer Systems Cell and Molecular Biology
Identifiers
URN: urn:nbn:se:hh:diva-50085DOI: 10.1016/j.ins.2023.01.149ISI: 000962431700001Scopus ID: 2-s2.0-85148324707OAI: oai:DiVA.org:hh-50085DiVA, id: diva2:1742248
Note
Funding: The National Natural Science Foundation of China (NSFC 62172296, 62250028, U22A2038, 62172076, and 61972280), Zhejiang Provincial Natural Science Foundation of China (Grant No. LY23F020003), the Municipal Government of Quzhou (Grant No. 2022D006), the Excellent Young Scientists Fund in Hunan Province (2022JJ20077), and the Scientific Research Fund of the Hunan Provincial Education Department (22A0007). The work of Victor Hugo C. de Albuquerque has been supported by the CNPq via Grant No. 305517/2022-8.
2023-03-082023-03-082023-08-21Bibliographically approved