Have a personal or library account? Click to login

Implementation of Enzyme Family Classification by using Autoencoders in a Study Case with Imbalanced and Underrepresented Classes

Open Access
|Mar 2025

Abstract

In the field of Bioinformatics, the scientific community is fully aware of the challenges associated with enzyme classification. In this study, a novel strategy is proposed based on the use of Anomalous Autoencoders to characterize chitinases belonging to glycoside hydrolases. Python and TensorFlow programming technologies were employed to conduct this analysis. The designed classifier consists of two levels that determine both the enzymatic nature of an amino acid sequence and its corresponding chitinase enzyme family. These levels considered class imbalance and the underrepresentation of those enzyme families in the CAZy.org database. Furthermore, a comprehensive comparison was made with other available software in the field. To represent the amino acid sequences, embeddings generated from the ProtFlash model were used. The results obtained in this study confirm the effectiveness of the proposed implementation compared to the methods EzyPred, ECPred, and Proteinfer.

DOI: https://doi.org/10.14313/jamris-2025-005 | Journal eISSN: 2080-2145 | Journal ISSN: 1897-8649
Language: English
Page range: 42 - 48
Submitted on: Apr 15, 2024
Accepted on: May 20, 2024
Published on: Mar 31, 2025
Published by: Łukasiewicz Research Network – Industrial Research Institute for Automation and Measurements PIAP
In partnership with: Paradigm Publishing Services
Publication frequency: 4 times per year

© 2025 Darian Fernández Gutiérrez, Ariadna Arbolaez Espinosa, Deborah Raquel Galpert Cañizares, María Matilde García Lorenzo, published by Łukasiewicz Research Network – Industrial Research Institute for Automation and Measurements PIAP
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.