DeepBovC2H2-ZF: deep learning-guided prediction and molecular dynamics validation of C2H2 zinc finger transcription factors in Bovidae
Bharati Pandey, Manbir Singh,
DeepBovC2H2-ZF: deep learning-guided prediction and molecular dynamics validation of C2H2 zinc finger transcription factors in Bovidae,
Journal of Genetic Engineering and Biotechnology,
Volume 23, Issue 4,
2025,
100620,
ISSN 1687-157X,
https://doi.org/10.1016/j.jgeb.2025.100620.
(https://www.sciencedirect.com/science/article/pii/S1687157X25001647)
Abstract: C2H2 zinc finger (ZF) transcription factors (TFs) are among the most abundant and versatile regulatory proteins, playing critical roles in development, differentiation, apoptosis, stress response, and immune regulation. In livestock, especially within the Bovidae family, these TFs regulate gene expression linked to economically important traits such as growth, reproduction, milk production, and disease resistance. However, genome-wide identification of C2H2-ZF TFs in Bovidae remains limited due to the lack of specialized computational tools. To address this, we developed DeepBovC2H2-ZF, a deep learning-based framework for predicting C2H2-ZF TFs using only protein sequence information. The model was trained on a curated dataset of validated C2H2-ZF and non-C2H2-ZF TFs, utilizing sequence-derived features that capture the unique domain signatures. DeepBovC2H2-ZF achieved high prediction accuracy, sensitivity, and specificity, outperforming traditional machine learning models. A correctly predicted C2H2-ZF protein, Krüppel-like factor 4 (KLF4), was further validated through molecular docking and three independent molecular dynamics (MD) simulations of both the protein and its DNA-bound complex. The simulations confirmed structural stability and strong DNA-binding affinity, supporting the reliability of DeepBovC2H2-ZF for functional genomics studies in Bovidae.
Keywords: Bovidae; Deep learning; CNN; BiLSTM; KLF4