ORCID
0000-0001-6658-6591
Keywords
mRNA, AI, Geometric Deep Learning, DTI, Representation Learning, fairness
Abstract
The rapid emergence of global health crises, exemplified by the Coronavirus Disease 2019 (COVID-19) pandemic, underscores the profound limitations of conventional drug discovery methods, which are inherently protracted and resource-intensive. A robust, accelerated response framework demands the sophisticated deployment of Artificial Intelligence (AI) and Machine Learning (ML) across the entire therapeutic pipeline. We propose a unified approach centered on \textbf{Structural and Representation Learning} to capture the intricate topological and chemical nuances of biological systems, thereby transforming complex molecular and genomic data into effective vector embeddings suitable for deep learning, enhancing predictive power, and crucially, interpretability. This dissertation details advancements across the entire therapeutic development cycle, strictly following the flow: Drug-Target Interaction (DTI) Drug-Target Affinity (DTA) De Novo Drug Generation mRNA Sequence Modeling. For the initial stages, we developed structure-augmented, attention-based models, such as \textit{AttentionSiteDTI} and \textit{FragXsiteDTI}, to refine repurposing efforts by integrating predicted binding site information and fragment-level interpretability. Moving to novel design, we established generative pipelines, including, leveraging diffusion models optimized via Multi-Objective Optimization (MOO) to synthesize novel, high-quality, target-specific ligands for SARS-CoV-2 targets. Finally, we addressed the design of next-generation prophylaxis by introducing advanced language models, such as \textit{Equi-mRNA}, which explicitly encode hierarchical and group-theoretic biological priors (e.g., equivariant codon symmetry) to optimize mRNA stability and translational efficiency for vaccine development. The entire framework was validated against the urgent demands of the COVID-19 crisis, providing a comprehensive toolkit for rapid drug repurposing, \textit{de novo} antiviral design, and accelerated mRNA vaccine optimization. We further generalize the utility of MOO through the \textit{FairBiNN} framework to manage trade-offs inherent in translational data science, such as balancing predictive accuracy with fairness metrics. This work presents a substantial contribution to computational drug design, demonstrating that explicitly encoding structural and biological knowledge through advanced representation learning techniques is paramount for building robust, interpretable, and generalizable AI systems capable of accelerating pandemic response.
Completion Date
2025
Semester
Fall
Committee Chair
Ozmen Garibay, Ozlem
Degree
Doctor of Philosophy (Ph.D.)
College
College of Engineering and Computer Science
Department
Computer Science
Format
Identifier
DP0029720
Document Type
Thesis
Campus Location
Orlando (Main) Campus
STARS Citation
Yazdani-Jahromi, Mehdi, "Advancing Drug Discovery with Structural and Representation Learning of Biological Systems" (2025). Graduate Thesis and Dissertation post-2024. 515.
https://stars.library.ucf.edu/etd2024/515