Kernkonzepte
A computational framework combining molecular docking and machine learning regression models can effectively identify potential inhibitors against the multi-target proteins of COVID-19, including Spike, 3CLpro, and Nucleocapsid.
Zusammenfassung
This study proposes a computational framework to investigate potential inhibitors against COVID-19 through drug repurposing strategies. The framework consists of three key modules:
-
Data Preparation and Molecular Docking:
- The study targets the Spike, 3CLpro, and Nucleocapsid proteins of SARS-CoV-2 as they are essential for viral entry, replication, and assembly.
- A dataset of 5,903 approved drugs from the ZINC database was preprocessed and used for molecular docking against the target proteins.
-
QSAR Modeling:
- Multiple machine learning regression models, including Decision Tree Regression (DTR), were developed to predict the binding affinities of the drug compounds towards the target proteins.
- The DTR model outperformed other regression models, achieving R^2 values of 0.95, 0.97, and 0.93, and RMSE values of 1.66, 1.57, and 1.49 for Spike, 3CLpro, and Nucleocapsid, respectively, using the MACCS fingerprint feature set.
-
Molecular Docking and Drug Analysis:
- The molecular docking analysis identified five promising drug candidates with binding affinities ranging from -19.7 to -12.6 kcal/mol.
- The physicochemical properties of the selected drug candidates were analyzed to assess their potential efficacy and safety in biological systems.
The study demonstrates the effectiveness of the proposed computational framework in identifying potential COVID-19 inhibitors through drug repurposing, leveraging the combination of molecular docking and machine learning regression models. The findings provide a foundation for further in vitro and in vivo investigations to validate the identified drug candidates as potential COVID-19 therapeutics.
Statistiken
The binding affinities (in kcal/mol) of the top five drug candidates towards the target proteins are:
ZINC003873365: -19.7 (Spike), -15.1 (3CLpro), -19.2 (Nucleocapsid)
ZINC085432544: -15.3 (Spike), -14.4 (3CLpro), -14.0 (Nucleocapsid)
ZINC085536956: -14.4 (Spike), -13.9 (3CLpro), -12.6 (Nucleocapsid)
ZINC008214470: -15.3 (Spike), -13.6 (3CLpro), -14.3 (Nucleocapsid)
ZINC261494640: -13.9 (Spike), -13.6 (3CLpro), -14.4 (Nucleocapsid)
Zitate
"Our findings with best scores of R2 and RMSE demonstrated that our proposed Decision Tree Regression (DTR) model is the most appropriate model to explore the potential inhibitors."
"We proposed five novel promising inhibitors with their respective Zinc IDs ZINC (3873365, 85432544, 8214470, 85536956, and 261494640) within the range of -19.7 kcal/mol to -12.6 kcal/mol."