What is the pyGOMoDo?
The pyGOMoDo library is divided into two main modules: the modeling and docking modules. The modeling module follows the same framework pipeline as the original web-service:
Protein sequence alignment and hidden markov model profile generation with hh-suite(Steinegger et al. 2019).
Homology modeling with the Modeller software(Webb and Sali 2016).
Model quality assessment with QMEANBrane(Studer, Biasini, and Schwede 2014) through the Swiss-Model web-service API(Waterhouse et al. 2018).
The HMM profiles are generated against the most updated version of the UniRef30(Mirdita et al. 2017) database, and the templates are obtained by aligning the generated HMM profile against a database of HMM profiles for the GPCRs whose structures are known. This latter database was built by us from scratch by gathering all the human GPCRs structures deposited on the Protein Data Bank (rcsb.org). All the structures were parsed, cleaned and uniformed, and their HMM profiles were built also against the UniRef30 database(Mirdita et al. 2017). For each pdb structure we have calculated its coverage - percentage of resolved sequence amino acids - and retrieved, when possible, the conformational state of the receptor. Thus, the choice for the best templates for modeling can be done according to the “resolved coverage” of the structure and its conformational state (when possible).
Differently from the original GOMoDo web-service, the docking module of pyGOMoDo works independently from the modeling module. The user can perform molecular docking over a structure modeled with the modeling module, another software or web-server, or even over an experimental structure. The docking module implements the Autodock Vina(Trott and Olson 2010) and rDOCK(Ruiz-Carmona et al. 2014) docking programs in 3 different docking protocols:
A typical Autodock Vina protocol but with automatic selection of the orthosteric binding site (all the autodock Vina setting values can be toggled by the user),
Docking of molecules on the binding site of crystallographic ligand with rDock
Tethered docking of molecules on a crystallographic ligand also with rDOCK. The docking module also implements the ProLIF (Protein-Ligand Interaction Fingerprints)(Bouysset and Fiorucci 2021) tool that allows the analysis of all the ligand-protein interactions.
The pyGOMoDO library was specifically designed having in mind its usage through jupyter notebooks providing the sweet-spot between performance, flexibility and visualization. Tables visualization are rendered by the pandas library and protein visualization by the py3Dmol (Rego and Koes 2015) library.