GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets
Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind
Huge shoutout to the amazing team behind this:
π Lead author Yingze (Eric) Wang and @kunyangsun.bsky.social
π PI Prof. Teresa Head-Gordon
π Teammates Jie Li, Xingyi Guan, Oufan Zhang, Dorian Bagni
π Collaborators Dr. Heather A. Carlson and Prof. Yang Zhang
07.04.2025 23:49
π 1
π 0
π¬ 1
π 0
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets
Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind
Whatβs next?
Weβre exploring:
π Rotamer refinement
π€ Binding label extraction with LLMs (maybe π)
π§ Better data splits (possibly inspired by PLINDER) to support ML research!
07.04.2025 23:49
π 0
π 0
π¬ 1
π 0
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets
Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind
Since we're focused on structural data with binding labels, we applied this workflow to major open-access datasets (BioLiP, BindingDB, and BindingMOAD) to generate HiQBind: a cleaned, corrected dataset comparable in size to PDBBind v2020 but with significantly improved structural quality! π₯
07.04.2025 23:49
π 1
π 0
π¬ 1
π 0
Copying Oliver's post from Linkedin to help us gain some visibility here!
π¨ Our paper is out! π¨
"A workflow to create a high-quality proteinβligand binding dataset for training, validation, and prediction tasks" is now published in Digital Discovery! π
07.04.2025 23:49
π 2
π 1
π¬ 1
π 0
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets
Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind
Check it out, and feel free to drop your questions here or on GitHub!
π GitHub: github.com/THGLab/HiQBind
π Paper: pubs.rsc.org/en/content/a...
#machinelearning #proteinligand #proteinligandbindingaffinity #structuralbiology #ai4sci
07.04.2025 23:49
π 0
π 0
π¬ 0
π 0
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets
Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind
In this work, we built HiQBind-workflow, a semi-automated workflow that processes proteinβligand structures from the RCSB PDB by adding missing atoms, correcting ligand geometries, fixing bond orders and protonation states, and much more!
07.04.2025 23:49
π 0
π 0
π¬ 1
π 0
We have left X for greener pastures and bluer skies - good riddance to Nazi-saluting Musk and his abuse of science and engineering that in fact enriched him.
20.03.2025 00:43
π 7
π 0
π¬ 1
π 0