Teresa Head-Gordon Lab's Avatar

Teresa Head-Gordon Lab

@thglab

Student-run THG Lab account @UC Berkeley. We develop physics-based and machine learning-based models for various systems.

72
Followers
40
Following
7
Posts
20.03.2025
Joined
Posts Following

Latest posts by Teresa Head-Gordon Lab @thglab

Preview
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind

Check it out, and feel free to drop your questions here or on GitHub!
πŸ”— GitHub: github.com/THGLab/HiQBind
πŸ“„ Paper: pubs.rsc.org/en/content/a...

#machinelearning #proteinligand #proteinligandbindingaffinity #structuralbiology #ai4sci

07.04.2025 23:49 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind

Huge shoutout to the amazing team behind this:
πŸ‘ Lead author Yingze (Eric) Wang and @kunyangsun.bsky.social
πŸ‘ PI Prof. Teresa Head-Gordon
πŸ‘ Teammates Jie Li, Xingyi Guan, Oufan Zhang, Dorian Bagni
πŸ‘ Collaborators Dr. Heather A. Carlson and Prof. Yang Zhang

07.04.2025 23:49 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind

What’s next?

We’re exploring:
πŸ” Rotamer refinement
πŸ€– Binding label extraction with LLMs (maybe πŸ‘€)
🧠 Better data splits (possibly inspired by PLINDER) to support ML research!

07.04.2025 23:49 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind

Since we're focused on structural data with binding labels, we applied this workflow to major open-access datasets (BioLiP, BindingDB, and BindingMOAD) to generate HiQBind: a cleaned, corrected dataset comparable in size to PDBBind v2020 but with significantly improved structural quality! πŸ’₯

07.04.2025 23:49 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
GitHub - THGLab/HiQBind: Workflow to clean up and fix structural problems in protein-ligand binding datasets Workflow to clean up and fix structural problems in protein-ligand binding datasets - THGLab/HiQBind

In this work, we built HiQBind-workflow, a semi-automated workflow that processes protein–ligand structures from the RCSB PDB by adding missing atoms, correcting ligand geometries, fixing bond orders and protonation states, and much more!

07.04.2025 23:49 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

Copying Oliver's post from Linkedin to help us gain some visibility here!

🚨 Our paper is out! 🚨
"A workflow to create a high-quality protein–ligand binding dataset for training, validation, and prediction tasks" is now published in Digital Discovery! πŸŽ‰

07.04.2025 23:49 πŸ‘ 2 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

We have left X for greener pastures and bluer skies - good riddance to Nazi-saluting Musk and his abuse of science and engineering that in fact enriched him.

20.03.2025 00:43 πŸ‘ 7 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0