Gabriel Rocklin (@grocklin)

New lab preprint: if ML struggles with extrapolation, let's expand the diversity of training data with gene synth, DNA shuffling, and ML gen.... also lots of #FluorescentProteins #ProteinEngineering #MachineLearning #SyntheticBiology 1/n

03.03.2026 19:39 👍 11 🔁 5 💬 1 📌 0

High Diversity Gene Libraries Facilitate Machine Learning Guided Exploration of Fluorescent Protein Sequence Space https://www.biorxiv.org/content/10.64898/2026.03.01.706892v1

03.03.2026 04:04 👍 3 🔁 2 💬 0 📌 1

I’m looking to hire a research technician for my lab at Harvard & DFCI, who would primarily work in the wet lab expressing and characterizing designed proteins, starting this summer. A great role for a recent college grad looking for an immersive research experience before grad school.

20.02.2026 14:33 👍 11 🔁 13 💬 1 📌 0

Here’s a preprint from the Keating Lab: www.biorxiv.org/content/10.6.... Foster Birnbaum and Amy E. Keating demonstrate that sequence design models, such as ProteinMPNN, are limited because they were trained only on native sequences.

18.01.2026 06:23 👍 9 🔁 4 💬 1 📌 0

Yeah this seems terrible. On the other hand, if funding levels stay ~constant, I wonder if this will be undone in the future, leading to a kind of random jump up in the number of projects supported for a few years (far above even 2024) in ~2030? Not that that's much to get excited about.

20.01.2026 18:41 👍 1 🔁 0 💬 1 📌 0

Nope, just trying to force some honesty from the side that introduced a very dishonest mechanism of cutting budgets. Fruitless I know

16.01.2026 18:42 👍 1 🔁 0 💬 0 📌 0

I know negotiating with a bunch of bad-faithed liars can be a waste of time, but has anyone proposed a compromise where we switch to MYF but allocate extra funds so that the transition is net neutral to the # of awards? Seems like that would clarify what we're supposedly debating here.

16.01.2026 18:05 👍 0 🔁 0 💬 2 📌 0

Gonna be funny when people get rejected for being at the wrong career stage (like the K99)- “PI is WAY overqualified for retirement”. “It is not clear PI will benefit from additional research prior to retirement”. “PI not qualified for retirement, hasn’t graduated enough students to be competitive.”

07.01.2026 19:52 👍 12 🔁 0 💬 0 📌 0

Congratulations!!

01.01.2026 16:27 👍 1 🔁 0 💬 0 📌 0

The final Calvin and Hobbes, which appeared in papers 30 years ago today.

31.12.2025 17:00 👍 13111 🔁 4265 💬 126 📌 290

Large scale prospective evaluation of co-folding across 557 Mac1-ligand complexes and three virtual screens

Figure 1

Figure 2

Figure 3

Large scale prospective evaluation of co-folding across 557 Mac1-ligand complexes and three virtual screens [new]
Mac1 co-folding eval: Pose prediction, conform. change, & hit ID post-training.

29.12.2025 02:37 👍 5 🔁 3 💬 0 📌 0

The most precious commodity you have is your attention. You don’t have to waste it on poor-faith debates or arguments with strangers if you don’t think they’ll be productive. You can prioritize the things that matter to you and make your life richer.

30.11.2025 20:00 👍 11825 🔁 3005 💬 131 📌 196

Immunopeptidomics can inform the design of mRNA vaccines for the delivery of Mycobacterium tuberculosis MHC class II antigens MHC class II immunopeptidomics in human phagocytes infected with Mycobacterium tuberculosis enables vaccine immunogen design and optimization.

Excited to share our newest manuscript on antigen discovery and vaccine design for tuberculosis lead by Owen Leddy (everyone hiring new faculty in a few years, remember this name!)

www.science.org/doi/10.1126/... #TBsky

05.11.2025 21:57 👍 21 🔁 7 💬 1 📌 1

I don't know yet- clearly something has to do with strength of intermolecular interactions (e.g. the Arg effect) but also I bet something has to do with refolding rates. Although surprisingly we don't see much correlation with topology (% alpha vs % beta) so maybe not folding rates... ???

24.11.2025 16:21 👍 1 🔁 0 💬 0 📌 0

That Savas lab looks cool, but this was actually done in collaboration with Jeff Savas's lab at Northwestern, who is not on bsky that I can find :) thanks for the highlight!

21.11.2025 21:28 👍 3 🔁 0 💬 0 📌 0

Thank you so much :)

20.11.2025 14:28 👍 0 🔁 0 💬 0 📌 0

Thanks! Yeah, lots to learn!

20.11.2025 12:13 👍 0 🔁 0 💬 0 📌 0

Thanks!!

20.11.2025 12:06 👍 0 🔁 0 💬 0 📌 0

I hope this is useful and we're excited to use this approach to explore more protein stresses- shelf life, vortexing, freeze-thaw, etc. I also love this paper because it's our lab's first time using quantitative proteomics! Congrats Cydney et al!!

19.11.2025 21:16 👍 10 🔁 0 💬 0 📌 0

Global Analysis of Aggregation Determinants in Small Protein Domains Protein aggregation is an obstacle for engineering effective recombinant proteins for biotechnology and therapeutic applications. Predicting protein aggregation propensity remains challenging due to t...

All data are available on Zenodo and we'd love to see what you can do with it! Cydney also made a nice notebook to run the predictive model (including DMS scan) in Colab!

Preprint: biorxiv.org/content/10.1...
Colab: colab.research.google.com/drive/1KNWvG...
Data: docs.google.com/forms/d/e/1F...

19.11.2025 21:16 👍 11 🔁 4 💬 1 📌 0

There’s much more in the paper, including the surprising observation that higher folding stability correlates with aggregation even controlling for hydrophobicity (?!) Stabilities measured by Kotaro Tsuboyama!

19.11.2025 21:16 👍 7 🔁 0 💬 2 📌 0

Existing aggregation predictors show only modest correlations with our results, but fine tuning SaProt does well on our (carefully split) test set!

19.11.2025 21:16 👍 4 🔁 0 💬 1 📌 0

Arginine is strongly aggregation promoting! This was known in literature but not to me.

19.11.2025 21:16 👍 6 🔁 0 💬 2 📌 0

Hydrophobicity and isoelectric point (pI) are correlated with temp- and pH-propensity, but don’t tell the whole story. Other factors also influenced aggregation.

19.11.2025 21:16 👍 3 🔁 0 💬 1 📌 0

Really cool result: changes in soluble abundance from having all protein domains mixed together are nicely correlated with similar experiments on single, purified domains (also at 10 mg/mL).

19.11.2025 21:16 👍 5 🔁 0 💬 1 📌 0

Overall, 25-50% of the total protein became insoluble, but the proteomics revealed which proteins aggregated (became insoluble) more and which aggregated less.

19.11.2025 21:16 👍 2 🔁 0 💬 1 📌 0

We expressed the protein domains together as a mixture in one E. coli culture, purified our mixture of domains and concentrated to 10mg/mL, then used TMT proteomics to monitor the change in soluble protein abundance after high temp/low pH.

19.11.2025 21:16 👍 3 🔁 0 💬 1 📌 0

Cydney M. Martell

This work was led by the incredible Cydney Martell, who worked out the entire approach from scratch and did virtually all the experiments and computational analysis (and earned her own PhRMA fellowship)! phrmafoundation.org/grants-fello... With proteomics help from the Savas Lab at NU!

19.11.2025 21:16 👍 3 🔁 0 💬 1 📌 0

New preprint! We measured temperature- and pH-induced aggregation for over 18,000 natural and de novo designed protein domains!

19.11.2025 21:16 👍 121 🔁 42 💬 4 📌 3

Global Analysis of Aggregation Determinants in Small Protein Domains https://www.biorxiv.org/content/10.1101/2025.11.11.687847v1

13.11.2025 01:47 👍 11 🔁 12 💬 0 📌 0

Gabriel Rocklin

Latest posts by Gabriel Rocklin @grocklin