Janani Durairaj (Jay) (@ninjani)

Five years ago, we released FLIP. The core question was: can ML models for protein fitness prediction generalize in the ways that actually matter for protein engineering, i.e. low data, extrapolation to more mutations, out-of-distribution sequences?

26.02.2026 21:58 👍 4 🔁 5 💬 2 📌 0

Remote homology and protein design: two sides of the same coin. Instead of finding remote homologs, we used TEA to design completely de novo proteins, folding into desired TEA sequences.

I always love working with Jay, and “speed-running” this proof of concept was no exception.

11.02.2026 16:47 👍 9 🔁 3 💬 0 📌 0

Also a great time to showcase @lorenzopantolini.bsky.social's awesomeness as he slowly starts the job hunt! If you need someone with a deep understanding of biological latent spaces and how to exploit them for practical applications, he's your guy.

11.02.2026 10:52 👍 5 🔁 0 💬 0 📌 0

This was a speed-run to validate the in silico proof-of-concept, but the possibilities are endless. It may represent a path orthogonal to current structure-based methods. We're working on adaptations and, of course, looking to experimentally validate. (8/n)

11.02.2026 10:52 👍 0 🔁 0 💬 1 📌 0

Previous MCMC works used contact map loss (needed ~170k steps, Verkuil et al. 2022) or ESMFold pTM (i.e folding at every step, Hie et al. 2022). By optimising a 1D sequence with TEA, we see a >10x speed increase. (7/n)

11.02.2026 10:52 👍 0 🔁 0 💬 1 📌 0

For unconditional design, we get high-pLDDT proteins unlike any known sequences. A small TEA k-mer diversity loss helped steer us away from simple coiled-coils toward complex secondary structure combos. (6/n)

11.02.2026 10:52 👍 1 🔁 0 💬 1 📌 0

For template-guided design, we generated novel sequences predicted to fold into both de novo and natural scaffolds (AF2 single seq). Many have a NEFF of 1. No structures were used in the making of these designs. (5/n)

11.02.2026 10:52 👍 0 🔁 0 💬 1 📌 0

The approach:
1. Take a random sequence
2. Randomly mutate
3. Accept/reject via Metropolis criterion based on ESM2 likelihood + TEA template match (or TEA entropy if unconditional).
This is fast, 30k steps in ~25min. (4/n)

11.02.2026 10:52 👍 1 🔁 0 💬 1 📌 0

We noticed that TEA logit entropy correlates well with structure prediction confidence (pLDDT). Ideally, we could combine the ESM2 likelihood (naturalness) with TEA (structural consistency) to guide design. (3/n)

11.02.2026 10:52 👍 1 🔁 0 💬 1 📌 0

We recently released The Embedded Alphabet (TEA), a tiny head on top of ESM2 converting amino acids into a new 20-letter structural alphabet. Great for search (see bsky.app/profile/lore...), but we wondered: could we use it for generation? (2/n)

11.02.2026 10:52 👍 1 🔁 2 💬 1 📌 0

A fun little idea that worked surprisingly well, using a structure-informed yet structure-independent alphabet for de novo protein design: www.biorxiv.org/content/10.6...
🧵(1/n)

11.02.2026 10:52 👍 32 🔁 7 💬 1 📌 1

Mirdita Lab - Laboratory for Computational Biology & Molecular Machine Learning Mirdita Lab builds scalable bioinformatics methods.

My time in @martinsteinegger.bsky.social's group is ending, but I’m staying in Korea to build a lab at Sungkyunkwan University School of Medicine. If you or someone you know is interested in molecular machine learning and open-source bioinformatics, please reach out. I am hiring!
mirdita.org

20.01.2026 11:07 👍 104 🔁 55 💬 7 📌 1

Science Tree Farm

open.spotify.com/episode/5EXO...

20.12.2025 23:03 👍 2 🔁 2 💬 0 📌 0

Know when to co-fold'em This is the official web page for the James Fraser Lab at UCSF.

I'm really excited to break up the holiday relaxation time with a new preprint that benchmarks AlphaFold3 (AF3)/“co-folding” methods with 2 new stringent performance tests.

Thread below - but first some links:
A longer take:
fraserlab.com/2025/12/29/k...

Preprint:
www.biorxiv.org/content/10.6...

29.12.2025 22:25 👍 72 🔁 30 💬 5 📌 2

Thanks a lot for the review! We somehow missed it until quite recently but I think addressed a good chunk of the comments in revision anyway, looking forward to your thoughts when it's out

05.01.2026 09:40 👍 0 🔁 0 💬 0 📌 1

🚀 New paper in @natmethods.nature.com!
We present OpenStructure's powerful scoring capabilities, used to assess predictionsin CAMEO and CASP.
Read the full study here:
🔗 doi.org/10.1038/s415...
#StructuralBiology #Bioinformatics #OpenStructure #CASP #CAMEO #ProteinStructure

23.12.2025 15:18 👍 5 🔁 4 💬 1 📌 0

Been excited about this one for a while! What would you do with a new alphabet and the wealth of protein sequence bioinformatics at your disposal? We're also around at #EMBOComp3D Heidelberg and MLSB Copenhagen this week to discuss

01.12.2025 10:58 👍 27 🔁 8 💬 0 📌 0

OpenFold3-preview (OF3p) is out: a sneak peek of our AF3-based structure prediction model. Our aim for OF3 is full AF3-parity for every modality. We now believe we have a clear path towards this goal and are releasing OF3p to enable building in the OF3 ecosystem. More👇

28.10.2025 18:30 👍 126 🔁 42 💬 1 📌 3

This October I’m drawing one molecule a day inspired by proteins in pdb @rcsbpdb.bsky.social

Day 2/31
Prompt WEAVE

N-terminal domain of a Fibrion - a building block of silk fiber produced by silkworms.

Pdb: 3UA0

Next prompt is CROWN and I would love your suggestions!

03.10.2025 03:29 👍 52 🔁 14 💬 2 📌 2

The Viral AlphaFold Database of monomers and homodimers reveals conserved protein folds in viruses of bacteria, archaea, and eukaryotes VAD is a Viral AlphaFold Database of protein monomers and homodimers from viruses infecting hosts across the tree of life.

Viral AlphaFold Database (VAD) is live in Science Advances

~27,000 predicted viral protein monomers & homodimers

Conserved folds across bacteria, archaea & eukaryotic viruses

New toxin–antitoxin system KreTA uncovered

Vast “functional darkness” remains uncharted

www.science.org/doi/10.1126/...

02.10.2025 08:48 👍 83 🔁 35 💬 1 📌 0

Océane Follonier @oceanef.bsky.social for
“From bytes to binders: design, score and optimize” #bc2basel #posterprize

10.09.2025 14:54 👍 2 🔁 2 💬 1 📌 0

Critical benchmarking of structure prediction methods has been crucial for measuring progress and detecting breakthroughs. But how will the future look like? Join the discussion at our workshop in Basel on September 8 - just before the [BC]2 conference.

@sib.swiss @biozentrum.unibas.ch

⬇️⬇️⬇️

27.08.2025 17:32 👍 11 🔁 3 💬 0 📌 0

Exciting to see our protein binder design pipeline BindCraft published in its final form in @Nature ! This has been an amazing collaborative effort with Lennart, Christian, @sokrypton.org, Bruno and many other amazing lab members and collaborators.

www.nature.com/articles/s41...

27.08.2025 16:14 👍 305 🔁 109 💬 14 📌 11

Still some spots left, join us in Basel on Sep 8 (before [BC]2) to discuss structure prediction benchmarking and more!

27.08.2025 08:14 👍 2 🔁 1 💬 0 📌 0

The future of structure prediction benchmarking: measuring progress and breakthroughs · Luma Benchmarking has been a key driver of progress in protein structure prediction methods. As the field continues to evolve, several key questions prevail: How…

🔬 Workshop: Future of Structure Prediction Benchmarking
📅 Sept 8, 2025 | Basel
💡 Talks + breakout sessions on #CASP #CAPRI #CAMEO & benchmarking for drug discovery
🎟️ Free registration (limited spots): lu.ma/ws9nu1xf
Join us to explore how benchmarking can drive breakthroughs in structure prediction.

27.08.2025 07:59 👍 4 🔁 1 💬 0 📌 2

Join us at #EMBOComp3D to explore cutting-edge breakthroughs in computational structural biology, AI, drug design, and innovative software! 💻

Find out about molecular modelling to systems-level analyses and evolution, and more.

Submit your abstract by 26 Aug ➡️ s.embl.org/csb25-01-bl

24.07.2025 11:14 👍 5 🔁 3 💬 0 📌 2

Protein Annotations in the age of AI A not-for-profit symposium hosted at UCL - more details about speakers and venue below.

CATH turns 30 years old this year!

We are organising a 1-day symposium on September 16th at UCL, highlighting recent AI-based developments to enhance protein family classifications, annotations and analyses.

www.eventbrite.co.uk/e/protein-an...

22.08.2025 10:45 👍 12 🔁 7 💬 2 📌 0

AtomWorks is out! Building upon @biotite_python, we built a toolkit for all things biomolecules and trained RF3 with it. All open-source, test it via `pip install atomworks`!

AtomWorks: github.com/RosettaCommo...
RF3: github.com/RosettaCommo...
Paper: tinyurl.com/y2w4z65b

1/6

15.08.2025 17:44 👍 22 🔁 6 💬 1 📌 0

Scatter plot of LDDT as a function of sequence identity for high coverage homology models. The horizontal red line at 40% sequence identity highlights the presence high quality models in the low sequence identity region.

Filtering out homologous structures from the PDB at 40% sequence identity is not enough to create a robust test set. Significant leakage persists at this level, and comparative modeling can still produce high quality models.

08.08.2025 07:48 👍 6 🔁 4 💬 1 📌 0

Biozentrum PhD Fellowships Share your passion for life sciences. If you are talented and highly motivated, want to broaden your horizons and are interested in a wide range of research topics, apply for one of the sought after B...

Looking for a #fellowship for an independent #PhD at one of the best places for life sciences in the world?

The summer call at @biozentrum.unibas.ch @unibas.ch is open until October 12, 2025.

www.biozentrum.unibas.ch/phd/internat...

07.08.2025 04:17 👍 14 🔁 10 💬 0 📌 0

Janani Durairaj (Jay)

Latest posts by Janani Durairaj (Jay) @ninjani