New preprintπ¨
Imagine (re)designing a protein via inverse folding. AF2 predicts the designed sequence to a structure with pLDDT 94 & you get 1.8 Γ
RMSD to the input. Perfect design?
What if I told u that the structure has 4 solvent-exposed Trp and 3 Pro where a Gly should be?
Why to be waryπ§΅π
16.12.2025 15:15
π 58
π 22
π¬ 4
π 1
As a bonus, here's a video of ProteinEBM folding up the fast-folder NTL9, rendered in stunning 2D by py2Dmol from @sokrypton.org! We hope models like ProteinEBM can serve as a step toward solving the "real" protein folding problem.
12.12.2025 04:09
π 33
π 5
π¬ 0
π 0
An energy-based model of protein conformational space can be used to predict structure from sequence, sample from the conformational landscape, rank structures, and predict mutation effects.
@sokrypton.org
www.biorxiv.org/content/10.6...
10.12.2025 23:17
π 47
π 14
π¬ 0
π 1
I'm super excited to announce the first preprint of my PhD, together with Chenxi Ou and @sokrypton.org!
ML has revolutionized protein modeling, but crucial challenges remain. For example, we can't reliably predict complicated protein structures without MSAs, which limits what we can design.
10.12.2025 15:02
π 31
π 6
π¬ 2
π 1
Adding support for interactive MSA viewing, including coloring by entropy! Auto download from AFDB (for both uniprot and pdb entries). (4/4)
19.11.2025 08:15
π 9
π 1
π¬ 0
π 0
Save vectorized SVG for infinitely βΎ zoomable figures. (including support for contacts).
See example:
biorxiv.org/content/10.1...
Zhidian Zhang @yoakiyama.bsky.social @yehlincho.bsky.social @jajoosam.bsky.social
(3/4)
19.11.2025 08:15
π 6
π 1
π¬ 1
π 0
Adding record button to save animation. (2/4)
19.11.2025 08:15
π 2
π 0
π¬ 1
π 0
A few py2Dmol updates π§¬
py2dmol.solab.org
Integration with AlphaFoldDB (will auto fetch results). Drag and drop results from AF3-server or ColabFold for interactive experience! (1/4)
19.11.2025 08:15
π 104
π 31
π¬ 1
π 0
Is 3D dragging you down? Wish you could instead use the 2D ColabFold representation for all your work? π€
Introducing: py2Dmol π§¬
(feedback, suggestions, requests are welcome)
29.10.2025 01:39
π 29
π 4
π¬ 1
π 1
Working on the protein-hunter-chai google colab notebook. π
@yehlincho.bsky.social
28.10.2025 03:34
π 33
π 5
π¬ 0
π 0
Will it bind? A little worried about all the "TTTTTTT" π§ But looks cool π
27.10.2025 00:07
π 16
π 1
π¬ 1
π 0
Thrilled to announce our new preprint, βProtein Hunter: Exploiting Structure Hallucination within Diffusion for Protein Design,β in collaboration with @Griffin, @GBhardwaj8 and @sokrypton.org
π§¬Code and notebooks will be released by the end of this week.
π§Golden- Kpop Demon Hunters
13.10.2025 15:45
π 52
π 16
π¬ 3
π 2
Looks like someone has already tried to replace me with an AI agent π«£
11.09.2025 16:17
π 21
π 0
π¬ 1
π 1
hmmm... any ideas why mpnn would make things worse for af2, but make things about the same as af2 when used with boltz?
02.09.2025 22:55
π 1
π 0
π¬ 1
π 0
Exciting to see our protein binder design pipeline BindCraft published in its final form in @Nature ! This has been an amazing collaborative effort with Lennart, Christian, @sokrypton.org, Bruno and many other amazing lab members and collaborators.
www.nature.com/articles/s41...
27.08.2025 16:14
π 305
π 109
π¬ 14
π 11
Now that OpenCRISPR is in nature and rekindled the 'what's-a-novel-sequence' debate, I'm happy to share an app to check this, which I built for fun some time ago.
fuerstlab.shinyapps.io/SeqNovelty/
quick π§΅
01.08.2025 11:50
π 17
π 10
π¬ 1
π 0
Yeah, we would expect the pseudo-likelihood to be maximized for best paired MSA!
06.08.2025 00:46
π 2
π 0
π¬ 0
π 0
We'll add an option to add custom MSA inputs to the notebook (later tonight). π
05.08.2025 17:45
π 2
π 0
π¬ 0
π 0
MMseqs2 v18 is out
- SIMD FW/BW alignment (preprint soon!)
- Sub. Mat. Ξ» calculator by Eric Dawson
- Faster ARM SW by Alexander Nesterovskiy
- MSA-Pairformerβs proximity-based pairing for multimer prediction (www.biorxiv.org/content/10.1...; avail. in ColabFold API)
πΎ github.com/soedinglab/M... & π
05.08.2025 08:25
π 64
π 17
π¬ 0
π 0
2Y69 is technically "prokaryotic" as it's from the mitochondria. π§
05.08.2025 13:27
π 1
π 0
π¬ 1
π 0
We find this to be true across a number of targets. Where method used to pair sequences and filter them makes a big difference. This we find to be important when trying to disentangle paralogs from orthologs (4/4).
05.08.2025 07:39
π 6
π 0
π¬ 2
π 0
The big difference is in the pairing. The MMseqs2 server pairs sequences based on species, while our old HHblits MSAs were paired based on genome proximity (number of genes apart). Working w/ @milot.bsky.social and @martinsteinegger.bsky.social we implemented the proximity filtering in server (3/4)
05.08.2025 07:39
π 11
π 3
π¬ 1
π 0
Side story: While working on the Google Colab notebook for MSA pairformer. We encountered a problem: The MMseqs2 ColabFold MSA did not show any contacts at protein interfaces, while our old HHblits alignments showed clear contacts π«₯... (2/4)
05.08.2025 07:39
π 13
π 3
π¬ 1
π 0
Excited to re-share work from
@yoakiyama.bsky.social and Zhidian Zhang on MSA pairformer. (1/4)
05.08.2025 07:39
π 26
π 5
π¬ 1
π 0
Scaling down protein language modeling with MSA Pairformer
Recent efforts in protein language modeling have focused on scaling single-sequence models and their training data, requiring vast compute resources that limit accessibility. Although models that use ...
Excited to share work with
Zhidian Zhang, @milot.bsky.social, @martinsteinegger.bsky.social, and @sokrypton.org
biorxiv.org/content/10.1...
TLDR: We introduce MSA Pairformer, a 111M parameter protein language model that challenges the scaling paradigm in self-supervised protein language modelingπ§΅
05.08.2025 06:29
π 97
π 43
π¬ 1
π 1
Not sure, I was referring to the earlier observation that alphafold adjust it's confidence when additional proteins are provided. Was making a point that this is expected, and just math π
03.08.2025 01:27
π 2
π 0
π¬ 0
π 0