Sergey Ovchinnikov's Avatar

Sergey Ovchinnikov

@sokrypton.org

Scientist, Assistant Professor at MIT biology, #FirstGen

3,892
Followers
525
Following
133
Posts
10.09.2023
Joined
Posts Following

Latest posts by Sergey Ovchinnikov @sokrypton.org

Video thumbnail

New preprint🚨
Imagine (re)designing a protein via inverse folding. AF2 predicts the designed sequence to a structure with pLDDT 94 & you get 1.8 Γ… RMSD to the input. Perfect design?
What if I told u that the structure has 4 solvent-exposed Trp and 3 Pro where a Gly should be?

Why to be waryπŸ§΅πŸ‘‡

16.12.2025 15:15 πŸ‘ 58 πŸ” 22 πŸ’¬ 4 πŸ“Œ 1

As a bonus, here's a video of ProteinEBM folding up the fast-folder NTL9, rendered in stunning 2D by py2Dmol from @sokrypton.org! We hope models like ProteinEBM can serve as a step toward solving the "real" protein folding problem.

12.12.2025 04:09 πŸ‘ 33 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0
Post image

An energy-based model of protein conformational space can be used to predict structure from sequence, sample from the conformational landscape, rank structures, and predict mutation effects.

@sokrypton.org

www.biorxiv.org/content/10.6...

10.12.2025 23:17 πŸ‘ 47 πŸ” 14 πŸ’¬ 0 πŸ“Œ 1

I'm super excited to announce the first preprint of my PhD, together with Chenxi Ou and @sokrypton.org!

ML has revolutionized protein modeling, but crucial challenges remain. For example, we can't reliably predict complicated protein structures without MSAs, which limits what we can design.

10.12.2025 15:02 πŸ‘ 31 πŸ” 6 πŸ’¬ 2 πŸ“Œ 1
Preview
CIRPIN: Learning Circular Permutation-Invariant Representations to Uncover Putative Protein Homologs Protein structure-based homology detection has been revolutionized by deep learning methods that can rapidly search massive databases. However, current structural search tools often miss proteins rela...

An interesting study from @aidenkzj.bsky.social, @abulnaga.bsky.social and @sokrypton.org that builds on our Progres model to find pairs of proteins with circular permutations:

www.biorxiv.org/content/10.1...

25.11.2025 14:43 πŸ‘ 7 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Preview
Proteome-wide in silico screening for human protein-protein interactions Protein-protein interactions (PPIs) drive virtually all biological processes, yet most PPIs have not been identified and even more remain structurally unresolved. We developed a two-step computational...

Thrilled to share that the final piece of my PhD work is now on bioRxiv! biorxiv.org/content/10.1... With support from @nvidia and the @NSF, we used AlphaFold to screen 1.6M+ protein pairs, revealing thousands of potential novel PPIs. All data can be viewed at predictomes.org/hp

12.11.2025 21:26 πŸ‘ 163 πŸ” 67 πŸ’¬ 5 πŸ“Œ 4
Post image

Adding support for interactive MSA viewing, including coloring by entropy! Auto download from AFDB (for both uniprot and pdb entries). (4/4)

19.11.2025 08:15 πŸ‘ 9 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image

Save vectorized SVG for infinitely β™Ύ zoomable figures. (including support for contacts).

See example:
biorxiv.org/content/10.1...

Zhidian Zhang @yoakiyama.bsky.social @yehlincho.bsky.social @jajoosam.bsky.social

(3/4)

19.11.2025 08:15 πŸ‘ 6 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Video thumbnail

Adding record button to save animation. (2/4)

19.11.2025 08:15 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

A few py2Dmol updates 🧬

py2dmol.solab.org
Integration with AlphaFoldDB (will auto fetch results). Drag and drop results from AF3-server or ColabFold for interactive experience! (1/4)

19.11.2025 08:15 πŸ‘ 104 πŸ” 31 πŸ’¬ 1 πŸ“Œ 0
Post image

Is 3D dragging you down? Wish you could instead use the 2D ColabFold representation for all your work? πŸ€“

Introducing: py2Dmol 🧬

(feedback, suggestions, requests are welcome)

29.10.2025 01:39 πŸ‘ 29 πŸ” 4 πŸ’¬ 1 πŸ“Œ 1
Video thumbnail

Working on the protein-hunter-chai google colab notebook. 😈

@yehlincho.bsky.social

28.10.2025 03:34 πŸ‘ 33 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0
Post image

Will it bind? A little worried about all the "TTTTTTT" 🧐 But looks cool 😎

27.10.2025 00:07 πŸ‘ 16 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Video thumbnail

Thrilled to announce our new preprint, β€œProtein Hunter: Exploiting Structure Hallucination within Diffusion for Protein Design,” in collaboration with @Griffin, @GBhardwaj8 and @sokrypton.org

🧬Code and notebooks will be released by the end of this week.
🎧Golden- Kpop Demon Hunters

13.10.2025 15:45 πŸ‘ 52 πŸ” 16 πŸ’¬ 3 πŸ“Œ 2
Preview
Paper page - Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents Join the discussion on this paper page

Maybe something akin to paper2agent:
huggingface.co/papers/2509....

11.09.2025 16:29 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Looks like someone has already tried to replace me with an AI agent 🫣

11.09.2025 16:17 πŸ‘ 21 πŸ” 0 πŸ’¬ 1 πŸ“Œ 1

hmmm... any ideas why mpnn would make things worse for af2, but make things about the same as af2 when used with boltz?

02.09.2025 22:55 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Video thumbnail

Exciting to see our protein binder design pipeline BindCraft published in its final form in @Nature ! This has been an amazing collaborative effort with Lennart, Christian, @sokrypton.org, Bruno and many other amazing lab members and collaborators.

www.nature.com/articles/s41...

27.08.2025 16:14 πŸ‘ 305 πŸ” 109 πŸ’¬ 14 πŸ“Œ 11

Now that OpenCRISPR is in nature and rekindled the 'what's-a-novel-sequence' debate, I'm happy to share an app to check this, which I built for fun some time ago.

fuerstlab.shinyapps.io/SeqNovelty/

quick 🧡

01.08.2025 11:50 πŸ‘ 17 πŸ” 10 πŸ’¬ 1 πŸ“Œ 0

Yeah, we would expect the pseudo-likelihood to be maximized for best paired MSA!

06.08.2025 00:46 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

We'll add an option to add custom MSA inputs to the notebook (later tonight). 😎

05.08.2025 17:45 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

MMseqs2 v18 is out
- SIMD FW/BW alignment (preprint soon!)
- Sub. Mat. Ξ» calculator by Eric Dawson
- Faster ARM SW by Alexander Nesterovskiy
- MSA-Pairformer’s proximity-based pairing for multimer prediction (www.biorxiv.org/content/10.1...; avail. in ColabFold API)
πŸ’Ύ github.com/soedinglab/M... & 🐍

05.08.2025 08:25 πŸ‘ 64 πŸ” 17 πŸ’¬ 0 πŸ“Œ 0
Preview
How AlphaFold and related models predict protein-peptide complex structures Protein-peptide interactions mediate many biological processes, and access to accurate structural models, through experimental determination or reliable computational prediction, is essential for unde...

It was expected. The surprising part for me was that AlphaFold doesn't seem to care....

See study here from @lindseyguan.bsky.social
www.biorxiv.org/content/10.1...

This makes me wonder if the reason it doesn't seem to care is because it was trained on and evaluated on poorly paired MSAs. πŸ€”

05.08.2025 15:58 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

2Y69 is technically "prokaryotic" as it's from the mitochondria. 🧐

05.08.2025 13:27 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

We find this to be true across a number of targets. Where method used to pair sequences and filter them makes a big difference. This we find to be important when trying to disentangle paralogs from orthologs (4/4).

05.08.2025 07:39 πŸ‘ 6 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
Post image

The big difference is in the pairing. The MMseqs2 server pairs sequences based on species, while our old HHblits MSAs were paired based on genome proximity (number of genes apart). Working w/ @milot.bsky.social and @martinsteinegger.bsky.social we implemented the proximity filtering in server (3/4)

05.08.2025 07:39 πŸ‘ 11 πŸ” 3 πŸ’¬ 1 πŸ“Œ 0
Post image

Side story: While working on the Google Colab notebook for MSA pairformer. We encountered a problem: The MMseqs2 ColabFold MSA did not show any contacts at protein interfaces, while our old HHblits alignments showed clear contacts πŸ«₯... (2/4)

05.08.2025 07:39 πŸ‘ 13 πŸ” 3 πŸ’¬ 1 πŸ“Œ 0

Excited to re-share work from
@yoakiyama.bsky.social and Zhidian Zhang on MSA pairformer. (1/4)

05.08.2025 07:39 πŸ‘ 26 πŸ” 5 πŸ’¬ 1 πŸ“Œ 0
Preview
Scaling down protein language modeling with MSA Pairformer Recent efforts in protein language modeling have focused on scaling single-sequence models and their training data, requiring vast compute resources that limit accessibility. Although models that use ...

Excited to share work with
Zhidian Zhang, @milot.bsky.social, @martinsteinegger.bsky.social, and @sokrypton.org
biorxiv.org/content/10.1...
TLDR: We introduce MSA Pairformer, a 111M parameter protein language model that challenges the scaling paradigm in self-supervised protein language modeling🧡

05.08.2025 06:29 πŸ‘ 97 πŸ” 43 πŸ’¬ 1 πŸ“Œ 1

Not sure, I was referring to the earlier observation that alphafold adjust it's confidence when additional proteins are provided. Was making a point that this is expected, and just math πŸ™ƒ

03.08.2025 01:27 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0