Jeremie Kalfon 👨‍💻🧬🤖🚀

@jkobject.com

Doing a Ph.D. AI in Bio. | Ex @WhiteLabGx @BroadInstitute @MIT | Built @PiPleteam | ML, Cancer, Genomics, Data Sci, Entrepreneur, FullStack Dev | All views are mine

704
Followers
3,258
Following
143
Posts
06.11.2024
Joined

Latest posts by Jeremie Kalfon 👨‍💻🧬🤖🚀 @jkobject.com

We would rather let some people get cancer, MS, or Parkinson's than give a virus to people who will get it anyway. Many people might volunteer! Indeed, we give so much to associations fighting cancer, MS, and dementia, but when it comes time to do something about it, it seems no one wants to.
6/6

29.01.2026 08:40 👍 0 🔁 0 💬 0 📌 0

← Nowadays, no regulatory agency, least of all in Europe, would let you do that.
5/6

29.01.2026 08:40 👍 0 🔁 0 💬 1 📌 0

You could recruit kids, give them the vaccine, and infect them with EBV directly, since you know that almost all of them will be at some point, then check if they get infected or not using sequencing (PCR tests, B-cell antigen sequencing), and accept this as the endpoint of the trial.
4/6

29.01.2026 08:40 👍 1 🔁 0 💬 1 📌 0

Because you need to recruit tens of thousands of young kids, test them often for infection, and wait decades to see symptoms of other diseases appear in some of them. The statistics are terrible.

But it could be cheap...
3/6

29.01.2026 08:40 👍 0 🔁 0 💬 1 📌 0

From different cancers, skin diseases, dementia, Parkinson's and more.
The reason why there is no vaccine yet in 2026 is very interesting.
Basically, it is super expensive. But why is it so?
2/6

29.01.2026 08:40 👍 0 🔁 0 💬 1 📌 0

Did you know that most cases of multiple sclerosis (MS) are likely driven by EBV (Epstein-Barr virus, the herpesvirus that causes mononucleosis)?

>90% of us get infected in our teens, and some will go on to develop many diseases later in life because of it.
1/6

29.01.2026 08:39 👍 1 🔁 0 💬 1 📌 0

And then lucky to pursue through an atlas of cells of many types and species, with a focus on quality and diversity mattering more than quantity with @jkobject.com

27.01.2026 13:09 👍 0 🔁 1 💬 0 📌 0

@cantinilab.bsky.social

17.12.2025 07:08 👍 0 🔁 0 💬 0 📌 0
scPRINT-2: Towards the next-generation of cell foundation models and benchmarks
Cell biology has been booming with foundation models trained on large single-cell RNA-seq databases, but benchmarks and capabilities remain unclear. We propose an additive benchmark across a gymnasium of tasks to discover which features improve performance. From these findings, we present scPRINT-2, a single-cell Foundation Model pre-trained across 350 million cells and 16 organisms. Our contributions in pre-training tasks, tokenization, and losses made scPRINT-2 state-of-the-art in expression denoising, cell embedding, and cell type prediction. Furthermore, with our cell-level architecture, scPRINT-2 becomes generative, as demonstrated by our expression imputation and counterfactual reasoning results. Finally, thanks to our pre-training database, we uncover generalization to unseen modalities and organisms. These studies, together with improved abilities in gene embeddings and gene network inference, place scPRINT-2 as a next-generation cell foundation model.

Paper: www.biorxiv.org/cont... • Code: github.com/cantinila...

Curious: **what’s the one benchmark you wish every single-cell foundation model reported by default?**
6/6

16.12.2025 19:49 👍 1 🔁 0 💬 0 📌 0

4. **Generalization:** evaluation on **unseen organisms, tasks, and modalities.** It is also a push to rethink some evaluation of scFM; **SOTA on many tasks**. 🥇 🏃 ⛷️ ⛹️‍♀️

🎁 If you’re reading papers over the break, I hope this is useful.
5/6

16.12.2025 19:49 👍 0 🔁 0 💬 1 📌 0

3. **Data + pipeline:** unified **scBaseCount + Tahoe-100M + CELLxGENE**, with consistent preprocessing + weighted random sampling (and other practical bits that usually stay hidden) → **350M cells, 16 species, ~300 tissues, ~500 cell types**. 🌍🫁🐭
4/6

16.12.2025 19:48 👍 0 🔁 0 💬 1 📌 0

1. **Benchmark:** **42 components** of scFMs across a gymnasium of tasks; looking at dataset size, encoding, training, architectures, losses, etc. 📊

2. **Model:** **scPRINT-2**, *small but mighty* with **~20M active parameters**, built from the strongest ingredients we found. 🤖🧬
3/6

16.12.2025 19:48 👍 0 🔁 0 💬 1 📌 0

After a few years building scFMs (scPRINT, Xpressor, scPRINT-2…), I wanted to do something more “complete” than just shipping a new model: understand what matters, train the best version we can, and stress-test generalization properly.

So this work is a 4-in-1 release:
2/6

16.12.2025 19:48 👍 0 🔁 0 💬 1 📌 0

🧑‍🎄🎄 Christmas Foundation Model Release: scPRINT-2

**One-liner:** a **20M-active-param** single-cell foundation model trained on **350M cells / 16 species / 300 tissues / 500 cell types**.
1/6

16.12.2025 19:48 👍 3 🔁 1 💬 2 📌 0

Thanks to Future4Care, Timothé Cynober, Whitelab Genomics, and Scienta Lab for organizing the event, and to Matteo Marengo, Clara Brouaux, and Gabriel Michaux for helping me manage the round table.

And thanks to my all-star panel: Yann Fleureau, Jeremy Besnard, Sofia Dahoune, and Steven Jerome

05.12.2025 09:30 👍 0 🔁 0 💬 0 📌 0

It was a blast hosting our Nucleate Inside AI roundtable at the France Techbio 2025 event.

05.12.2025 09:30 👍 1 🔁 0 💬 1 📌 0
TechBio France 2025
Join TechBio France 2025 to shape the future of France's TechBio ecosystem, fostering innovation and collaboration in biotech and technology

Join us Friday the 4th at the 🇫🇷 France TechBio2025 event!!
www.eventbrite.fr/e/...
3/3

07.11.2025 09:15 👍 0 🔁 0 💬 0 📌 0

Without double-talk and with amazing panelists 🧑‍🔬:

- Yann Fleureau, CEO, Blossom Life Sci & Founder of Cardiologs
- Steven Jerome, Director, Lead of Hit Discovery, Schrödinger
- Jérémy Besnard, Advisor, InFocusTx & Co-founder of Exscientia
- Sofia Dahoune, Partner at Daphni
2/3

07.11.2025 09:15 👍 1 🔁 0 💬 1 📌 0

🌐🧬 I am excited to present a round table I am hosting together with Matteo Marengo and Gabriel Michaux as part of our emerging Nucleate Parisian chapter, led by Clara Brouaux 🔥.

Title: **Inside AI: Choosing the Right Path to Value Creation**
1/3

07.11.2025 09:15 👍 1 🔁 1 💬 1 📌 0
Open Conference of AI Agents for Science: 2025 The 1st Open Conference of AI Agents for Science (agents4science 2025). AI serves as both primary authors and reviewers of research papers.

Next week we will see the first conference where both the main authors and reviewers are LLM Agents!

This might be fun to follow: agents4science.stanf...
👀 🤖

17.10.2025 12:59 👍 0 🔁 0 💬 0 📌 0

I am presenting my PhD work today at the conference on immuno oncology in Toulouse's CRCT Oncopole!

Happy to talk about how we can use foundation models in the real world 🧬 🧑‍⚕️

16.10.2025 10:35 👍 1 🔁 0 💬 0 📌 0

👉 Learn more & apply:

06.10.2025 08:46 👍 0 🔁 0 💬 0 📌 0

🔷 Alnylam BioVenture Challenge: one day at Alnylam HQ, one shot at $100K in non-dilutive funding. Apply by Oct 17.

And we’re also recruiting the next generation of Nucleate Leaders. If you’re ready to build biotech and strengthen the community behind it, apply today.

06.10.2025 08:46 👍 1 🔁 0 💬 1 📌 0

It’s about growth, collaboration, and the chance to give back by lifting others.
Two flagship opportunities are now open:

🔷 Activator 2026: our equity-free accelerator equipping scientific founders with the tools to launch biotech ventures. Apply by Oct 20.

06.10.2025 08:46 👍 1 🔁 0 💬 1 📌 0

Proud to be a Nucleate Leader! 🚀

Being a Nucleate Leader means joining a community of peers who step up to shape the future of biotech: leading teams, driving programs, and building ventures that make real impact.

06.10.2025 08:46 👍 0 🔁 0 💬 1 📌 0

The first 1 million prime numbers visualized in 2D according to their prime factors (UMAP)

Source: johnhw.github.io/uma...

25.09.2025 08:05 👍 3 🔁 1 💬 0 📌 0

what they sell you, what you get...

20.09.2025 09:54 👍 1 🔁 0 💬 0 📌 0

If you are launching your biotech / techbio startup in Paris or anywhere else in the world actually, think about applying to Nucleate's Global Activator Program! 🚀 🧬

Many people to meet and things to learn from Researchers, Investors, CEOs and more!

12.09.2025 07:28 👍 2 🔁 0 💬 0 📌 0

Of course, but you first need the info. Right now it is like driving a car blindfolded...

24.07.2025 14:06 👍 0 🔁 0 💬 0 📌 0

We then spend hundreds of billions treating what could have been avoided.

Why aren’t we doing this by default?
2/2

24.07.2025 11:44 👍 0 🔁 0 💬 1 📌 0