I'm happy to share that our gReLU package is now published in Nature Methods!
www.nature.com/articles/s41...
I'm happy to share that our gReLU package is now published in Nature Methods!
www.nature.com/articles/s41...
scverse turns 3!
What started as a shared vision for interoperable single-cell analysis has become a vibrant, global community.
From AnnData to full multimodal pipelines, weβre building the future of everything single-cell and spatial omics, together.
Hereβs to whatβs next!
π£ Mark your calendars! The 2025 edition of the scverse conference will take place on 17-19 November at Stanford University (US) scverse.org/conference20...
Call for abstracts and registrations coming soon!
Our preprint on designing and editing cis-regulatory elements using Ledidi is out! Ledidi turns *any* ML model (or set of models) into a designer of edits to DNA sequences that induce desired characteristics.
Preprint: www.biorxiv.org/content/10.1...
GitHub: github.com/jmschrei/led...
genomebiology.biomedcentral.com/articles/10....
Quite an indictment of some of the current single cell "virtual cell" foundation models. Even for the relatively mundane applications, cell labeling, batch correction etc, they are poor compared to much simpler & cheaper methods.
First-ever CODE ML workshop at ICML!
July 18 or 19, 2025, Vancouver, Canada
Submit papers on OSS libraries, maintenance, best practices & more.
Format: 4-page non-archival papers
Due: May 19
codeml-workshop.github.io/codeml2025/#...
Most people havenβt heard of this test, which is available in the US. It accurately predicts Alzheimerβs (not just if thereβs a risk, but when). It is modulated by exercise and likely other lifestyle factors.
Hereβs (almost) everything we know about it
erictopol.substack.com/p/the-breakt...
Some encouraging news for cross-gene generalization of allele effects in S2F models. www.biorxiv.org/content/10.1...
New preprint out!
This is probably my most important paper. To my deep chagrin, it has no math.
XIST is a non-coding RNA exclusive to XX females. It silences one of the X chromosomes.
So what is it doing in male heart Schwann cells?
Photo of Anne Carpenter with STATus List 2025 wording
As an academic who works on tech to discover causes and cures of disease, contributing to novel drugs reaching patients has been thrilling.
Thanks to @statnews.com naming me to STATUS List 2025 honoring leaders in health, medicine, and science!
#STATUSList
www.statnews.com/status-list/...
This!!! I hope someone in Washington is listening
www.wsj.com/tech/biotech...
Data collected with the new sequencing platform HyDrop v2 is shown. First, a schematic overview of the bead batches of the microfluidic beads is followed by a tSNE and a barplot showing the costs in comparison to 10x Genomics. Then, a track of mouse data (cortex) is shown together with nucleotide contribution scores in the FIRE enhancer in microglia. Here, the HyDrop and 10x based models show the same contributions. On the right, the Drosophila embryo collection is explained; in the paper HyDrop v2 and 10x data are compared to sciATAC data. Then, a nucleotide contribution score is also shown, whereas HyDrop v2 and 10x models show the same contribution, just as in mouse.
Our new preprint is out! We optimized our open-source platform, HyDrop (v2), for scATAC sequencing and generated new atlases for the mouse cortex and Drosophila embryo with 607k cells. Now, we can train sequence-to-function models on data generated with HyDrop v2!
www.biorxiv.org/content/10.1...
The cover of Nature Biomedical Engineering features work from #UWAllenβs @suinlee.bsky.social on techniques for auditing #AI dermatology image classifiersβone of two projects from the lab highlighted in this issue, alongside a deep learning model for cancer insights. www.nature.com/natbiomedeng...
Human Body Single-Cell Atlas of 3D Genome Organization and DNA Methylation https://www.biorxiv.org/content/10.1101/2025.03.23.644697v1
Our new pre-print, investigating a few important questions when we train S2F models on different types of MPRA datasets. Congrats to Yilun and @xinmingtu.bsky.social www.biorxiv.org/content/10.1...
Wow. "NIH" canceled my co-mentored (with Dave Sulzer) PhD student's F31 funding. His work is on understanding the genetics and neuroscience of language learning disorders. F31 provides no indirect $ to Columbia, just pays his salary. Not that it should matter, but he's an American citizen. W.T.F.
Portrait of Su-In Lee looking off to the side, holding a pen in front of a whiteboard with part of a handwritten algorithm visible behind her
Congratulations to #UWAllen professor @suinlee.bsky.social on her election as a Fellow of the International Society for Computational Biology! @iscb.bsky.social honored Lee for her pioneering work on explainable #AI for biology and medicine. www.iscb.org/iscb-news-it... #PopulationHealth #ThisIsUW
Awesome summary of the field. An important point is to separate the design method from the oracle model being used. Sometimes, people say they're proposing a new design method but mean a cool new oracle model.
Modelling and design of transcriptional enhancers
www.nature.com/articles/s44...
Workshop on Advances in Post-Bayesian methods (May 15--16, UCL): postbayes.github.io/workshop2025/
Our new paper describing a scalable approach for training sequence-to-function models on personal genomes ("personal genome training"), includes our observations on when this works and its limitations. www.biorxiv.org/content/10.1...
Congrats: Anna, @xinmingtu.bsky.social , @lxsasse.bsky.social
My heart goes out to all of the people at the NIH and CDC who were fired recently. These people weren't fired for being bad at their job or a waste of resources -- they were fired because they were easy to fire by outsiders trying to meet a quota. They worked years/decades.. for this?
Given that science funding is under attack, it might be as good a time as any to reflect on how we spend our precious dollars. Cutting out expenditure publishing papers in overpriced journals might be a good thing to seriously consider once again.
MLCB is an excellent conference and a great opportunity to meet other people in the field. Highly recommend attending!
[SAVE THE DATE] MLCB 2025 is happening Sept 10-11 at the NY Genome Center in NYC!
Attend the premier conference at the intersection of ML & Bio, share your research and make lasting connections!
Submission deadline: June 1
More details: mlcb.github.io
Help spread the wordβplease RT! #MLCB2025
Screenshot of preprint saying "To benchmark the methods, we evaluated the performance of six methods: Linear, Linear-GPT, CellOracle, GEARS, scGPT, and scFoundation (Methods). We also included a basic approach that averages gene expression across all cells within all known perturbations as the prediction of unseen perturbations (referred to as KnownAverage). The benchmarking results across the 17 datasets were summarized in Fig. 2b. Notably, the KnownAverage method consistently demonstrated some of the best overall performance across all four types of metrics."
Mean of the training data still absolutely crushing it for perturbation prediction.
www.biorxiv.org/content/10.1...
Excited to see Moscot (moscot-tools.org) published in @Nature! We scaled Optimal Transport (OT) in single-cell genomics & added multimodality together with spatiotemporal trajectory inference, finding exciting new biology in the pancreas! π Read at www.nature.com/articles/s41...
Congrats to Johannes Linder, David Kelley et al. on the journal publication of Borzoi - a long context sequence models of RNA-seq coverage profiles with many nice applications for transcriptional & post-transcriptional regulation & variant effect prediction.
www.nature.com/articles/s41... 1/
Very excited to announce that the single cell/nuc. RNA/ATAC/multi-ome resource from ENCODE4 is now officially public. This includes raw data, processed data, annotations and pseudobulk products. Covers many human & mouse tissues. 1/
www.encodeproject.org/single-cell/...
The first preprint of 2025! Together with Matvei, @halfacrocodile.bsky.social, & our amazing team, we are excited to share PARADE: an AI framework for designing mRNA UTRs with enhanced cell-type specificity & stability. www.biorxiv.org/content/10.1...
Our ChromBPNet preprint out!
www.biorxiv.org/content/10.1...
Huge congrats to Anusri! This was quite a slog (for both of us) but we r very proud of this one! It is a long read but worth it IMHO. Methods r in the supp. materials. Bluetorial coming soon below 1/