Home New Trending Search
About Privacy Terms
Posts
John Arevalo's posts

CytoSummaryNet is a Deep Sets-based approach that uses self-supervised contrastive learning in a multiple-instance learning framework. Try it out!

Paper: doi.org/10.1371/jour...

Code: github.com/carpenter-si...

With @johnarevalo.bsky.social @drannecarpenter.bsky.social Mehrtash Babadi

3/n

1 year ago 8 1 0 0
(a) Human U2OS cells treated with dimethyl sulfoxide (DMSO) and stained using the Cell Painting assay, which employs six dyes in five channels to label eight cellular compartments. The top row (from left to right) shows mitochondrial staining; actin, Golgi, and plasma membrane staining; and nucleolar and cytoplasmic RNA staining. The bottom row (from left to right) displays endoplasmic reticulum staining, DNA staining, and a montage of all five channels (from Cimini et al. [21]). (b) Thousands of features are extracted from each segmented cell in microscopy images of wells. A learned function f(x) (CytoSummaryNet) aggregates this data into a single feature vector: the sample’s profile. (c) An in-depth look at the model architecture used in this study. The model consists of three elements: a function φ(x), which maps the input data from ℝD to ℝL space, a summation, which collapses the cell dimension, and ρ(z), which maps the collapsed representation from ℝN to ℝL space. (d) During training, replicate compound profiles are forced to attract each other (green arrows) and simultaneously repel every other compound (red arrows) in the learned feature space. Here, all forces are drawn for a single profile of compound B.

(a) Human U2OS cells treated with dimethyl sulfoxide (DMSO) and stained using the Cell Painting assay, which employs six dyes in five channels to label eight cellular compartments. The top row (from left to right) shows mitochondrial staining; actin, Golgi, and plasma membrane staining; and nucleolar and cytoplasmic RNA staining. The bottom row (from left to right) displays endoplasmic reticulum staining, DNA staining, and a montage of all five channels (from Cimini et al. [21]). (b) Thousands of features are extracted from each segmented cell in microscopy images of wells. A learned function f(x) (CytoSummaryNet) aggregates this data into a single feature vector: the sample’s profile. (c) An in-depth look at the model architecture used in this study. The model consists of three elements: a function φ(x), which maps the input data from ℝD to ℝL space, a summation, which collapses the cell dimension, and ρ(z), which maps the collapsed representation from ℝN to ℝL space. (d) During training, replicate compound profiles are forced to attract each other (green arrows) and simultaneously repel every other compound (red arrows) in the learned feature space. Here, all forces are drawn for a single profile of compound B.

Taking pictures of cells with a microscope, then extracting thousands of features from them is uncannily effective for quantifying cell state, esp. for genes and chemicals (e.g., Cell Painting). But we often average the rich single-cell data to simplify analysis. Can we do better?
#bioML 🧪
1/n

1 year ago 73 18 1 0
Post image

Excited to present our spotlight paper at #NeurIPS!

MOTIVE is a new dataset + benchmark for predicting drug-target interactions, using Cell Painting data

Location: Fri 13 Dec 4:30 p.m. PST @ East Exhibit Hall A-C #4208
Poster: neurips.cc/virtual/2024...
Paper: arxiv.org/abs/2406.08649

1 year ago 20 6 0 3
John Arevalo
John Arevalo
@johnarevalo
74 Followers 16 Following 1 Posts
Posts Following