Claire McWhite's Avatar

Claire McWhite

@clairemcwhite

Systems Bio, comparing things to each other, protein language models, plants, Asst prof UArizona MCB

58
Followers
84
Following
4
Posts
14.04.2025
Joined
Posts Following

Latest posts by Claire McWhite @clairemcwhite

Become a colleague in my department!
Applications close in four days, December 15.

11.12.2025 19:15 πŸ‘ 4 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
Post image Post image

Alignment of each motif concept peaks at the position of that motif in the protein. We also detect a few motifs absent from individual databases, though these are typically annotated in other databases. 4/4

08.12.2025 22:45 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Building on the idea of Concept Activation Vectors from arxiv.org/pdf/1711.11279 3/4

08.12.2025 22:45 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

We take embeddings of protein fragments w/ and w/o a motif, train a simple linear classifier, and use the normal vector to the decision boundary as the β€œmotif direction.” So for motif detection, all you need is a dictionary of learned motif concept vectors, and a PLM to embed the protein with. 2/4

08.12.2025 22:45 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image Alignment profiles of several motifs scanning across a protein. Each line represents a concept activation vector from a different layer.

Alignment profiles of several motifs scanning across a protein. Each line represents a concept activation vector from a different layer.

Vision models have directions in embedding space for concepts like β€œstripes” or "corgi"
We show that protein language has directions for motifs, use this as a new way to detect and localize motifs!
New preprint w/@ahmadshamail.bsky.social
Feedback very welcome arxiv.org/abs/2511.21614 1/4

08.12.2025 22:45 πŸ‘ 8 πŸ” 2 πŸ’¬ 1 πŸ“Œ 1