Chris Mungall (@cmungall)

Every Valentine’s Day I ponder the ontological, ontogenic, and phylogenetic basis of the metazoan primary pulsatile organ. Enjoy this old thread (originally posted on twitter many years ago, the import didn’t preserve the dates)

14.02.2026 16:16 👍 1 🔁 0 💬 0 📌 0

I agree but it’s a big jump outside of comfort zone for some!

13.02.2026 01:22 👍 0 🔁 0 💬 0 📌 0

AI4Curators - AI Guides Documentation for AI for curation

You can also find training material, how-to guides, links to tools, tips for making your knowledge base agentic curation hallucination-resistant etc here: ai4curation.io/aidocs/

12.02.2026 15:59 👍 0 🔁 0 💬 0 📌 0

OBO Foundry

For staying up to date, I recommend joining the Monarch/OBO Academy (free) and following along with the excellent training material on all things semantics and AI. Find a link to our Slack here: obofoundry.org

12.02.2026 15:59 👍 0 🔁 0 💬 1 📌 0

Gene Ontology Curators AI Workshop (Part 1) Goals: Equip curators with general purpose AI skills and literacy, usable in a variety of different contexts, and to unblock paths to continued exploration. By the end of training curators will b...

The first part was 4 hours, and a mix of foundational basics and hands on activities (thanks to Jonah Cool of @anthropic.com for complimentary Pro accounts!). Slides + recordings here:
doi.org/10.5281/zeno...

12.02.2026 15:59 👍 0 🔁 0 💬 1 📌 0

Gene Ontology Curators AI Workshop (Part 1) Goals: Equip curators with general purpose AI skills and literacy, usable in a variety of different contexts, and to unblock paths to continued exploration. By the end of training curators will b...

If you have more time and are looking for more of a foundational introduction to genAI (with lots of bio examples, and no maths) we are running a training series for members of the @geneontology.bsky.social consortium.

12.02.2026 15:59 👍 1 🔁 1 💬 1 📌 0

By the way, codex is also great, and we'd love to try and incorporate more material on opencode and other tools, but we have limited time and resources, and more of us were familiar with CC, so we went with that.

12.02.2026 15:59 👍 0 🔁 0 💬 1 📌 0

GitHub - ai4curation/icbo-ai-tutorial: Material for 2025 ICBO AI Tutorial. Material for 2025 ICBO AI Tutorial. . Contribute to ai4curation/icbo-ai-tutorial development by creating an account on GitHub.

For these kinds of workshops it can be a challenge getting everyone set up with both agent code installation AND coordinating subscription access. We found GitHub spaces works great for this, everyone gets a vscode + claude code + skills directly in their browser! github.com/ai4curation/...

12.02.2026 15:59 👍 0 🔁 0 💬 1 📌 0

ICBO 2025: Accelerating Ontology Curation with Agentic AI and GitHub YouTube video by Monarch Initiative

We also ran a workshop at ICBO last year "Accelerating Ontology Curation with Agentic AI and GitHub" aimed at ontology developers where we had hands-on session using CC for live ontology editing. www.youtube.com/watch?v=_9Re...

12.02.2026 15:59 👍 1 🔁 0 💬 1 📌 0

A key message here is that coding agents are not just for code! Yes, most run in the terminal or vscode, and they get a lot of their power from running command line tools. But you don't need to know anything about the command line! Coding agents can edit any kind of file (and any kind of verifier)

12.02.2026 15:59 👍 0 🔁 0 💬 1 📌 0

Monarch Ontology Training - OBO Semantic Engineering Training

Part 2 will be posted here shortly: oboacademy.github.io/obook/course...

12.02.2026 15:59 👍 0 🔁 0 💬 1 📌 0

Getting Started with Claude Code - OBO Semantic Engineering Training

As part of the @monarchinitiative.bsky.social /OBO Academy series, we had @christabone.bsky.social give us a two part introduction to "Efficient Biocuration and Bioinformatics with Claude Code". Part 1 (video and hands-on material) is here: oboacademy.github.io/obook/tutori...

12.02.2026 15:59 👍 2 🔁 1 💬 1 📌 0

What are those tools? I have been waiting for the agent harness that marries the power of a coding agent with a less intimidating UI. There are some great candidates: Goose (has CLI + UI), Claude Desktop, now Claude Co-work. But increasingly I'm recommending: go straight for a coding agent tool!

12.02.2026 15:59 👍 0 🔁 0 💬 2 📌 0

Staying in the Loop: A Biocurator's Guide to Agentic AI Developments Staying in the Loop: A Biocurator’s Guide to Agentic AI Developments Large language models have rapidly become part of everyday scientific workflows, yet most biocurators still interact with AI primar...

This week I participated in the excellent @biocurator.bsky.social virtual AI workshop. I presented some general tips for learning about agents. zenodo.org/records/1861... A lot of the advice comes down to: find time to learn+don't wait for the perfect curation tool, start using existing agent tools!

12.02.2026 15:59 👍 1 🔁 2 💬 1 📌 0

Over the last few months I've been helping organize various tutorials and workshops on agentic AI, aimed mostly at biocurators, ontology developers, and PIs of knowledge bases / data resources. Some of this might be generally useful to folks who don't identify as a 'technical' or an 'AI' person.🧵

12.02.2026 15:59 👍 9 🔁 4 💬 1 📌 0

To be fair, the main finding was the delta between LLMs alone and LLMs in hands of users: “We identify user interactions as a challenge to the deployment of LLMs for medical advice”. Current models blow away 4o, and likely more forgiving of inexperienced users, but I suspect the delta remains

12.02.2026 06:19 👍 2 🔁 0 💬 0 📌 0

Claude code in a loop plus some markdown files for skills and agents is all you need!

16.01.2026 06:19 👍 1 🔁 0 💬 1 📌 0

Ralph Wiggum from the Simpson's looking dopey, with text underneath him saying "I'm helping".

Last year we made a CLI wrapper for different deep research APIs. As a baseline implementation we do a simple Claude Code in a loop. It works rather well!
Well, I discovered there is a name for this pattern: Ralph. We made a Ralph Wiggum deep researcher. monarch-initiative.github.io/deep-researc...

12.01.2026 02:40 👍 4 🔁 1 💬 0 📌 0

Ontologically, ontogenetically, and phylogenetically, yes

20.12.2025 19:29 👍 3 🔁 0 💬 0 📌 0

Phage Foundry

📣 New preprint from us at phagefoundry.org 📣
A solid machine learning framework & to predict strain-level phage-host interactions across diverse bacterial genera from genome sequences alone. Avery Noonan from the Arkin Lab led this massive effort
www.biorxiv.org/content/10.1...

16.11.2025 17:58 👍 27 🔁 17 💬 1 📌 0

Chris Mungall (@Cmungall@genomic.social) Attached: 1 image How can we scale up manual classification of chemical structures in databases like ChEBI? Can we help curators place new structures into classes like "terpenoid", based on their che...

See the thread (from the original arXiv preprint) over on Mastodon: genomic.social/@Cmungall/11...

03.10.2025 14:58 👍 1 🔁 0 💬 0 📌 0

Chemical classification program synthesis using generative artificial intelligence - Journal of Cheminformatics Accurately classifying chemical structures is essential for cheminformatics and bioinformatics, including tasks such as identifying bioactive compounds of interest, screening molecules for toxicity to humans, finding non-organic compounds with desirable material properties, or organizing large chemical libraries for drug discovery or environmental monitoring. However, manual classification is labor-intensive and difficult to scale to large chemical databases. Existing automated approaches either rely on manually constructed classification rules, or are deep learning methods that lack explainability. This work presents an approach that uses generative artificial intelligence to automatically write chemical classifier programs for classes in the Chemical Entities of Biological Interest (ChEBI) database. These programs can be used for efficient deterministic run-time classification of SMILES structures, with natural language explanations. The programs themselves constitute an explainable computable ontological model of chemical class nomenclature, which we call the ChEBI Chemical Class Program Ontology (C3PO). We validated our approach against the ChEBI database, and compared our results against deep learning models and a naive SMARTS pattern based classifier. C3PO outperforms the naive classifier, but does not reach the performance of state of the art deep learning methods. However, C3PO has a number of strengths that complement deep learning methods, including explainability and reduced data dependence. C3PO can be used alongside deep learning classifiers to provide an explanation of the classification, where both methods agree. The programs can be used as part of the ontology development process, and iteratively refined by expert human curators.

We developed and evaluated a method to learn python chemical structure classifiers using LLMs. These can give classifications+explanations at runtime. With @jannahastings.bsky.social @justaddcoffee.bsky.social Noel O'Boyle, Daniel Korn, Adnan Malik jcheminf.biomedcentral.com/articles/10....

03.10.2025 14:57 👍 8 🔁 0 💬 1 📌 0

A busy tool wall in a shed. At the bottom there are instructions saying "Find the 10 hidden enhancers!" Across the wall between the tools are 10 enhancers, represented as DNA helices, but they are difficult to find in the style of a "hidden object" puzzle. Original photo by Lachlan Donald, https://www.flickr.com/photos/lox/9408028555

Hiding in plain sight - how close are we to mapping ALL 🧬enhancers🧬 in the genome?

Our new paper by Mannion et al. takes a systematic look at "hidden enhancers" and why they remain so hard to find. With @mosterwalder.bsky.social, @jlopezrios.bsky.social & many more

www.nature.com/articles/s41...

08.08.2025 18:09 👍 45 🔁 20 💬 2 📌 2

rbio1-training scientific reasoning LLMs with biological world models as soft verifiers Reasoning Models are typically trained against verification mechanisms in formally specified systems such as code or symbolic math. However, in open domains like biology, we do not generally have acce...

Check out the pre-print here www.biorxiv.org/content/10.1.... Not sure if the other authors beyond @tkaraletsos.bsky.social are on bsky #CellBiology #AI #GeneSky #genomics #VirtualCell

25.08.2025 00:41 👍 1 🔁 0 💬 0 📌 0

One super pedantic minor ontological pet peeve is the use of the term "simulation", since that leads me to expect a agent-based or physics-style simulation of cell perturbations. But in fact this pattern could be used for those too! And I guess the terminological horse has long bolted here..

25.08.2025 00:41 👍 3 🔁 0 💬 1 📌 0

two dalek robots are standing next to each other in a room with the words 76totterslane above them Alt: Dalek meme: Daleks saying "EX-PLAIN" (alluding to use of technique to make foundation models, the "daleks", more explainable)

But of course rBio is very cool independent of my nerdy obsession with FMs using ontologies/KGs! This general distillation pattern is likely to be very useful for integrating knowledge with the weights in massive omics FMs..

25.08.2025 00:41 👍 1 🔁 0 💬 1 📌 0

For another use of ontologies in genomic foundation models, see the recent AlphaGenome paper bsky.app/profile/cmun...

25.08.2025 00:41 👍 0 🔁 0 💬 1 📌 0

yoda from star wars is smoking a cigarette and says `` teach you i will '' Alt: Yoda meme: "Teach you I will". Alluding to using the ontology as a "teacher" in the RL loop

Aside: I find that too many "defenses" of ontologies/KGs in the face of genAI fall back on a kind of GraphRAG use case, where the ontology/KG is used as some kind of bullwark against hallucination. Valid... but they can do so much more! Using as teacher in RL-loop on reasoner traces is v cool!

25.08.2025 00:41 👍 1 🔁 0 💬 1 📌 0

Table 1. Verifiers used during RL training and their descriptions, as well as example prompts. Verifiers: "Exp" is experimental; "MLP" is multi-layer perceptron; "TF" is Transcriptformer; "GO" is Gene Ontology.

In order to fine tune the reasoner model, the authors used three kinds of soft verifiers in the RL loop - experimental (e.g. CRISPRi knockdown), "simulation" (e.g Transcriptformer), and knowledge-based. For knowledge-based, they used GO @geneontology.bsky.social!

25.08.2025 00:41 👍 1 🔁 0 💬 1 📌 0

Fig 7 from paper - The figure shows three example responses to the query “Is a knockdown of ISCA2 in RPE1 cells likely to result in differential expression of CEP295?”, each demonstrating different reasoning strategies. Basic answer: ISCA2 is linked to cell cycle progression and DNA repair, so its knockdown could affect CEP295 expression, though experimental data would be needed to confirm directionality. Chain-of-Thought: Provides background—ISCA2 is involved in cell cycle regulation; CEP295 in cilia formation. Knockdown of ISCA2 may influence cell cycle–related genes but there’s no direct evidence connecting it to CEP295 regulation. Self-aware Chain-of-Thought: Notes ISCA2’s role in autophagy and related processes, but emphasizes that its relationship to CEP295 is indirect. Suggests that literature review would be required for confirmation, and stresses the absence of direct experimental evidence, while acknowledging possible indirect effects. Overall, all answers converge on the idea that ISCA2 knockdown could plausibly influence CEP295 but highlight the uncertainty and need for direct experimental validation.

The applications of this are very interesting, allowing for interrogation in natural language, as well as background reasoning over the wealth of biology in the literature. So you can ask what happens to other genes if you knock down a gene in a cell type, and get a biological explanation

25.08.2025 00:41 👍 3 🔁 0 💬 1 📌 0

Chris Mungall

Latest posts by Chris Mungall @cmungall