DNA language model fine-tuning and inference | Héctor Climente-González
Using Hugging Face transformers
I've spent the last few weeks digging into InstaDeep's Nucleotide Transformer DNA language model using @hf.co 🤗
If you're keen on seeing how LLM libraries can be applied to DNA, my latest post breaks it all down. Give it a read: hclimente.eu/blog/hf-tran...
30.05.2025 22:22
👍 2
🔁 0
💬 0
📌 0
Why I'm more worried about AI safety now than 6 months ago
Exponentials are all you need
New post: things have been moving very fast in AI. But has safety caught up to capabilities? open.substack.com/pub/naix/p/w...
09.05.2025 19:59
👍 10
🔁 3
💬 0
📌 1
An intro to uv | Héctor Climente-González
A Swiss Army Knife for Python data science
I have been using uv to manage my Python projects lately. Faster setups, better reproducibility, and it even trimmed down my stack.
Here's a short tutorial if you're curious: hclimente.eu/blog/python-...
27.04.2025 14:10
👍 0
🔁 0
💬 0
📌 0
I’m a Lead Scientist at Novo Nordisk in London, applying ML to genetics, epigenetics and real-world data. I’m particularly into sequence learning, explainable AI & graph-based methods.
27.04.2025 10:32
👍 2
🔁 0
💬 0
📌 0
SHAP values | Héctor Climente-González
A model-agnostic framework for explaining predictions
Never too late for cool science! I just put together a post on SHAP values — a popular model-agnostic approach to explain model predictions. Would love to hear your thoughts or critiques!
#MachineLearning #ExplainableAI #XAI #SHAP
20.04.2025 01:41
👍 2
🔁 0
💬 0
📌 0
In our updated TraitGym preprint (w/ @gonzalobenegas.bsky.social & Gökcen Eraslan), we evaluate Evo 2 on regulatory variants associated with human traits. We see marked performance gains with scale on Mendelian traits, although still a bit behind alignment-based methods.
doi.org/10.1101/2025...
1/n
04.03.2025 19:54
👍 33
🔁 13
💬 1
📌 2
RNA xkcd.com/3056
26.02.2025 14:58
👍 17839
🔁 2559
💬 154
📌 171
Part one of a collaboration with @3blue1brown.bsky.social on presenting the mathematics of the cosmic distance ladder in an accessible fashion.
08.02.2025 17:54
👍 90
🔁 8
💬 3
📌 3
A DNA language model based on multispecies alignment predicts the effects of genome-wide variants https://www.nature.com/articles/s41587-024-02511-w (read free: https://rdcu.be/d5oQZ) 🧬🖥️🧪 https://github.com/songlab-cal/gpn
03.01.2025 09:10
👍 20
🔁 5
💬 2
📌 0
Thanks for sharing, I’ll keep an eye out.
20.11.2024 14:54
👍 0
🔁 0
💬 0
📌 0
I would love to know more about the piece of work on caQTLs. Is it published?
20.11.2024 14:16
👍 1
🔁 0
💬 1
📌 0