John Lees's Avatar

John Lees

@johnlees.bacpop.org

Pathogen informatics and modelling http://bacpop.org (Research group leader at EMBL-EBI - http://ebi.ac.uk/research/lees/)

1,520
Followers
367
Following
140
Posts
17.10.2023
Joined
Posts Following

Latest posts by John Lees @johnlees.bacpop.org

Post image Post image

New paper showing that bacteria with more genes for cooperation can live in a broader range of habitats and that genes for cooperation are more more likely to be in the accessory genome www.pnas.org/doi/10.1073/... @lauriebelch.bsky.social

04.03.2026 10:23 πŸ‘ 54 πŸ” 24 πŸ’¬ 2 πŸ“Œ 0

except my tokens keep running out

25.02.2026 14:57 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

wow what a backdrop!

18.02.2026 17:31 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
AlphaFold Database welcomes community datasets Latest AlphaFold Database update adds high-value datasets for microbial and viral proteins, generated by specialist communities

Delighted to see over 17 million new protein structure predictions from novel proteins in AllTheBacteria are now integrated into the AlphaFold Database at @ebi.embl.org !
Huge work from @gbouras13.bsky.social @oschwengers.bsky.social and friends to generate these.

www.ebi.ac.uk/about/news/u...

17.02.2026 13:52 πŸ‘ 97 πŸ” 26 πŸ’¬ 1 πŸ“Œ 2

Congratulations! (and also happy to know these cryptic messages are so universal. Do I mean happy really? Well it made me feel better about them at leasy)

09.02.2026 18:02 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

And we think contributes to the 'why prokaryotes have pangenomes' debate on neutrality vs adaptation, hopefully bringing a lot more population genomes to bear on the issue!

Proud of this work and Sam for sticking with some very challenging modelling!

09.02.2026 10:55 πŸ‘ 2 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Core trees, core-accessory distance plots, and model fits for four bacterial species (A) Mycobacterium tuberculosis, (B) Streptococcus pneumoniae, (C) Escherichia coli and (D) Listeria monocytogenes.)

Core trees, core-accessory distance plots, and model fits for four bacterial species (A) Mycobacterium tuberculosis, (B) Streptococcus pneumoniae, (C) Escherichia coli and (D) Listeria monocytogenes.)

This started as an opportunistic reuse of poppunk/sketchlib core-accessory distances, which we noticed differed between species in their shape (e.g. accessory distance at same core; faster/slower accumulation of accessory changes with core)

09.02.2026 10:55 πŸ‘ 1 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0

How do bacterial pangenomes evolve, what controls their dynamics, why do they exist?
Fitting a mechanistic model to 450 species from allthebacteria.org suggesting fast vs slow gene exchange (i.e. amount of MGEs) is a major differentiating factor, correlated with phylogeny rather than lifestyle

09.02.2026 10:55 πŸ‘ 71 πŸ” 32 πŸ’¬ 1 πŸ“Œ 0
Preview
Hugging Face – The AI community building the future. We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Super excited to announce the release of gene and intergenic region annotation from the largest bacterial genome and MAG datasets available, including AllTheBacteria, GTDB, SPIRE, HRGM, mOTUs and MGnify - dereplicated and available from HuggingFace huggingface.co/AllTheBacteria

05.02.2026 13:27 πŸ‘ 16 πŸ” 13 πŸ’¬ 2 πŸ“Œ 0
Person standing outside next to a building wearing a red jacket

Person standing outside next to a building wearing a red jacket

Meet Leonie Johanna Lorenz πŸ‡©πŸ‡ͺ, a Predoctoral Fellow at EMBL-EBI who is bringing mathematical modelling to microbes.

Find out more about how Leonie’s passion for modelling patterns extends from bacterial evolution to sewing 🧡

www.ebi.ac.uk/about/news/p...

02.02.2026 09:58 πŸ‘ 22 πŸ” 8 πŸ’¬ 1 πŸ“Œ 0
Preview
Research Group Leader Do you want to lead groundbreaking research in computational biology? Join us at EMBL-EBI! EMBL's European Bioinformatics Institute (EMBL-EBI) is seeking talented and highly-motivated scientists to jo...

We are hiring for group leaders again β€” EBI is a great place to start your research group!

embl.wd103.myworkdayjobs.com/EMBL/job/Hin...

30.01.2026 09:02 πŸ‘ 66 πŸ” 89 πŸ’¬ 0 πŸ“Œ 2

Fine, Le Guin from now on

19.01.2026 11:43 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Wow

19.01.2026 11:43 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

πŸŽ‰ New year, NEW PREPRINT!

Bacteria exhibit astonishing genetic diversity, but where do new genes come from?

My best friend Arya Kaul (/labmate in the @baym lab) investigates how advantageous deletions can spawn new genes - "deletion-born fusions." 🧡:

06.01.2026 16:09 πŸ‘ 49 πŸ” 30 πŸ’¬ 1 πŸ“Œ 2
STUdying Balancing Evolution (Nfds) To Investigate GEnome Replacement
(also a German word for cat)

STUdying Balancing Evolution (Nfds) To Investigate GEnome Replacement (also a German word for cat)

Thank you for asking, yes Stubentiger does indeed stand for STUdying Balancing Evolution (Nfds) To Investigate GEnome Replacement

06.01.2026 15:49 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
GitHub - bacpop/Stubentiger: An R package for a negative frequency-dependent selection (NFDS) model to describe vaccine type replacement in Streptococcus pneumoniae populations An R package for a negative frequency-dependent selection (NFDS) model to describe vaccine type replacement in Streptococcus pneumoniae populations - bacpop/Stubentiger

There's also an R package: github.com/bacpop/Stube...
So others can more easily apply/reuse/adapt this important model

06.01.2026 15:49 πŸ‘ 0 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

Our new preprint which reimplements "the NFDS model" (of Corander et al) to forecast populations after vaccination as a compartmental model, and uses new bioinformatic tools to create and process the pangenome data

We look at which surveillance strategies are best to correctly forecast changes

06.01.2026 15:49 πŸ‘ 16 πŸ” 6 πŸ’¬ 1 πŸ“Œ 0

good god haha

06.01.2026 15:44 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

At least nothing else is getting worse

05.01.2026 18:34 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

It’s behind cloudflare captcha now (although I’m now wondering if that’s just the genome campus IPs) β€” so paperpile at least can’t auto download the pdf

05.01.2026 18:06 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Looks very promising!!

One thing I didn't quite follow, when you mention 'a novel parameter for trait simulation that accounts for uncertainty in deep ancestral branches of a phylogeny' -- is that referring to the time to first event parameter?

05.01.2026 17:45 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

RIP automated downloads of PDFs from biorxiv it seems, I suppose we have AI to thank for that

05.01.2026 17:05 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Christmas dinner Mostly for personal reference for next year, here were my xmas dinner notes. I forgot about stuffing, and didn’t make yorkshire puddings properly because our main oven is broken (we have a convection ...

Some blogging from over the holiday, something for everyone I'm sure:
Cooking johnlees.me/posts/xmas-d...
Video games johnlees.me/posts/review...
DIY johnlees.me/posts/towel-...

(back to preprints tomorrow)

05.01.2026 13:49 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

We're organising a microbes & deep learning session at SMBE next year -- looking forward to seeing your abstracts!

15.12.2025 11:26 πŸ‘ 14 πŸ” 8 πŸ’¬ 0 πŸ“Œ 0

Nice to see some of our tools getting used in public health
(in this case PopPIPE, which is downstream subclustering/transmission tool to be run on PopPUNK data)

25.11.2025 09:10 πŸ‘ 18 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

The data was collected by the CABBAGE project, which has just been preprinted too: www.biorxiv.org/content/10.1...

19.11.2025 12:27 πŸ‘ 0 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

Happy to share our new AMR resource which has phenotypic AMR (usually MIC data) collected from publications and databases. This is paired with assemblies and annotations

We're excited for users who might train new models, find phenotype/genotype mismatches, or any other use

19.11.2025 12:27 πŸ‘ 62 πŸ” 36 πŸ’¬ 1 πŸ“Œ 0
Preview
Historical genomics of the declining red squirrel in Britain | Aries Dr Anders BergstrΓΆm, School of Biological Sciences, University of East Anglia Professor Cock van Oosterhout, School of Environmental Sciences, University of East Anglia Dr Selina Brace…

A PhD project on historical genomics in the declining red squirrel in Britain is available in my group, through the @aries-dtp.bsky.social. Use historical genomes to track the effects of decline and genetic rescue in this charismatic species. aries-dtp.ac.uk/studentships...

17.11.2025 13:12 πŸ‘ 62 πŸ” 41 πŸ’¬ 1 πŸ“Œ 2

Congratulations to Zam!

14.11.2025 14:26 πŸ‘ 6 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - samhorsfield96/ExpEvoAnalyzer: A workflow to analyse experimental evolution data. A workflow to analyse experimental evolution data. - samhorsfield96/ExpEvoAnalyzer

Just a quick plug: I've made a few updates to ExpEvoAnalyzer (variant functional annotation in experimental evolution studies) to use bwa as well as ska2, and to use existing or de novo annotations. It just might help streamline your pesky bioinformatics analysis! github.com/samhorsfield...

07.11.2025 15:07 πŸ‘ 3 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0