Paul Medvedev 's Avatar

Paul Medvedev

@pashadag

Algorithmic Bioinformatics Researcher and Teacher. Posts about research results and educational/mentorship topics (for details, see http://bit.ly/380vX22).

1,855
Followers
158
Following
84
Posts
07.09.2023
Joined
Posts Following

Latest posts by Paul Medvedev @pashadag

How would you design a *multithreaded*, *concurrent* & *dynamic* hash table if you are focused specifically on common k-mer workloads, where streaming query & insertion are common? Jamshed, Prashant and I explore this in kache-hash, a cache-friendly k-mer hash table!
www.biorxiv.org/content/10.6...

17.02.2026 18:49 πŸ‘ 20 πŸ” 13 πŸ’¬ 0 πŸ“Œ 0

I feel it quite possible that those relying on AI from the very start may form a different skill set that will accelerate some software, but result in key regressions (without the expertise to address them) in other types. combine-lab.github.io/blog/2026/02... see the caveats section here…

16.02.2026 16:07 πŸ‘ 12 πŸ” 4 πŸ’¬ 1 πŸ“Œ 0

CVs are broadly distributed across the department. Evaluations are kept to people on the hiring committee (though there is some flexibility around that). I don't know if there are any formal restrictions on the statements, but in practice they are typically shared with people who express interest

12.02.2026 20:50 πŸ‘ 1 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

depends on the materials (letters are treated more confidentially than research statements than CVs)

11.02.2026 21:33 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

At long last, my final PhD chapter is out: we developed a novel evolutionary simulator of bacterial pangenomes, Pansim, fitting it to data from >600K genomes using a likelihood-free framework, PopPUNK-mod, to explore neutral and adaptive pangenome dynamics www.biorxiv.org/content/10.6...

07.02.2026 10:08 πŸ‘ 45 πŸ” 18 πŸ’¬ 2 πŸ“Œ 1

🚨UPCOMING DEADLINES🚨

RECOMB-CG: 13 February
RECOMB-RSG: 15 February
RECOMB-Privacy: 9 March
RECOMB-Seq: 12 March (abstract registration)
RECOMB-Arch: 12 March (abstract registration)
RECOMB-Genetics: 13 March

#RECOMB2026 #deadlines

05.02.2026 20:41 πŸ‘ 6 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0

There is still many Amish villages around where Penn State is. Local farmer markets often carry their produce. I think they speak some form of Germanic, IIRC.

But its not rare in the US that states used to speak the language of the colonizers (cali->spanish, louisiana->french)

07.02.2026 13:53 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Time for a thread on our Christmas preprint β€œOrigin and evolution of acrocentric chromosomes in human and great apes”. I had so much fun with this project and paper. It will be hard to summarize in a thread, but I’ll try www.biorxiv.org/content/10.6... [1/21]

02.02.2026 14:58 πŸ‘ 41 πŸ” 29 πŸ’¬ 1 πŸ“Œ 1

PREPRINT ALERT

I heard you craving for more combinatorics, here are some more for y'all !

04.02.2026 17:22 πŸ‘ 5 πŸ” 4 πŸ’¬ 0 πŸ“Œ 1
Preview
ZOR filters: fast and smaller than fuse filters Probabilistic membership filters support fast approximate membership queries with a controlled false-positive probability $\varepsilon$ and are widely used across storage, analytics, networking, and b...

Preprint alert!
arxiv.org/abs/2602.03525
TLDR:
ZOR filters are STATIC filters with false positives.
-Almost memory optimal: <1% overhead over the theoretical lower bound (!!!)
-Fast queries: ~100 ns
-Construction cannot fail

A thread:

04.02.2026 12:28 πŸ‘ 31 πŸ” 12 πŸ’¬ 1 πŸ“Œ 1
Programs

If you are an Israeli PhD student and are interested in a postdoc at Harvard Medical (my lab included!), I strongly recommend looking into the Kalaniyot fellowship program, providing 2-3 years of full support:
globalprograms.hms.harvard.edu/kalaniyot-hm...

15.01.2026 19:23 πŸ‘ 5 πŸ” 4 πŸ’¬ 1 πŸ“Œ 0
DSB 2026 Venice - February 18-19 Workshop Data Structures in Bioinformatics

The 12th edition of the 2-days workshop β€œData Structures in Bioinformatics” (DSB) will take place in Venice (Italy) on February 18-19th, 2026: dsb-meeting.github.io/DSB2026/

10.12.2025 14:29 πŸ‘ 10 πŸ” 9 πŸ’¬ 1 πŸ“Œ 1

This thread gives really interesting and relevant history!

03.12.2025 15:29 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Kraken 2 (K2) community: we are giving more attention to our new `k2` wrapper, and a NEW functionality since 2.17.0 is: you can build several component K2 indexes, e.g. each covering a different Refseq database, and then query them all at once...
github.com/DerrickWood/... 1/6

03.12.2025 14:08 πŸ‘ 4 πŸ” 4 πŸ’¬ 1 πŸ“Œ 0

Preprint alert!

We introduce new ideas to revisit the notion of sampling with window guarantees, also known as minimizers.

A thread:

02.12.2025 11:11 πŸ‘ 15 πŸ” 7 πŸ’¬ 1 πŸ“Œ 2
International Postdoctoral Fellowship - The Azrieli Foundation The Azrieli Fellows Program is an elite group of academics who cultivate a network of leading professionals in Israel and around the world.

Interested in a post-doc in Israel? The deadline for the Azrieli International Postdoctoral Fellowship is November 19. The fellowship offers generous funding for postdocs to conduct research in any academic discipline at eligible Israeli institutions: azrielifoundation.org/fellows/inte...

10.11.2025 08:59 πŸ‘ 1 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image Post image

Haonan Wu gives a talk on "A k-mer-based estimator of the substitution rate between repetitive sequences"
www.biorxiv.org/content/10.1...
This work tackles the issue of Mash which ignores repeats in the genome, providing better distance estimation #GI2025

06.11.2025 16:38 πŸ‘ 9 πŸ” 4 πŸ’¬ 1 πŸ“Œ 0
Preview
Efficient and accurate search in petabase-scale sequence repositories - Nature MetaGraph enables scalable indexing of large sets of DNA, RNA or protein sequences using annotated de Bruijn graphs.

After years of research and continuous refinement, we’re thrilled to share that our paper on the MetaGraph framework β€” enabling Petabase-scale search across sequencing data β€” has been published today in Nature (www.nature.com/articles/s41...)

08.10.2025 20:56 πŸ‘ 30 πŸ” 17 πŸ’¬ 3 πŸ“Œ 2

And it's posted! If you're interested and eligible, please consider applying through the UMD portal: umd.wd1.myworkdayjobs.com/en-US/UMCP/j....

If you're a PI working in algorithmic genomics (& you can recommend my lab to your top graduating students ;P), please let them know!

08.10.2025 16:53 πŸ‘ 22 πŸ” 21 πŸ’¬ 0 πŸ“Œ 3
Preview
Burrows-Wheeler Indexing - YouTube Videos on : (a) the Burrows-Wheeler Transform (BWT), (b) the FM Index, which uses the BWT to construct a full-text index, (c) Wheeler graphs, (d) r-index, an...

I've added 7 videos to my Burrows-Wheeler indexing playlist (www.youtube.com/playlist?lis...), rounding out the r-index series and adding a 5-part series on the move structure. Now 27 videos in that playlist. I aim to add videos on prefix-free parsing, PBWT, Wheeler languages/automata in the future.

07.10.2025 14:17 πŸ‘ 62 πŸ” 15 πŸ’¬ 2 πŸ“Œ 1

Sounds like someone is trying to solve a bidirected flow problem..

07.10.2025 03:15 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

i've let the person in charge know

03.10.2025 19:11 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

There seems to be a self-contradiction within the CFP, since it also says: "Submissions to peer-reviewed journals other than the partnering ones are also allowed.."

03.10.2025 17:53 πŸ‘ 3 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
Preview
Alice: fast and haplotype-aware assembly of high-fidelity reads based on MSR sketching We introduce Mapping-friendly Sequence Reduction (MSR) sketches, a sketching method for high-fidelity (HiFi) long reads, and Alice, an assembler that operates directly on these sketches. MSR produces ...

Our preprint on our new metagenomic HiFi assembler Alice is out πŸ₯³ Based on a *new sketching method* (🧡1/6)
πŸ‘‰ Preprint www.biorxiv.org/content/10.1...
πŸ‘‰ Github github.com/rolandfaure/...

03.10.2025 14:51 πŸ‘ 25 πŸ” 21 πŸ’¬ 2 πŸ“Œ 0

Alice: fast and haplotype-aware assembly of high-fidelity reads based on MSR sketching https://www.biorxiv.org/content/10.1101/2025.09.29.679204v1

01.10.2025 01:47 πŸ‘ 7 πŸ” 6 πŸ’¬ 0 πŸ“Œ 0

It could be. Or it could be that the decision process is not consistent? Hard to tell...

28.09.2025 22:13 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I see. Do you know if the list of papers that are posted there get disseminated somehow through mail lists or social media?

26.09.2025 17:48 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

They do, but they did not accept our paper. From what we understood, it was because it was a review paper and not novel research

26.09.2025 17:46 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

#RECOMB2026 will be in Thessaloniki, Greece on May 26-29, 2026. Satellites on May 24-25. Save the date!

΀ο συνέδριο #RECOMB2026 ΞΈΞ± πραγματοποιηθΡί στη Ξ˜Ξ΅ΟƒΟƒΞ±Ξ»ΞΏΞ½Ξ―ΞΊΞ·, στις 26-29 ΞœΞ±ΞΞΏΟ… 2026. Οι δορυφορικές Ξ΅ΞΊΞ΄Ξ·Ξ»ΟŽΟƒΞ΅ΞΉΟ‚ ΞΈΞ± διΡξαχθούν στις 24-25 ΞœΞ±ΞΞΏΟ… 2026. Ξ£Ξ·ΞΌΞ΅ΞΉΟŽΟƒΟ„Ξ΅ την ημΡρομηνία!

26.09.2025 15:03 πŸ‘ 23 πŸ” 13 πŸ’¬ 0 πŸ“Œ 1

Hi Gaurav, I'm not sure what you mean. (But it sounds like you are asking for a library with all these implemented in one place? That would be quite an undertaking! As these things are always evolving, I'd guess it would also not age well.

25.09.2025 21:25 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0