Slides are here! docs.google.com/presentation...
I had a great time at @halfstackconf.bsky.social! Check out my slides to build lightning fast data viz on millions of rows with Mosaic and DuckDB-Wasm!
It was great to get to know more of the Phoenix dev community and I picked up some new ideas around AI as well.
#halfstack
Phoenix area front end devs, join me at the HalfStack Conference Jan 30th! Use my link for 10% off!
I'll be talking about 200 FPS data viz in your browser on millions of rows. I'm also excited to meet all of you! Don't miss trivia and karaoke afterwards...
ti.to/halfstack/ha...
Tron soundtrack is goated in my book. Remix is awesome too
Planetscale Postgres and MotherDuck are an awesome combo - super easy to add analytics to your existing platform! Get the benefits of HTAP without the compromises
Simple OLAP cache based on DuckDB, adding instant speed up for your DuckDB queries.
This is a deep dive into OLAP and Caches, a never-ending (love/hate) story π.
Do you like databases?
Do you want to hear two database professors rant about them?
Do you need one of those professors to have a Turing Award for databases?
If yes, then join Mike Stonebraker and I next Wed Dec 10 @ 1:00pm EST for database hot takes: www.dbos.dev/webcast-2025...
Check out the DuckLake roadmap!
What do you think is missing?
There are some very practical features that learn from other table formats, plus some further investment in unique features like data inlining for fast tiny inserts.
ducklake.select/roadmap
Today's Cloudflare outage has affected the DuckDB documentation site. While it seems to be mostly recovered now, it will take some time until everything stabilizes.
If you need to look things up in the DuckDB documentation now, feel free to use our PDF:
duckdb.github.io/duckdb-docs-...
3 options:
- Keep the amazing name Duck Lake
- Get rid of that pesky space and go full DuckLake
- Towny McTownFace
Bird names are the real mvp, no bias here
Today's Future Data Systems Seminar Speaker: Jordan Tigani (@jrdntgn.bsky.social) will present how @motherduck.com supports modern workloads with DuckLake. Zoom talk open to public at 4:30pm ET. YouTube video available after: db.cs.cmu.edu/events/futur...
Maybe a checksum repository would be enough?
Are you streaming into your Lakehouse?
Traditional formats suffered with the βmany small filesβ problem β OLAP engines merge them reactively with long jobs. β³
DuckLake takes a proactive path: Data Inlining + async flush to parquet while always keeping data queryable β‘
I think we might all perish....
when I say βstorage is cheaper nowβ this is what I mean
topicpartition.io/definitions/...
How's the async io boost looking?
Welcome to the flock!!!
How many 6 year old databases get 3x faster sorting??? When it was already world-class??? Amazing stuff from DuckDB!
It me
π We released version 0.3 of the DuckLake specification and the DuckDB ducklake extension today. It includes interoperability with Iceberg, support for geometry types and more.
Check the announcement blog for more details ducklake.select/2025/09/17/d...
Future Data Systems Seminar Schedule (Fall 2025)
Fall 2025 Seminar Schedule:
Sep 22: Apache Iceberg
Sep 29: Apache Hudi
Oct 06: @motherduck.com
Oct 13: SpiralDB Vortex
Oct 27: @singlestore.com
Nov 03: @deltalakeoss.bsky.social
Nov 10: Mooncake
Nov 17: @firebolthq.bsky.social
Nov 24: @xtdb.com
Dec 01: Apache Polaris
Such a fun listen on ducklake and duckdb with @hannes.muehleisen.org and @markraasveldt.bsky.social!
Learned a lot, the future of ducklake looks very bright!
overcast.fm/+AAH1YOLrL6Q
Is says "This is satire" in very fine print all the way at the bottom
Excited to be a keynote speaker at PyData Amsterdam 2025 (September 24β26). My talk is titled 'Minus Three Tier: Data Architecture Turned Upside Down'.
Use code PYDATADB10 for 10% off tickets
amsterdam.pydata.org/conference
#PDAmsterdam2025 #10YearsPDAmsterdam
I've been leading the data infrastructure efforts at my job (I used to work as a Data Engineer in big tech) and the stack we've landed on is so enjoyable to work with.
@dagster.io - Orchestration
@duckdb.org - Database
@motherduck.com - Data Warehouse/storage
DBT - Data modeling
Laurens Kuiper from @duckdb.org presented DuckDB's new memory assignment policy to run multi-join pipelines out-of-core with gracious performance degradation when join hash tables increasingly do not fit in RAM.
A well-attended and -delivered talk!
paper: vldb.org/pvldb/vol18/p2748-kuiper.pdf
Bad news: we had to postpone today's episode.
Good news: @tylerhillery.com has a great blog entry that features some of the Oxide and Friends back catalog you might have missed!
Thank you, I really appreciate it!!