Andi Zimmerer's Avatar

Andi Zimmerer

@andizimmerer

PhD at University of Technology Nuremberg, researching on Database Systems. Formerly engineer at Snowflake Inc. on query acceleration; spent some academic time at MIT πŸ‡ΊπŸ‡Έ, TUM πŸ‡©πŸ‡ͺ and NTU πŸ‡ΈπŸ‡¬. 🎯 Berlin https://www.andi-zimmerer.com

63
Followers
85
Following
28
Posts
29.11.2024
Joined
Posts Following

Latest posts by Andi Zimmerer @andizimmerer

Data Systems Lab | Launching Our Blog And Wrapping Up 2025 I'm super excited to launch our blog! We'll use this space to share what's happening in our lab, from research papers and systems to the day-to-day life of our team. To kick things off, let's look bac...

I hope you've had a great start to the year! I'm excited to announce our blog. We're kicking things off with a look back at everything that happened in 2025.

utndatasystems.github.io/blog/2025/re...

08.01.2026 18:04 πŸ‘ 5 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Post image

Can your database system predict underprovisining before it even happens?

Meet β—’ xBound, the very first framework for join size lower bounds. xBound tells you how many tuples your SQL query will produce *at least*.

Brought to you by @microsoft.com Gray Systems Lab & @utndatasystems.bsky.social.

21.01.2026 09:40 πŸ‘ 3 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0

This is hilarious. I wonder if the chocolate windmill still sits somewhere on a shelf or if it had been devoured in the process of trying to prove it

07.05.2025 16:21 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
"we shall be happy to award a chocolate windmill to the first person..."

"we shall be happy to award a chocolate windmill to the first person..."

I love older papers.

Lenstra and Kan, 1979 "Computational Complexity of Discrete Optimization Problems" Annals of Discrete Mathematics

#orms

06.05.2025 16:45 πŸ‘ 30 πŸ” 10 πŸ’¬ 1 πŸ“Œ 1
Preview
Andi Zimmerer | Pruning in Snowflake: Working Smarter, Not Harder Modern cloud-based data analytics systems must efficiently process petabytes of data residing on cloud storage. A key optimization technique in state-of-the-art systems like Snowflake is partition pru...

"The fastest way of processing data is to not process it."

Our SIGMOD 2025 paper shows how Snowflake skips 99.4% of data with new pruning techniques for LIMIT, top-k, and JOIN queries.

Blog: snowflakepruning.github.io
Paper: arxiv.org/abs/2504.11540

@sigmod2025.bsky.social

05.05.2025 05:09 πŸ‘ 6 πŸ” 1 πŸ’¬ 0 πŸ“Œ 1

Paper clickbait

30.04.2025 09:56 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
a woman is sitting at a table with a dart board in the background and says `` i 'm done '' . ALT: a woman is sitting at a table with a dart board in the background and says `` i 'm done '' .

Camera-ready version of the paper submitted => 115 tabs in Chrome closed.

08.04.2025 06:42 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - utndatasystems/redbench: Redbench is a set of 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations. Redbench is a set of 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations. - utndatasystems/redbench

We just released Redbench, a new benchmark that contains 30 analytical SQL workloads that can be used to benchmark workload-driven optimizations. Go check it out!

GitHub: github.com/utndatasyste...

25.03.2025 21:42 πŸ‘ 4 πŸ” 1 πŸ’¬ 0 πŸ“Œ 1

This.

20.03.2025 22:18 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image Post image

The first day of the BTW Conference in Bamberg is coming to an end.

Some personal favorites:
- Ismail's talk on Pruning in Snowflake
- @stefan-grafberger.com's talk on what-if analysis in ML pipelines and automatically patching ML pipelines in the background
- Observe Inc's presentation

04.03.2025 19:26 πŸ‘ 4 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

Please help spread the word by reposting!

We've just created the official DEEM Workshop account: @deem-workshop.bsky.social

07.02.2025 21:10 πŸ‘ 6 πŸ” 5 πŸ’¬ 0 πŸ“Œ 0

The review comment I'm most proud of: "The paper is [...] a pleasure to read". Thanks anonymous reviewer πŸ™πŸ˜Š

26.02.2025 17:14 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

My very first paper got accepted to @sigmod2025.bsky.social! Yay! Means I'll be playing a home game in Berlin

26.02.2025 17:13 πŸ‘ 0 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

Strong agree. It's just that the "normal" model starts to cause pain on high velocity teams only

14.02.2025 13:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I agree that something has to improve, following the points you outlined. But I also see that c++ has a large path dependency, so change has to be done carefully. I still prefer Rust, but I like the overall discussion around the future of C++

10.02.2025 20:07 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Thanks for sharing, that was an interesting read!

10.02.2025 20:04 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Currently reading through your post.
Nit: typo in "falls in the β€œcontacts” and β€œprofiles” camps of Steve’s list above."

10.02.2025 19:59 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Valid points! I'm still curious what the next decade of C++ will look like

10.02.2025 19:44 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
21st Century C++ – Communications of the ACM

Reading cacm.acm.org/blogcacm/21s... makes me think that Rust was just a giant research project and valuable findings are now being streamed back into C++, making them usable to a broader audience.

10.02.2025 17:31 πŸ‘ 0 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

Agreed. Buddy C++ goes undefined behavior if it's not a strict weak ordering and I've seen it crashing a couple of times as well.

07.02.2025 11:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

47 *is* really good! It very much depends on age, gender and body weight - I am just fortunate enough to be in a lucky group

06.02.2025 15:54 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Garmin showing a VO2 Max of 58

Garmin showing a VO2 Max of 58

My professor jokingly threatened me that I would get fired if my VO2 Max is too low. After a run with him it's at 58 now. I guess I can continue my PhD πŸ˜‹

04.02.2025 19:40 πŸ‘ 1 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

Happy for everyone in that room who got the chance to listen to Ismail. He's smart, pragmatic, and inspiring

31.01.2025 15:52 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

And it took only a tiny amount of month-long convincing given that the other platform is completely broken.

29.01.2025 18:07 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

The Nuremberg Data Systems Lab is now on Bluesky πŸ™Œ @utndatasystems.bsky.social

29.01.2025 18:03 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

In academia, everyone always has a Colleague Working On Exactly This Problem. I still have to find one. Applications open.

15.01.2025 18:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
EDBT/ICDT 2026 Joint Conference - 24th March - 27th March, 2026 - Tampere, Finland

Exciting News! πŸŽ‰
#Tampere will host EDBT/ICDT 2026! ✨
Even before the 2025 edition, the important dates are already out:
πŸ“… Round 1 starts on:
February 5 for EDBT Papers
March 13 for ICDT Papers
edbticdt2026.github.io

We can’t wait to see your great submissions and welcome you to Tampere! πŸ™Œ

14.01.2025 17:30 πŸ‘ 4 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Post image

The @sigmod2025.bsky.social Programming Contest goes into another round. We (Bo Tang, Tilmann Rabl, and myself) just published the timeline and task overview:
sigmod-contest-2025.github.io/index.html

Thanks to Carlo Curino and @microsoft.com for the continued support.

04.01.2025 10:30 πŸ‘ 13 πŸ” 8 πŸ’¬ 1 πŸ“Œ 0

If I’m ever a professor again, I want to give a graduate seminar, topics to include:

- how not to say stupid shit about fields outside your expertise
- what is your expertise, anyway?
- how not to be an insufferable bore
- your PhD doesn’t make you a better person: coping with that

Other ideas?

12.12.2024 20:30 πŸ‘ 2980 πŸ” 182 πŸ’¬ 116 πŸ“Œ 2

I love how dedicated some students are. They are supposed to create a 5min video about a topic and one of them sends me their slides for review. 15(!) detailed(!) slides(!)

12.12.2024 17:25 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0