See in action how EU sovereign HPC AI + Ray make it easy to scale document processing with Docling: georgheiler.com/2026/02/22/m...
Multimodal data handling is different, especially with regard to complexity and cost. Daniel Gafni and I built Metaxy (docs.metaxy.io) to simplify efficient multimodal pipelines.
Read how Magenta (@geoheil.com and @milicevica23.bsky.social) uses Dagster+ to feel a bit of that joy: dagster.io/customers/ho...
And docs.metaxy.io/main/ also integrates with, e.g., Lance for yet another kind of versioning.
Don't forget Lance (lancedb.com/blog/branchi...) for multimodal git for data.
A European sovereign GPU cloud does not come out of nowhere. Maybe this project can help make HPC systems more accessible. The recently started projects will take a long time to complete. I hope github.com/ascii-supply... will help.
#great talk www.youtube.com/watch?v=WRg1... on #architecture for #engineers
The OSACon recordings are available now www.youtube.com/watch?v=31LH...
Interesting new #timeseries #database #xtdb: xtdb.com. See the great #cmu video for details: www.youtube.com/watch?v=zzqD...
github.com/mxschmitt/ac... #tmux #action-tmate - really neat #debugging
A great video about LLMs and the data they can provide to the world - even though perhaps they should not | www.youtube.com/watch?v=O7BI... - DEF CON 33 - Exploiting Shadow Data from AI Models and Embeddings - Patrick Walsh
#rust #fory #serialization fory.apache.org/blog/fory_ru...
The real AI win isn't superhuman agents, it's scaled mediocrity.
Doing less with less at massive scale unlocks tasks that were once uneconomical.
The magic is in aggregate value, not perfect outputs. Empower teams with practical AI tools.
https://dlthub.com/blog/the-real-ai-win-scaled-mediocrity
#dsc-dach #data It was a pleasure to share an introductory workshop about Spark and data pipelines. Thank you, Aleks, for the great collaboration!
Find the workshop files here if you want to follow along github.com/l-mds/dsc-da...
Something about super and computing is in the making. Anyone daring out there who wants to explore? Or folks who want to exchange ideas about SLURM, HET jobs and advanced resource management? github.com/ascii-supply...
good point. I think I only have < 1 hour so BI/vis will have to wait a bit. But otherwise it would be a great addition
#duckdb #dagster #ray #ducklake
Simple Sovereign Scalable Data Stack: georgheiler.com/event/tdwi-2... Precursor: pypi.org/project/dags... github.com/dagster-io/c... If you want to see this in action, join in Nürnberg or Vienna for some sovereign, scalable data talks in the coming weeks.
#duckdb #multimodal #rag www.youtube.com/watch?v=2qSZ... blobs.duckdb.org/events/duckd...
#compliance #anonymization #python www.youtube.com/watch?v=EqQd...
#gis #medium-data #sedona #rust #datafusion sedona.apache.org/latest/blog/...
DuckDB 1.4.0 is out! This is our first LTS release which comes with *one year of community support*. It also supports database encryption, the MERGE SQL statement and Iceberg writes.
For more details, read the announcement blog post at
duckdb.org/2025/09/16/a...
A living Elo leaderboard for analytics/OLAP engines. Each public benchmark (TPC-DS/H, SSB, vendor & community posts) becomes a "match." Upsets + context matter. Browse the board & poke holes: rebrand.ly/ey6y7hf
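For readers new to Elo, here is a minimal sketch of how one benchmark "match" could move two ratings. It uses the textbook Elo update. The K-factor of 32, the 1500 starting rating, and the engine names are assumptions for illustration, not details taken from the leaderboard itself.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Textbook Elo expectation that A beats B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return the new (rating_a, rating_b) after one match.

    score_a is 1.0 if A won the benchmark, 0.5 for a tie, 0.0 if A lost.
    k = 32 is an assumed K-factor, not the leaderboard's actual setting.
    """
    delta = k * (score_a - expected_score(rating_a, rating_b))
    return rating_a + delta, rating_b - delta


# Hypothetical engines, both starting at an assumed 1500.
ratings = {"engine_a": 1500.0, "engine_b": 1500.0}
ratings["engine_a"], ratings["engine_b"] = update_elo(
    ratings["engine_a"], ratings["engine_b"], score_a=1.0
)
print(ratings)
```

An upset (a low-rated engine beating a high-rated one) moves both ratings more than an expected win does, which is why upsets matter on such a board.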
#owasp now gearing up for #llm and #genai - Multi-Agentic system Threat Modeling Guide v1.0 genai.owasp.org/resource/mul...
So we go back to faster-than-S3 alternatives?