Very nice post! I've long been thinking we should get models and datasets from package managers.
On the conda side, I've been wanting to do a CEP or server to automatically package arbitrary artifacts while avoiding duplicating the storage.
Very nice post! I've long been thinking we should get models and datasets from package managers.
On the conda side, I've been wanting to do a CEP or server to automatically package arbitrary artifacts while avoiding duplicating the storage.
At @quantstack.bsky.social we designed novel bit-unpacking SIMD optimizations for @arrow.apache.org and #ApacheParquet, and implemented them entirely using C++ metaprogramming instead of Python-based code generation.
We'll publish a deep dive blog post soon.
github.com/apache/arrow...
Everybody is talking about secure Python sandboxes for LLM code execution, but what about using the browser sandbox?
Quick demo using JupyterLite and the Pyodide kernel 💡
A viewer for Parquet, SQLite, and Avro files in JupyterLab.
Check out our new JupyterLab extension: Arbalister. 🏹
Built upon Apache Datafusion, @jupyter.org , and @arrow.apache.org , it lazily fetches rows so that you can view files larger than memory!
blog.jupyter.org/instantly-vi...
We are thrilled to introduce notebook.link, a platform that lets you create, share, and run Jupyter notebooks instantly in your browser.
Powered by JupyterLite and WebAssembly, it supports Python, R, C++, and a full in-browser terminal experience.
📖 Read the full story: medium.com/@QuantStack/...
SIMD coding is hard: platforms, inconsistencies, lane constraints... but xsimd abstracts a lot away. With my first contributions, I improved byte shuffling, now available in the latest 14.0 release.
#C++ #SIMD #xsimd #openSource
I'm getting some of it as well. You can mark *all* notifications as read with
`gh api -X PUT notifications -F "last_read_at=$(date -u +'%Y-%m-%dT%H:%M:%SZ')"`
Register for the Apache Arrow Summit Paris 25 (October 2nd) at: www.meetup.com/pydata-paris...
The event is hosted by @pydataparis.bsky.social
We are looking forward to seeing you there and talking about all things Arrow.
I'll be attending the event, looking forward to meet Python folks!
As for constraints, you can model all decision variables as binaries and express conjunctive normal form as linear constraints.
In both cases (SAT and ILP), I think you'd need to make dedicated heuristics to have smth reasonable for package manager, so it boils down to the most adaptable codebase.
What would be your optimization objective? Probably some heuristic of what it means to be "up to date" with open questions as "Is it better to have indirect dependency very outdated rather than a direct one slightly outdated", but you'd also need to defined "very" and "slightly".