FLUX.2 klein inference in pure C, written in a weekend with 0 manual code
Excited to share our new research at Jasper Research!
LBM: Latent Bridge Matching for Fast Image-to-Image Translation
Try out our @hf.co space for object relighting!
@gradio-hf.bsky.social demo: huggingface.co/spaces/jaspe...
Paper: arxiv.org/abs/2503.07535
Repo: github.com/gojasper/LBM
Amazing!
Wow awesome work!!
Alright who's making this
oh wow looks awesome, somehow missed it
Hopefully they'll remove some of the FP16 nerfs they have on the 4090. The 2-slot form factor is also a nice improvement for multi-GPU builds
How does one keep track? The monodepth/tracking field these days:
Align3R: estimates camera poses and consistent depth maps from monocular videos.
Combining it with trackers like Cotracker3 or SAM2 could unlock many fun applications! (cf. VideoDoodles by Yu et al.)
Project page (with demo): igl-hkust.github.io/Align3R.gith...
Code: github.com/jiah-cloud/A...
Am I the only one amazed that this is what 2*4TB (with thermal case) looks like now?
Wall Street seems to take the news well though
quite the illustration
The UX of LEGO Interface Panels interactionmagic.com/UX-LEGO-Inte...
Banned from bsky or HF?
fair enough!
Oh no, I'm torn between rofl at this troll and fear of seeing this little drama escalate
Why the preference for multiview? Maybe something like github.com/ttxskk/AiOS can be adapted/finetuned with multiple views from a synthetic dataset like microsoft.github.io/SynthMoCap/
oh nice, bookmarked!
The past few months have been... intense! There's still quite some work to do before the finish line, but excited to launch in the coming weeks
And early social media platforms!
Resemble Enhance seems pretty good: github.com/resemble-ai/...
Adobe Podcast V2 is a really impressive audio enhancer.
Is there any open-source tech close to it?
bsky.app/profile/pins...
SAMURAI: improve the tracking robustness of SAM2 with 2 main contributions:
- adding motion information to the mask selection
- curating the memory bank based on motion cues
Project: yangchris11.github.io/samurai
Code: github.com/yangchris11/...
Paper: arxiv.org/abs/2411.11922
Pyramid Flow is quite impressive for img2video, given that it was only trained on public datasets. Clearly not as dynamic and stable as commercial solutions, but the gap seems to be closing github.com/jy0205/Pyram...
A bit surprised by this data from Clerk on sign-in method preferences: from a sample of 2.5M sign-ins, <2% of users chose to use magic links.
Yessss
"A nicely maintained and over-spec'd server just has a smell to it" - great writeup by @kcimc.bsky.social benchmarking various cloud GPU providers for a realtime diffusion installation: kcimc.medium.com/realtime-dif...
new life goal: be added to that list