Sayan Deb Sarkar

@sayandsarkar

PhD in 3D Vision @Stanford | MSc CS @ETH | Ex @Qualcomm, @MercedesBenz W: sayands.github.io

332 Followers | 396 Following | 15 Posts | Joined 03.12.2024

Latest posts by Sayan Deb Sarkar @sayandsarkar

📰 Paper: arxiv.org/abs/2502.15011
▶️ Project Page: sayands.github.io/crossover/
💻 Codebase: github.com/GradientSpaces…

Work w/ Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social ✨

(3/3)

10.06.2025 19:55 👍 2 🔁 1 💬 0 📌 0

🗓️ Thursday 12 June 3:00 p.m. - 3:45 p.m. CDT
📍 OpenSun3D Workshop Poster Session, Arch 211

(2/3)

10.06.2025 19:52 👍 0 🔁 0 💬 1 📌 0

✨ Excited to head off to Nashville for #CVPR2025

🎤 Catch me at the poster sessions or just come say hi to grab ☕

🗓️ Friday 13 June 4:00 p.m. - 6:00 p.m. CDT
📍 Poster Session #2 - Exhibit Hall D, Highlight Poster #346

(1/3)

10.06.2025 19:52 👍 3 🔁 0 💬 1 📌 0

🥳 Excited to share our latest work, WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments, accepted to #CVPR2025 🌐

We present a robust monocular RGB SLAM system that uses uncertainty-aware tracking and mapping to handle dynamic scenes.

10.04.2025 14:58 👍 4 🔁 1 💬 1 📌 2

🏆 CrossOver is accepted as a 𝗛𝗶𝗴𝗵𝗹𝗶𝗴𝗵𝘁 at #CVPR2025! ✨
💻 Fully open-sourced code with all pre-trained checkpoints: github.com/GradientSpac...

📡 Stay tuned for a deep-dive thread and what else we are cooking 🍳

07.04.2025 22:20 👍 5 🔁 1 💬 0 📌 0

Looking forward to it!

02.03.2025 20:45 👍 1 🔁 0 💬 0 📌 0

But the multimodal problem is the same as in image generation tasks: what is the perfect 3D scan given a text input?

28.02.2025 06:45 👍 1 🔁 0 💬 1 📌 0

In this case, what would be a definitive ground truth?

27.02.2025 07:38 👍 0 🔁 0 💬 1 📌 0

Thanks for sharing our work! Yes, I think that’d be a pretty neat downstream application, but maybe it is more multimodal generation than reconstruction.

27.02.2025 03:38 👍 1 🔁 0 💬 1 📌 0
CrossOver: 3D Scene Cross-Modal Alignment
Multi-modal 3D object understanding has gained significant attention, yet current approaches often assume complete data availability and rigid alignment across all modalities. We present CrossOver, a ...

🔗 arXiv: arxiv.org/abs/2502.15011
📂 Project page: sayands.github.io/crossover/

Joint work with Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social 🤝💡

26.02.2025 22:02 👍 5 🔁 1 💬 0 📌 0

🎉 Excited to share our latest work, CrossOver: 3D Scene Cross-Modal Alignment, accepted to #CVPR2025 🌐✨

We learn a unified, modality-agnostic embedding space, enabling seamless scene-level alignment across multiple modalities - no semantic annotations needed! 🚀

26.02.2025 22:02 👍 18 🔁 3 💬 2 📌 3

🔗 arXiv: arxiv.org/abs/2502.15011
📂 Project page: sayands.github.io/crossover/

Joint work with Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social 🤝💡

26.02.2025 21:53 👍 0 🔁 0 💬 0 📌 0

🚀🚀 PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.

1/7

05.12.2024 18:16 👍 69 🔁 21 💬 1 📌 5

Would love to be added! I’m a PhD student working on 3D scene understanding and spatial AI.

10.12.2024 09:28 👍 1 🔁 0 💬 0 📌 0

Would love to be added! I’m a PhD student working on 3D scene understanding and spatial AI.

10.12.2024 09:27 👍 2 🔁 0 💬 1 📌 0

Would love to be added! I’m a PhD student working on 3D scene understanding and spatial AI.

10.12.2024 09:27 👍 1 🔁 0 💬 1 📌 0

Could you add me? I’m a PhD student working on 3D scene understanding.

10.12.2024 09:25 👍 1 🔁 0 💬 1 📌 0