π° Paper: arxiv.org/abs/2502.15011
βΆοΈ Project Page: sayands.github.io/crossover/
π» Codebase: github.com/GradientSpacesβ¦
Work w/ Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social β¨
(3/3)
10.06.2025 19:55
π 2
π 1
π¬ 0
π 0
ποΈ Thursday 12 June 3:00 p.m. - 3:45 p.m. CDT
π OpenSun3D Workshop Poster Session Arch 211
(2/3)
10.06.2025 19:52
π 0
π 0
π¬ 1
π 0
β¨ Excited to head off to Nashville for #CVPR2025
π€ Catch me at the poster sessions or just come say hi to grab β
ποΈ Friday 13 June 4:00 p.m. - 6:00 p.m. CDT
π Poster Session #2 β Exhibit Hall D Highlight Poster #346
(1/3)
10.06.2025 19:52
π 3
π 0
π¬ 1
π 0
π₯³Excited to share our latest work, WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments, accepted to #CVPR2025 π
We present a robust monocular RGB SLAM system that uses uncertainty-aware tracking and mapping to handle dynamic scenes.
10.04.2025 14:58
π 4
π 1
π¬ 1
π 2
π CrossOver is accepted as a ππΆπ΄π΅πΉπΆπ΄π΅π at #CVPR2025! β¨
π» Fully open-sourced code with all pre-trained checkpoints: github.com/GradientSpac...
π‘ Stay tuned for a deep-dive thread and what else we are cooking π³
07.04.2025 22:20
π 5
π 1
π¬ 0
π 0
Looking forward to it!
02.03.2025 20:45
π 1
π 0
π¬ 0
π 0
But, the multimodal problem is same as in image generative tasks β as in, what is the perfect 3D scan given a text input?
28.02.2025 06:45
π 1
π 0
π¬ 1
π 0
In this case, what would be a definitive ground truth?
27.02.2025 07:38
π 0
π 0
π¬ 1
π 0
Thanks for sharing our work! Yes, I think thatβd be a pretty neat downstream application but maybe it is more multimodal generation rather than reconstruction.
27.02.2025 03:38
π 1
π 0
π¬ 1
π 0
π Excited to share our latest work, CrossOver: 3D Scene Cross-Modal Alignment, accepted to #CVPR2025 πβ¨
We learn a unified, modality-agnostic embedding space, enabling seamless scene-level alignment across multiple modalities β no semantic annotations needed!π
26.02.2025 22:02
π 18
π 3
π¬ 2
π 3
ππPaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.
1/7
05.12.2024 18:16
π 69
π 21
π¬ 1
π 5
Would love to be added! Iβm a PhD student working on 3D scene understanding and spatial AI.
10.12.2024 09:28
π 1
π 0
π¬ 0
π 0
Would love to be added! Iβm a PhD student working on 3D scene understanding and spatial AI.
10.12.2024 09:27
π 2
π 0
π¬ 1
π 0
Would love to be added! Iβm a PhD student working on 3D scene understanding and spatial AI.
10.12.2024 09:27
π 1
π 0
π¬ 1
π 0
Could you add me? Iβm a PhD student working on 3D scene understanding.
10.12.2024 09:25
π 1
π 0
π¬ 1
π 0