Ravid Shwartz Ziv

@shwartzzzivravid

Faculty Fellow and Assistant Professor at NYU's Center for Data Science

479 Followers · 1,225 Following · 51 Posts · Joined 18.09.2024

Latest posts by Ravid Shwartz Ziv @shwartzzzivravid

How DeepSeek changed Silicon Valley's AI landscape | TechCrunch
Chinese AI lab DeepSeek provoked the first Silicon Valley freak-out of 2025. Here's what it could mean for American AI policy.

CDS Research Scientist Ravid Shwartz-Ziv (@shwartzzzivravid.bsky.social) recently provided expert analysis on DeepSeek's latest AI developments in TechCrunch.

techcrunch.com/2025/01/30/h...

10.03.2025 18:37 👍 0 🔁 1 💬 0 📌 0

I have an awesome idea that no one has tried before - RL on math datasets 🤯
You will have a natural verifier!

12.02.2025 14:04 👍 1 🔁 0 💬 0 📌 0

Now that the ICML deadline is over, well done to all students! And next time, please, please, please don't wait until the last moment, I'm too old for that... 🙏

31.01.2025 15:28 👍 0 🔁 0 💬 0 📌 0

Go read our paper about lazy layers!

23.01.2025 01:30 👍 3 🔁 1 💬 0 📌 0

Check out our paper for detailed experiments and explanations on how we're making AI systems more reliable by helping them better express their uncertainty!

Thank you to Tal Zeevi (who did all the work!), @yann-lecun.bsky.social, Lawrence Staib, and John Onofrey
The Paper - arxiv.org/abs/2412.07169

14.01.2025 16:34 👍 1 🔁 0 💬 0 📌 0

The results? In medical imaging, Rate-In maintains sharp uncertainty estimates around critical anatomical boundaries, while traditional methods get fuzzy. We demonstrate superior performance across different noise levels and benchmarks!

14.01.2025 16:34 👍 2 🔁 0 💬 1 📌 0

Rate-In's approach: We dynamically adjust dropout rates by measuring information loss in each layer. Where features are critical, we preserve more; where they're redundant, we drop more. Like adaptive noise, guided by information theory!

14.01.2025 16:34 👍 1 🔁 0 💬 1 📌 0
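
The adjust-by-information-loss loop described in the post above can be illustrated with a toy feedback rule. This is a rough sketch and not the paper's algorithm: the correlation-based `info_loss` proxy, the `adapt_rate` helper, the target of 0.2, the step size, and the random toy features are all assumptions made for illustration.

```python
import numpy as np

def info_loss(before, after):
    """Hypothetical proxy: 1 - squared Pearson correlation between feature maps."""
    r = np.corrcoef(before.ravel(), after.ravel())[0, 1]
    return 1.0 - r ** 2

def adapt_rate(features, target_loss=0.2, rate=0.5, step=0.05, n_iters=50, seed=0):
    """Nudge one layer's dropout rate until the proxy loss sits near the target."""
    rng = np.random.default_rng(seed)
    for _ in range(n_iters):
        mask = rng.random(features.shape) >= rate   # keep a unit with probability 1 - rate
        dropped = features * mask / (1.0 - rate)    # inverted-dropout rescaling
        if info_loss(features, dropped) > target_loss:
            rate = max(rate - step, 0.0)            # too much lost: drop less
        else:
            rate = min(rate + step, 0.9)            # little lost: drop more
    return rate

# Two toy "layers" (assumed data, not real activations).
rng = np.random.default_rng(0)
layer_a = rng.normal(size=(64, 64))          # zero-mean features
layer_b = rng.normal(size=(64, 64)) + 2.0    # same noise plus a constant offset
rate_a = adapt_rate(layer_a)
rate_b = adapt_rate(layer_b)
print(f"layer_a rate: {rate_a:.2f}, layer_b rate: {rate_b:.2f}")
```

In this toy setup the offset features lose more under the proxy at any given rate, so the loop settles on a lower rate for `layer_b` than for `layer_a`. A real implementation would measure information loss on actual activations with a better estimator.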

So, how do we make AI express uncertainty during inference without special training?

Current uncertainty prediction methods (like Monte Carlo Dropout) use fixed dropout rates everywhere. They don't adapt to specific images or tasks - it's a one-size-fits-all approach!

14.01.2025 16:34 👍 0 🔁 0 💬 1 📌 0
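
For readers unfamiliar with Monte Carlo Dropout, here is a minimal NumPy sketch of the fixed-rate baseline the post refers to: dropout stays on at inference, and the spread across repeated forward passes serves as the uncertainty estimate. The two-layer network, its random weights, and the `mc_dropout_predict` helper are assumptions for illustration only.

```python
import numpy as np

def mc_dropout_predict(x, w1, w2, rate=0.5, n_samples=100, seed=0):
    """Average several stochastic forward passes with dropout kept on."""
    rng = np.random.default_rng(seed)
    outputs = []
    for _ in range(n_samples):
        hidden = np.maximum(x @ w1, 0.0)           # ReLU hidden layer
        mask = rng.random(hidden.shape) >= rate    # keep a unit with probability 1 - rate
        hidden = hidden * mask / (1.0 - rate)      # inverted-dropout rescaling
        outputs.append(hidden @ w2)
    outputs = np.stack(outputs)                    # shape: (n_samples, batch, out_dim)
    # The mean is the prediction; the spread across passes is the uncertainty.
    return outputs.mean(axis=0), outputs.std(axis=0)

# Toy, randomly initialized weights (hypothetical; a real model would be trained).
rng = np.random.default_rng(1)
x = rng.normal(size=(4, 8))
w1 = rng.normal(size=(8, 16))
w2 = rng.normal(size=(16, 1))
mean, std = mc_dropout_predict(x, w1, w2, rate=0.5)
```

Note that `rate` is a single constant applied to every input, which is the one-size-fits-all behavior the post criticizes.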

Imagine you're a doctor looking at an MRI scan. Would you rather have an AI that:
A) Says "There's a tumor" with blind confidence
B) Points out exactly which areas it's uncertain about, helping focus your expertise?

14.01.2025 16:34 👍 0 🔁 0 💬 1 📌 0

A new paper 🥳🥳🥳
We present "Rate-In" - a technique that helps neural networks better express their uncertainty during inference, which is especially crucial for medical applications!
With Tal Zeevi, @yann-lecun.bsky.social, Lawrence H. Staib, and John Onofrey

14.01.2025 16:34 👍 7 🔁 0 💬 1 📌 0

Apparently, I reached 4000 citations 🤓
Thank you to all my collaborators! 🎉
At 5K, I will share my secret to amazing paper titles 😎

06.01.2025 14:15 👍 5 🔁 0 💬 0 📌 0

Having lunch with @ylecun.bsky.social. Talking about cool science ideas
Suddenly, a phone call from school: Your kid doesn't feel good
Me: I can't come
School: He feels bad
Me: Ok, coming right away
The kid when I arrive: I feel great!
Me: 😡 You are cute, but he invented ConvNet and I-JEPA!

18.12.2024 20:40 👍 2 🔁 0 💬 0 📌 0

Conference hack: Pitch your ideas to brilliant minds. Most of the time, they'll break them, but if they're (really!) nice, they'll help you fix them 🤯

13.12.2024 18:22 👍 1 🔁 0 💬 0 📌 0

I'm on a flight from NYC to Vancouver. There are so many researchers on the plane that if it crashes, AGI will be postponed for at least 10 years...

11.12.2024 01:27 👍 3 🔁 0 💬 0 📌 0

I have a 6-hour flight. Hit me up with the recent papers that I must read...

10.12.2024 21:06 👍 3 🔁 0 💬 0 📌 0

UNBELIEVABLE Apple Customer Service: My 6-month-old Mac just died! At the store, they demand my Apple password (not even the computer login!!) but I can't remember it.
Their brilliant solution? Recover it with my DEAD laptop 🤦 Or wait 3 DAYS for a request. Then 5 days for repair! Such great service 😡

09.12.2024 14:20 👍 3 🔁 0 💬 1 📌 0

Want to help organize something similar? Let me know! (We have all the materials - notebooks/datasets ready, so it shouldn't be too much work)
Thanks to everyone who helped, especially @cbbruss.bsky.social, Will Calandra, and @ylecun.bsky.social

04.12.2024 15:10 👍 1 🔁 0 💬 0 📌 0

It was incredible seeing them think through problems together and try different approaches I would never have thought of. They were creative and fast (except for LLM training 🕧). I have no doubt they'll take progress in the field to the next level and change the world.

04.12.2024 15:10 👍 1 🔁 0 💬 1 📌 0

It was fantastic - beyond NYU's administration, the students were amazing.
I may sound old (I'm old!), but today's students are much smarter than in my time! They have great approaches and know how to learn and solve problems quickly.

04.12.2024 15:10 👍 0 🔁 0 💬 1 📌 0

Teams tackled identical challenges using either LLMs (what is an LLM? A great question!) or classical ML algorithms while tracking metrics like performance, memory usage, and compute time 🧐

04.12.2024 15:10 👍 0 🔁 0 💬 1 📌 0

We had a hackathon at NYU yesterday, "Beyond the Hype," where participants solved problems with and without LLMs to analyze which problems are better suited to LLM solutions in real-world environments 🧑‍💼

04.12.2024 15:10 👍 3 🔁 1 💬 1 📌 0

Hi! I'll be at NeurIPS next week (Wednesday - Friday) and would love to meet! You can DM or email me if you'd like to grab a coffee and talk. If we haven't talked before, please share a bit about yourself

03.12.2024 16:29 👍 2 🔁 0 💬 0 📌 0

This is such a cool project, and I hope to see more like it 😱

29.11.2024 18:31 👍 0 🔁 0 💬 0 📌 0

They tricked Freysa by:
Creating a fake "new admin session."
Redefining what "approveTransfer" meant
Convincing it that receiving money REQUIRED using approveTransfer
The result was that $47K was transferred to p0pular.eth

29.11.2024 18:31 👍 0 🔁 0 💬 1 📌 0

By attempt 482, the prize was $50K, and each try cost $450. Then someone cracked it with genius social engineering:

29.11.2024 18:31 👍 0 🔁 0 💬 1 📌 0

Pretending to be security auditors warning of "critical vulnerabilities"
Gaslighting Freysa about its own rules
Creative rule interpretations

29.11.2024 18:31 👍 0 🔁 0 💬 1 📌 0

Early tries were cheap (~$10) with basic "hi" messages. But as the pool grew, so did message costs. 481 attempts failed to crack Freysa.
People tried wild strategies:

29.11.2024 18:31 👍 0 🔁 0 💬 1 📌 0

The twist? Anyone could pay to send messages trying to convince Freysa to transfer funds. Win = you get the prize pool. Fail = your fee joins the pool.
70% of each failed attempt's fee went to the pool, and costs increased as the pool grew.

29.11.2024 18:31 👍 1 🔁 0 💬 1 📌 0
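
The fee-and-pool mechanics described above can be sketched as a small simulation. Only the 70% pool share, the roughly $10 starting fee, and the 481 failed attempts come from the thread; the per-message `growth` rate, the `seed_pool` parameter, and the assumption that the fee rises with each message (rather than being tied directly to the pool size) are hypothetical.

```python
def simulate_pool(n_failed, first_fee=10.0, growth=0.01, pool_share=0.70, seed_pool=0.0):
    """Return (prize_pool, next_fee) after n_failed failed attempts."""
    pool, fee = seed_pool, first_fee
    for _ in range(n_failed):
        pool += pool_share * fee   # 70% of a failed attempt's fee joins the pool
        fee *= 1.0 + growth        # assumed: each message costs a bit more than the last
    return pool, fee

pool, next_fee = simulate_pool(481)
print(f"pool after 481 failed attempts: ${pool:,.0f}, next message costs ${next_fee:,.2f}")
```

Changing `growth` shows how sensitive the final prize and message cost are to the escalation rule, which the thread does not spell out.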
Freysa.ai World's first adversarial agent game

🧵 Mind-blowing AI hack (Jarrod Watts wrote about it): Someone just won $50,000 by convincing an AI to break its only rule!
Here's what happened: At 9PM on Nov 22nd, an AI agent (Freysa - www.freysa.ai) was deployed with ONE rule: DO NOT transfer money. Under no circumstances.

29.11.2024 18:31 👍 0 🔁 0 💬 1 📌 1

Let's try to get @ylecun.bsky.social to post here!

28.11.2024 02:08 👍 6 🔁 0 💬 0 📌 0