Tejas Srinivasan's Avatar

Tejas Srinivasan

@tejassrinivasan

CS PhD student at USC. Former research intern at AI2 Mosaic. Interested in human-AI interaction and language grounding.

294
Followers
157
Following
22
Posts
08.11.2023
Joined
Posts Following

Latest posts by Tejas Srinivasan @tejassrinivasan

🚨Reminder: Submissioms for the ORIGen workshop at COLM are due today!!! 🚨

CfP: origen-workshop.github.io/submissions/

OpenReview submission page: openreview.net/group?id=col...

27.06.2025 19:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 1

I'm trying to make "bleet" a thing

30.05.2025 16:36 πŸ‘ 3 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
Jesse Thomason and Jesse Zhang in their respective PhD robes.

Jesse Thomason and Jesse Zhang in their respective PhD robes.

This month, @jessezhang.bsky.social completed his PhD defense and signed to start a postdoc with @abhishekunique7.bsky.social at UW! Keep an eye on his journey :) www.jessezhang.net
I'm sad to lose one of my sinistral students but glad to produce another Dr. Jesse πŸ˜›

28.05.2025 17:04 πŸ‘ 10 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

The only silver lining of my ACL rejection is that I have something to submit to EMNLP

16.05.2025 19:59 πŸ‘ 5 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
ORIGen 2025 Workshop on Optimal Reliance and Accountability in Interactions with Generative LMs

LLMs are all around us, but how can we foster reliable and accountable interactions with them??

To discuss these problems, we will host the first ORIGen workshop at @colmweb.org! Submissions welcome from NLP, HCI, CogSci, and anything human-centered, due June 20 :)

origen-workshop.github.io

16.05.2025 15:35 πŸ‘ 10 πŸ” 4 πŸ’¬ 0 πŸ“Œ 2

This! So much this!!!

01.05.2025 16:29 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
I Tested The AI That Calls Your Elderly Parents If You Can't Be Bothered inTouch says on its website "Busy life? You can’t call your parent every dayβ€”but we can." My own mum said she would feel terrible if her child used it.

Nothing says β€œI love you” like outsourcing your parents’ phone calls to a chatbot. πŸ™ƒ Social isolation in aging is real. Connection isn’t something you can automate.

Why does everyone think we can just throw a chatbot at every problem? </rhetorical>

www.404media.co/i-tested-the...

29.04.2025 19:42 πŸ‘ 42 πŸ” 7 πŸ’¬ 1 πŸ“Œ 2

Ty for the plug πŸ™
Model confidence is a good decision aid (arxiv.org/pdf/2001.02114), while explanations are less useful and can cause over-reliance (arxiv.org/abs/2310.12558, arxiv.org/pdf/2406.19170). Other interaction cues like AI warmth can also make a difference (arxiv.org/abs/2407.07950).

13.03.2025 01:03 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Arresting and threatening to deport students because of their participation in political protest is the kind of action one ordinarily associates with the world’s most repressive regimes. It’s genuinely shocking that this appears to be what’s going on right here. 1/

09.03.2025 23:55 πŸ‘ 3082 πŸ” 746 πŸ’¬ 36 πŸ“Œ 20

What do you mean by core capabilities, for VLMS? IMO core capabilities should be determined by the applications we care about, and I'd argue medical use cases are as important (if not more) as MSCOCO-style images/scenes

10.03.2025 16:05 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I worry that concerns with "superintelligence" are being blurred with concerns around *ceding human control*.
A "SuperDumb" system can create mutually assured destruction. What it takes is allowing AI systems to execute code autonomously in military operations.

06.03.2025 19:46 πŸ‘ 35 πŸ” 12 πŸ’¬ 3 πŸ“Œ 1

"The first guest on Gavin Newsom's podcast was Charlie Kirk" is more than enough for me to say "absolutely not" to any suggestion Newsom play any role in the future of the Democratic Party. People like him are the past, the failures, the ones who got us here.

06.03.2025 02:25 πŸ‘ 10824 πŸ” 2145 πŸ’¬ 514 πŸ“Œ 220

What are you using o1pro for? And in what aspects do you think it's better than other LLMs?

28.02.2025 19:40 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Is this advice you reserve for a particular class of problems, or is it just generally applicable because we still don't know the full breadth of LLM capabilities?

28.02.2025 19:39 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I'm always three days away from being three days away

28.02.2025 17:24 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

We hope our work inspires the community to more closely consider how user characteristics, including but not limited to trust, affect how people rely on AI assistance.

Work done with the always-awesome @thomason.bsky.social!

27.02.2025 18:02 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Improving AI reliability is more important than ever as AI systems are increasingly deployed in real-world settings with high stakes. We believe it is important for AI researchers to think about the user-AI dyad πŸ§‘πŸ€–, rather than just the AI in a vacuum.

27.02.2025 18:01 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

These findings show that being able to estimate users’ trust levels can enhance human-AI collaboration πŸ’ͺ but we also find that modeling user trust is very challenging! πŸ˜“ Our work reveals promising new directions for user modeling that extend beyond merely learning user preferences.

27.02.2025 18:01 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

We show that adapting AI behavior to user trust levels, by showing AI explanations during moments of low trust and counter-explanations during high trust, effectively mitigates inappropriate reliance and improves decision accuracy! These improvements are also seen with other intervention strategies.

27.02.2025 18:00 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

In two decision-making tasks, we find that low and high user trust levels worsen under-reliance and over-reliance on AI recommendations, respectively πŸ’€πŸ’€πŸ’€

Can the AI assistant do something differently when user trust is low/high to prevent such inappropriate reliance? Yes!

27.02.2025 17:59 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

People are increasingly relying on AI assistance, but *how* they use AI advice is influenced by their trust in the AI, which the AI is typically blind to. What if they weren’t?

We show that adapting AI assistants' behavior to user trust mitigates under- and over-reliance!

arxiv.org/abs/2502.13321

27.02.2025 17:56 πŸ‘ 8 πŸ” 2 πŸ’¬ 1 πŸ“Œ 2
What’s a technology that you think is overhyped?

I’m going to give a sideways answer to this, which is that the venture capital business model needs to be understood as requiring hype. You can go back to the Netscape IPO, and that was the proof point that made venture capital the financial lifeblood of the tech industry.

Venture capital looks at valuations and growth, not necessarily at profit or revenue. So you don’t actually have to invest in technology that works, or that even makes a profit, you simply have to have a narrative that is compelling enough to float those valuations. So you see this repetitive and exhausting hype cycle as a feature in this industry. A couple of years ago, you would have been asking me about the metaverse, then last year, you would have asked me about Web3 and crypto, and for each of these inflection points there’s an Andreessen Horowitz manifesto.

It’s not simply that one piece of technology is overhyped, it’s that hype is a necessary ingredient of the current business ecosystem of the tech industry. We should examine how often the financial incentive for hype is rewarded without any real social returns, without any meaningful progress in technology, without these tools and services and worlds ever actually manifesting. That’s key to understanding the growing chasm between the narrative of techno-optimists and the reality of our tech-encumbered world.

What’s a technology that you think is overhyped? I’m going to give a sideways answer to this, which is that the venture capital business model needs to be understood as requiring hype. You can go back to the Netscape IPO, and that was the proof point that made venture capital the financial lifeblood of the tech industry. Venture capital looks at valuations and growth, not necessarily at profit or revenue. So you don’t actually have to invest in technology that works, or that even makes a profit, you simply have to have a narrative that is compelling enough to float those valuations. So you see this repetitive and exhausting hype cycle as a feature in this industry. A couple of years ago, you would have been asking me about the metaverse, then last year, you would have asked me about Web3 and crypto, and for each of these inflection points there’s an Andreessen Horowitz manifesto. It’s not simply that one piece of technology is overhyped, it’s that hype is a necessary ingredient of the current business ecosystem of the tech industry. We should examine how often the financial incentive for hype is rewarded without any real social returns, without any meaningful progress in technology, without these tools and services and worlds ever actually manifesting. That’s key to understanding the growing chasm between the narrative of techno-optimists and the reality of our tech-encumbered world.

Stand by this: www.politico.com/newsletters/...

19.02.2025 16:42 πŸ‘ 9716 πŸ” 3163 πŸ’¬ 156 πŸ“Œ 350

Do each of these correspond to a particular conf deadline? I'm guessing
May: EMNLP
July: AACL?
Oct: EACL/NAACL
Feb: ACL

19.02.2025 18:44 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

‼️ Ever wish LLMs would just... slow down for a second?

In our latest work, "Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems", we delve into how strategic delays can enhance dialogue systems.

Paper Website: merterm.github.io/positive-fri...

08.02.2025 22:42 πŸ‘ 15 πŸ” 5 πŸ’¬ 1 πŸ“Œ 2

β€œToward the end of the November dinner, Trump raised the matter of the lawsuit, the people said. The president signaled that the litigation had to be resolved before Zuckerberg could be β€œbrought into the tent,” one of the people said.”

They’re in the tent now. Cowards.

29.01.2025 21:59 πŸ‘ 764 πŸ” 241 πŸ’¬ 55 πŸ“Œ 17

Hi Marc! Could I get added?

14.01.2025 18:45 πŸ‘ 7 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Ooh what agent? Any pointers to how I can set this up?

10.01.2025 02:22 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

EveryPhD EveryLab all at once

07.01.2025 23:16 πŸ‘ 8 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

As long as the last time you saw/spoke to them was last year -- I wish my dentist Happy New Year in August.

07.01.2025 23:15 πŸ‘ 5 πŸ” 0 πŸ’¬ 0 πŸ“Œ 1

You forgot about mid-training (which incidentally is also what I call my training runs).

07.01.2025 23:12 πŸ‘ 3 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0