Ankit's Avatar

Ankit

@ankits0052

AI Research Enthusiast | Multimedia Analysis | LLMs Associate director at Accenture. PhD in LTI CMU, prev: Google, Bosch, merl, ARM Looking for the next breakthrough that will lead to AGI - understanding why LLMs actually work ankitshah009.github.io

851
Followers
1,368
Following
87
Posts
19.11.2024
Joined
Posts Following

Latest posts by Ankit @ankits0052

Post image

Marc Andreesen dropped the best life advice

02.11.2025 20:25 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

"Once you realize the whole world is run by shameless self-promoters you almost have no choice but to put yourself out there."
- Justin Welsh

31.03.2025 12:29 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image
30.03.2025 22:17 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image
16.03.2025 02:07 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image
16.03.2025 01:08 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

3. SuperAGI is a dev-first open source autonomous AI agent framework to build, manage & run useful autonomous agents. You can run concurrent agents seamlessly, extend agent capabilities with tools.

13.03.2025 06:51 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

2. AG2 is an open-source framework for building and coordinating multiple AI agents using LLMs, supporting tool use and human-in-the-loop interaction.

It simplifies agent creation, communication, and workflow management through pre-built patterns and configurable options.

13.03.2025 06:51 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

OpenAI released Agents SDK today.

But you didn't have to wait for OpenAI.

Other AI agent frameworks do the same.

100% opensource.

1. Agno is a lightweight library for building Multimodal Agents with memory, knowledge, and tools. It is 10,000x faster than LangGraph.

13.03.2025 06:51 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Intel CEO Signals That He’ll Stick With Contentious Foundry Plan Incoming Intel Corp. Chief Executive Officer Lip-Bu Tan is signaling that he’ll stick with his predecessor’s plan to make chips for other companies, even as he vows to learn from past mistakes.

Incoming Intel CEO Lip-Bu Tan is signaling that he’ll stick with his predecessor’s plan to make chips for other companies, even as he vows to learn from past mistakes trib.al/8N8IeE1

13.03.2025 05:12 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

A framework to automatically generate robot constitutions from real-world data to steer a robot's behavior using Constitutional AI mechanisms.

13.03.2025 05:11 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

2. The ASIMOV benchmark is a large-scale and comprehensive collection of datasets for evaluating and improving semantic safety of foundation models serving as robot brains to generate data under undesirable situations from real-world visual scenes for better robot scene understanding.

13.03.2025 05:11 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

New Dataset & Benchmarks:
1. ASIMOV Dataset for measuring safety implications of robotic actions in real-world scenarios.

13.03.2025 05:11 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

3. A top alignment rate of 84.3% was measured with ASIMOV Benchmark using generated constitutions, outperforming no-constitution baselines and human-written constitutions.

13.03.2025 05:11 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Some limitations for Gemini Robotics-ER stated in the report include struggles in spatial relationships across long videos and still ways to go for fine-grained robot control.

13.03.2025 05:10 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

ERQA (Embodied Reasoning Question Answering) is the benchmark introduced for embodied reasoning for VLMs. With over 400 MCVQs in spatial and action reasoning, trajectory reasoning, state estimation, task reasoning and more. It's similar to existing VLM benchmarks.

13.03.2025 05:10 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Gemini Robotics-ER VLM can enable spatial understanding, trajectory prediction, precise pointing and multi-view. The VLM brings foundational work for real-world robotics applications via zero-shot and few-shot adaptation for perception, planning and code generation to control robot embodiments.

13.03.2025 05:09 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

They also introduced a new dataset & framework for robot constitutionsπŸ‘‡

New Models:
Gemini Robotics taps into Gemini's world understanding to generalize to novel situations and solve a wide variety of tasks out of the box, including tasks it has never seen before in training.

13.03.2025 05:09 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Google Deepmind's latest paper showcases how Gemini 2.0 can be brought into the physical world through robotics with Gemini Robotics (a VLA) and Gemini Robotics-ER, an embodied VLM. Apptronik, Agile Robots, Agility Robots, Boston Dynamics, and Enchanted Tools are some of the early testers.

13.03.2025 05:09 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image
03.03.2025 11:53 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

10. Idea Refinement:

"Take my rough conceptβ€”[e.g., 'a platform for decentralized education']β€”and explore similar ideas on X and the web. Provide a 500-word report on existing implementations, potential challenges, and 5 actionable next steps for development."

02.03.2025 02:22 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

9. Image-Inspired Writing:

"Search X for the 5 most shared images related to [theme, e.g., climate change impacts] in the last week. For each, write a 200-word fictional vignette inspired by the image, and ask if I’d like you to generate a complementary image."

02.03.2025 02:22 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

8. Debate Prep:

"Find the 10 most influential X posts on [controversial issue, e.g., universal basic income] from the past month. Summarize each stance, then draft two 300-word opposing arguments I can refine for a debate script."

02.03.2025 02:22 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

7. Historical Context:

"Research the evolution of [concept, e.g., cryptocurrency regulation] over the past 5 years using X posts and web sources. Create a timeline with 10 key events and a 700-word narrative explaining their significance for a blog post."

02.03.2025 02:21 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

6. Document Breakdown:

"Analyze this uploaded PDFβ€”[assume user uploads a research paper]β€”and extract its main arguments, methodology, and conclusions. Then, write a 400-word critique assessing its strengths and gaps, suggesting 3 follow-up research questions."

02.03.2025 02:21 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

5. Creative Brainstorming:

"Generate 15 unique story ideas based on emerging trends in [field, e.g., biotechnology]. For each, provide a one-sentence premise, a potential protagonist, and a key conflict, drawing from current web and X conversations."

02.03.2025 02:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

4. Comparative Analysis:

"Search X and the web for discussions on [topic, e.g., renewable energy policies] from the last 3 months. Compare perspectives from at least 3 distinct groups (e.g., scientists, policymakers, activists) and write a 600-word analysis highlighting agreements and conflicts."

02.03.2025 02:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

3. Content Expansion:

"Take this ideaβ€”[e.g., 'AI could reshape urban planning']β€”and generate a 1000-word article outline. Include potential arguments, counterarguments, and data points I should research further, using web and X searches to suggest credible sources."

02.03.2025 02:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

2. Profile Deep Dive:

"Examine the X profile of [username], including their posts, linked content, and uploaded files from the past 6 months. Identify their main areas of expertise, biases, and recurring themes, and draft a 300-word profile summary for a writing project."

02.03.2025 02:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

1. Research Synthesis:

"Analyze the latest 20 X posts and their linked articles about [specific topic, e.g., quantum computing advancements]. Summarize key trends, debates, and unresolved questions, then provide a 500-word overview with citations I can use for a research paper."

02.03.2025 02:19 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image
02.03.2025 02:19 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0