Marc Andreesen dropped the best life advice
@ankits0052
AI Research Enthusiast | Multimedia Analysis | LLMs Associate director at Accenture. PhD in LTI CMU, prev: Google, Bosch, merl, ARM Looking for the next breakthrough that will lead to AGI - understanding why LLMs actually work ankitshah009.github.io
Marc Andreesen dropped the best life advice
"Once you realize the whole world is run by shameless self-promoters you almost have no choice but to put yourself out there."
- Justin Welsh
3. SuperAGI is a dev-first open source autonomous AI agent framework to build, manage & run useful autonomous agents. You can run concurrent agents seamlessly, extend agent capabilities with tools.
2. AG2 is an open-source framework for building and coordinating multiple AI agents using LLMs, supporting tool use and human-in-the-loop interaction.
It simplifies agent creation, communication, and workflow management through pre-built patterns and configurable options.
OpenAI released Agents SDK today.
But you didn't have to wait for OpenAI.
Other AI agent frameworks do the same.
100% opensource.
1. Agno is a lightweight library for building Multimodal Agents with memory, knowledge, and tools. It is 10,000x faster than LangGraph.
Incoming Intel CEO Lip-Bu Tan is signaling that heβll stick with his predecessorβs plan to make chips for other companies, even as he vows to learn from past mistakes trib.al/8N8IeE1
A framework to automatically generate robot constitutions from real-world data to steer a robot's behavior using Constitutional AI mechanisms.
2. The ASIMOV benchmark is a large-scale and comprehensive collection of datasets for evaluating and improving semantic safety of foundation models serving as robot brains to generate data under undesirable situations from real-world visual scenes for better robot scene understanding.
New Dataset & Benchmarks:
1. ASIMOV Dataset for measuring safety implications of robotic actions in real-world scenarios.
3. A top alignment rate of 84.3% was measured with ASIMOV Benchmark using generated constitutions, outperforming no-constitution baselines and human-written constitutions.
Some limitations for Gemini Robotics-ER stated in the report include struggles in spatial relationships across long videos and still ways to go for fine-grained robot control.
ERQA (Embodied Reasoning Question Answering) is the benchmark introduced for embodied reasoning for VLMs. With over 400 MCVQs in spatial and action reasoning, trajectory reasoning, state estimation, task reasoning and more. It's similar to existing VLM benchmarks.
Gemini Robotics-ER VLM can enable spatial understanding, trajectory prediction, precise pointing and multi-view. The VLM brings foundational work for real-world robotics applications via zero-shot and few-shot adaptation for perception, planning and code generation to control robot embodiments.
They also introduced a new dataset & framework for robot constitutionsπ
New Models:
Gemini Robotics taps into Gemini's world understanding to generalize to novel situations and solve a wide variety of tasks out of the box, including tasks it has never seen before in training.
Google Deepmind's latest paper showcases how Gemini 2.0 can be brought into the physical world through robotics with Gemini Robotics (a VLA) and Gemini Robotics-ER, an embodied VLM. Apptronik, Agile Robots, Agility Robots, Boston Dynamics, and Enchanted Tools are some of the early testers.
10. Idea Refinement:
"Take my rough conceptβ[e.g., 'a platform for decentralized education']βand explore similar ideas on X and the web. Provide a 500-word report on existing implementations, potential challenges, and 5 actionable next steps for development."
9. Image-Inspired Writing:
"Search X for the 5 most shared images related to [theme, e.g., climate change impacts] in the last week. For each, write a 200-word fictional vignette inspired by the image, and ask if Iβd like you to generate a complementary image."
8. Debate Prep:
"Find the 10 most influential X posts on [controversial issue, e.g., universal basic income] from the past month. Summarize each stance, then draft two 300-word opposing arguments I can refine for a debate script."
7. Historical Context:
"Research the evolution of [concept, e.g., cryptocurrency regulation] over the past 5 years using X posts and web sources. Create a timeline with 10 key events and a 700-word narrative explaining their significance for a blog post."
6. Document Breakdown:
"Analyze this uploaded PDFβ[assume user uploads a research paper]βand extract its main arguments, methodology, and conclusions. Then, write a 400-word critique assessing its strengths and gaps, suggesting 3 follow-up research questions."
5. Creative Brainstorming:
"Generate 15 unique story ideas based on emerging trends in [field, e.g., biotechnology]. For each, provide a one-sentence premise, a potential protagonist, and a key conflict, drawing from current web and X conversations."
4. Comparative Analysis:
"Search X and the web for discussions on [topic, e.g., renewable energy policies] from the last 3 months. Compare perspectives from at least 3 distinct groups (e.g., scientists, policymakers, activists) and write a 600-word analysis highlighting agreements and conflicts."
3. Content Expansion:
"Take this ideaβ[e.g., 'AI could reshape urban planning']βand generate a 1000-word article outline. Include potential arguments, counterarguments, and data points I should research further, using web and X searches to suggest credible sources."
2. Profile Deep Dive:
"Examine the X profile of [username], including their posts, linked content, and uploaded files from the past 6 months. Identify their main areas of expertise, biases, and recurring themes, and draft a 300-word profile summary for a writing project."
1. Research Synthesis:
"Analyze the latest 20 X posts and their linked articles about [specific topic, e.g., quantum computing advancements]. Summarize key trends, debates, and unresolved questions, then provide a 500-word overview with citations I can use for a research paper."