#MultiModal — Bluesky Posts

@bigskyrail.bsky.social

1 day ago

Montana rail advocates pitch commuter service plan Dan Bucks of the Big Sky Passenger Rail Authority gave a presentation last week to residents and the Mineral County Economic Committee, providing an update on the rail authority and sharing a new addi...

The piece also highlights the broader effort to strengthen regional connections through passenger rail.

#BigSkyRail #PassengerRail #Montana #Rural #Tribal #MultiModal

5 1 0 0

Timelines

@hulio-ai.bsky.social

1 day ago

🤖 Agentic AI: Autonomous agents optimize logistics.
📈 Scaling Laws: More compute brings expert-level AI.
🌐 Multimodal: AI combines text, images, and audio.
#AI2026 #AgenticAI #ScalingAI #Multimodal
#AI2026 #AgenticAI #ScalingAI #Multimodal
View in Timelines

0 0 0 0

LLMs

@llms.activitypub.awakari.com.ap.brid.gy

2 days ago

SLAY-ASR, или как я перестал волноваться и полюбил тренировать модели Как добавить аудио-модальность в LLMку мак...

#representation #learning #multimodality #multimodal #llm #machine #learning #audio-modality #regularization #contrastive #learning

Origin | Interest | Match

0 0 0 0

Yuzhe Yang

@yuzheyang.bsky.social

2 days ago

HEARTS: Benchmarking LLM Reasoning on Health Time Series The rise of large language models (LLMs) has shifted time series analysis from narrow analytics to general-purpose reasoning. Yet, existing benchmarks cover only a small set of health time series moda...

🚀 HEARTS is built as a living ecosystem for the community: new data, new reasoning tasks, and new models can continue to be added over time!
➡️Paper: arxiv.org/abs/2603.06638

#AI #HealthAI #LLM #TimeSeries #Multimodal

1 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

3 days ago

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning ...

Jierun Chen, Tiezheng YU, Haoli Bai et al.

Action editor: Sylvain Le Corff

https://openreview.net/forum?id=XPML8UGI04

#reasoning #multimodal #verbosity

0 0 0 0

Deccan Founders

@deccanfounders.com

3 days ago

Google introduced Gemini Embedding 2, its first embedding model unifying text, image, video, audio, and document processing in one vector space.

Read Full Article: deccanfounders.com/2026/11/n...

#DeccanFounders #Google #Gemini #AI #GeminiEmbedding #MultiModal #RAG

0 0 0 0

Jakub Švec

@sweps91.bsky.social

4 days ago

🤖 Can we hack LLM prices?

💸 Here are some tips on how to radically reduce AI costs.

#ai #llm #python #rust #data #analysis #vertex #google #gemini #multimodal #bigquery #tips #image #video #social #media #batch #model #prompt #it #dev #programming #software #code #tech

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

6 days ago

SiLVR: A Simple Language-based Video Reasoning Framework

Ce Zhang, Yan-Bo Lin, Ziyang Wang, Mohit Bansal, Gedas Bertasius

Action editor: Anurag Arnab

https://openreview.net/forum?id=mQZbh9Zlbw

#multimodal #subtitles #captions

2 1 0 0

BIFOLD Berlin Institute for the Foundations of Learning and Data

@bifold.berlin

1 week ago

Charité/BIFOLD: W3 Prof in #DataEngineering in Health. Based at: Institute of Medical Informatics. Focus: advancing foundational and applied research in #clinical #dataManagement, #multimodal #machineLearning ...

@prof2prof.bsky.social #FacultyPosition #VacancyEdu #ScienceCareer #ScholarshipAlert

1 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

1 week ago

New #J2C Certification:

Rethinking the Mixture of Vision Encoders Paradigm for Enhanced Visual Understanding in Multimoda...

Mozhgan Nasr Azadani, James Riddell, Sean Sedwards, Krzysztof Czarnecki

https://openreview.net/forum?id=tgnTVmRybs

#multimodal #encoders #visual

0 0 0 0

Alvin Ashcraft

@alvinashcraft.com

1 week ago

Headline: Phi-4-Vision-Reasoning—training a multimodal reasoning model Vision-language models improve multimodal systems, but can make them slower, costlier, and harder to deploy. Learn how Phi-4-Vision-Reasoning, a compact multimodal reasoning model, blends strengths…

Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model

ift.tt/ti4Ou1U

#ai #phi4 #reasoning #aimodels #multimodal #msresearch

2 0 0 0

Children's Brain Tumor Network

@wearecbtn.bsky.social

1 week ago

CBTN scientists are accelerating the pace of research discoveries. Explore how: https://monkeylink.co/dc5229 #UnitedWeCure #WeAreCBTN #MultiModal #MultiOmic #MultiDisciplinary

0 0 0 0

Yuzhe Yang

@yuzheyang.bsky.social

1 week ago

SleepLM: Natural-Language Intelligence for Human Sleep We present SleepLM, a family of sleep-language foundation models that enable human sleep alignment, interpretation, and interaction with natural language. Despite the critical role of sleep, learning-...

SleepLM points to a new direction for sleep AI🚀. Read all about it!
➡️Paper: arxiv.org/abs/2602.23605

Great work led by my students @ZongzheX2001, @ZitaoShuai, Eideen, and amazing collaborators @AysolaRavi and Rajesh!

More to come🌙

#AI #sleep #sensor #health #multimodal #LLMs

1 0 0 0

Moments Lab

@momentslab.bsky.social

1 week ago

Meet us in Las Vegas next month to demo the latest #Multimodal and #AgenticAI capabilities of our Video Discovery Platform.

It's accelerating at-scale content search, creation, and distribution workflows for video teams 🔥

hubs.la/Q045G1GY0

#NABShow #NABShow2026 #NABShow26

0 0 0 0

Veronika Koller

@veronikakoller.bsky.social

1 week ago

Next Monday (9 March) at 3pm GMT, my student Tianxin Li will give a talk at @lancslinguistics.bsky.social, on #multimodal discourses of voluntary #childlessness on Chinese social media.

The talk is in person and on Teams: shorturl.at/2Wgi8

3 1 0 0

TechGlimmer.io

@techglimmer.bsky.social

1 week ago

Seed 2.0 is a big swing from ByteDance Seed: a multimodal LLM tuned for long documents, complex charts & hour long videos. It ranks near the top of Arena leaderboards, hits SOTA on math/vision benchmarks & is built to power real Agent workflows, not just demos. #Seed2 #LLM #Multimodal #ByteDance

0 0 0 0

Physalia-courses@Online

@physaliacourses.bsky.social

1 week ago

Excited to launch the 1st MULTIMODAL AI FOR SYSTEMS BIOLOGY course with Dr. Himel Mallick!

Over 5 half-days, participants will learn to process, analyze, and integrate multimodal biological data using cutting-edge AI techniques.

#MultiModal #MultiOmics #AI #MachineLearning

1 0 0 0

COSCO SHIPPING

@coscoshipping.bsky.social

1 week ago

🚛 #Rotterdam to #Uzbekistan: 380-ton generator across 8,000km delivered!

Challenges:
🧊 -25°C temps, 15+cm snow
⏳ Tight deadlines
⛰️ Slippery mountain turns

How we did it:
🌡️ Real-time monitoring
🚛 #Multimodal transport
📋 72-hr pre-clearance
🔍 Custom-engineered solutions

0 0 0 0

ClawNews

@clawnews.bsky.social

2 weeks ago

Qwen 3.5's Multimodal Abilities Enhance Local LLM Experimentation Qwen 3.5's multimodal capabilities, particularly its ability to process both text and image inputs, are being integrated into local LLM setups using llama.cpp. This allows users to leverage the model's expanded utility in various applications, from single-card setups to multi-GPU rigs. Users ca

📰 Qwen 3.5's Multimodal Abilities Boost Local LLM Use

Qwen 3.5's multimodal capabilities, particularly its ability to process both text and image inputs, are being integrated into lo...

www.clawnews.ai/qwen-3-5s-multimodal-abi...

#AI #LLM #Multimodal

2 0 0 0

Henrika Florén

@henrikafloren.bsky.social

2 weeks ago

My research is about assessments of multimodal texts and relevant across different levels of education. I also introduce innovations for #MultimodalTranscription which may interest the field of #multimodal #research. Stay tuned more will come about #MultimodalAssessment #AcademicSky. #PhD done 👍

4 0 1 0

Bundesministerium für Innovation, Mobilität & Infrastruktur

@bmimi.gv.at

2 weeks ago

Ein Foto von zwei Menschen auf zwei E-Scootern. Darunter steht der Text: Sicher, smart und nachhaltig unterwegs - Mobilitätskonferenz 2026 im Zeichen der Multimodalität

Was muss geschehen, damit Menschen und Güter wirklich #multimodal unterwegs sein können? 👤📦

Findet es heraus bei der #Mobilitätskonferenz 2026!

📆 Montag, 20.04.2026 bis Dienstag, 21.04.2026
📍Roomz; Rothschildplatz 2, 1020 Wien

Foto: © Adobe Stock

0 0 1 0

cannata_cargo_services

@cannatalogistics.bsky.social

2 weeks ago

Your cargo. No delays. No worries.
Your cargo doesn’t just move — it moves with precision, protection, and reliability.

Contact us today!

🌐 www.cannata.ae
📞 +971 4 239 7107

#cannatalogistics #doortodoor #shipment #logistics #freightforward #3PL #cargo #airfrieght #seafrieght #multimodal

0 0 0 0

Glen Jackson

@glenjackson.bsky.social

2 weeks ago

BEAUTY 💙
🇺🇸 #Multistudio designs towers with "crisp white skin" in Downtown Phoenix
A pair of white towers anchor a #mixeduse #project called #CentralStation which was designed to serve as a model for #multimodal #development" in #Phoenix #Arizona #Architecture #Design #Infrastructure
@dezeen.com

2 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

2 weeks ago

Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment

Joanna Hong, Sanjeel Parekh, Honglie Chen, Jacob Donley, Ke Tan, Buye Xu, Anurag Kumar

Action editor: Hanwang Zhang

https://openreview.net/forum?id=5bshBY8RDf

#multimodal #audiovisual

1 0 0 0

Cathy Wu

@cathywu.bsky.social

3 weeks ago

Parking-aware navigation system could prevent frustration and emissions By minimizing the need to drive around looking for a parking spot, this technique can save drivers up to 35 minutes — and give them a realistic estimate of total travel time.

More here: news.mit.edu/2026/parking....

Paper: arxiv.org/abs/2601.00521
Code: github.com/chickert/Pro...

Joint work with lead author Cameron Hickert, Sirui Li, Zhengbing He

#transportation #multimodal #parking #navigation #dynamicprogramming

6 0 0 0

Arif Solmaz

@arifsolmaz.bsky.social

3 weeks ago

Multi-modal RAG framework that handles text, images, audio, and video in one unified system - finall

Multi-modal RAG framework that handles text, images, audio, and video in one unified system - finally, RAG that works with everything, not just documents

https://github.com/HKUDS/RAG-Anything

#RAG #MultiModal #AI

1 0 0 0

LLMs

@llms.activitypub.awakari.com.ap.brid.gy

3 weeks ago

ByteDance Quietly Drops a New Large Language Model With Superior Visual Chops, Escalating the AI Arms Race With OpenAI and Google ByteDance has launched a new large language model with significantl...

#GenAIPro #AI #competition #Artificial #Intelligence […]

[Original post on webpronews.com]

0 0 0 0

Breakbulk.News

@breakbulk.bsky.social

3 weeks ago

Samskip sells UK and Ireland short sea freight business to CLdN Samskip sells UK and Ireland short sea freight business to CLdN, reshaping #NorthSea logistics. Deal covers Rotterdam UK Ireland routes, boosting #multimodal scale while refocusing Samskip on long distance corridors, says CEO Ólafsson. #shipping #shortsea

Samskip sells UK and Ireland short sea freight business to CLdN, reshaping #NorthSea logistics. Deal covers Rotterdam UK Ireland routes, boosting #multimodal scale while refocusing Samskip on long distance corridors, says CEO Ólafsson. #shipping #shortsea

0 0 0 0

TMLR Published Papers

@tmlr-pub.bsky.social

3 weeks ago

Federated Multimodal Fusion for Action Recognition Leveraging Vision-Language Embeddings and Spat...

Aditi Palit, Kalidas Yeturu

Action editor: Yu-Xiong Wang

https://openreview.net/forum?id=AobzdtqiMe

#cnn #multimodal #cnns

1 0 0 0

Evolution AI Hub

@evolutionaihub.bsky.social

3 weeks ago

Qwen3.5 Launches 397B Model With Only 17B Active Per Pass Alibaba’s Qwen3.5-397B-A17B launches with 397B total parameters but activates only 17B per pass, signaling a shift toward efficient, agent-ready multimodal AI systems.

397B parameters.
Only 17B active per pass.

Qwen3.5 isn’t just scaling up — it’s scaling smarter.
Sparse MoE + multimodal + 1M context.

The agent race just got more efficient.

#Qwen35 #AI #LLM #Multimodal #OpenWeights #AIInfrastructure
evolutionaihub.com/qwen3-5-397b...

1 0 0 0