Home New Trending Search
About Privacy Terms
#
#MultiModal
Posts tagged #MultiModal on Bluesky
Preview
Montana rail advocates pitch commuter service plan Dan Bucks of the Big Sky Passenger Rail Authority gave a presentation last week to residents and the Mineral County Economic Committee, providing an update on the rail authority and sharing a new addi...

The piece also highlights the broader effort to strengthen regional connections through passenger rail.

#BigSkyRail #PassengerRail #Montana #Rural #Tribal #MultiModal

5 1 0 0

🤖 Agentic AI: Autonomous agents optimize logistics.
📈 Scaling Laws: More compute brings expert-level AI.
🌐 Multimodal: AI combines text, images, and audio.
#AI2026 #AgenticAI #ScalingAI #Multimodal
#AI2026 #AgenticAI #ScalingAI #Multimodal
View in Timelines

0 0 0 0
Post image

SLAY-ASR, или как я перестал волноваться и полюбил тренировать модели Как добавить аудио-модальность в LLMку мак...

#representation #learning #multimodality #multimodal #llm #machine #learning #audio-modality #regularization #contrastive #learning

Origin | Interest | Match

0 0 0 0
Preview
HEARTS: Benchmarking LLM Reasoning on Health Time Series The rise of large language models (LLMs) has shifted time series analysis from narrow analytics to general-purpose reasoning. Yet, existing benchmarks cover only a small set of health time series moda...

🚀 HEARTS is built as a living ecosystem for the community: new data, new reasoning tasks, and new models can continue to be added over time!
➡️Paper: arxiv.org/abs/2603.06638

#AI #HealthAI #LLM #TimeSeries #Multimodal

1 0 0 0

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning ...

Jierun Chen, Tiezheng YU, Haoli Bai et al.

Action editor: Sylvain Le Corff

https://openreview.net/forum?id=XPML8UGI04

#reasoning #multimodal #verbosity

0 0 0 0
Post image

Google introduced Gemini Embedding 2, its first embedding model unifying text, image, video, audio, and document processing in one vector space.

Read Full Article: deccanfounders.com/2026/11/n...

#DeccanFounders #Google #Gemini #AI #GeminiEmbedding #MultiModal #RAG

0 0 0 0
Post image

🤖 Can we hack LLM prices?

💸 Here are some tips on how to radically reduce AI costs.

#ai #llm #python #rust #data #analysis #vertex #google #gemini #multimodal #bigquery #tips #image #video #social #media #batch #model #prompt #it #dev #programming #software #code #tech

0 0 0 0

SiLVR: A Simple Language-based Video Reasoning Framework

Ce Zhang, Yan-Bo Lin, Ziyang Wang, Mohit Bansal, Gedas Bertasius

Action editor: Anurag Arnab

https://openreview.net/forum?id=mQZbh9Zlbw

#multimodal #subtitles #captions

2 1 0 0

Charité/BIFOLD: W3 Prof in #DataEngineering in Health. Based at: Institute of Medical Informatics. Focus: advancing foundational and applied research in #clinical #dataManagement, #multimodal #machineLearning ...

@prof2prof.bsky.social #FacultyPosition #VacancyEdu #ScienceCareer #ScholarshipAlert

1 0 0 0

New #J2C Certification:

Rethinking the Mixture of Vision Encoders Paradigm for Enhanced Visual Understanding in Multimoda...

Mozhgan Nasr Azadani, James Riddell, Sean Sedwards, Krzysztof Czarnecki

https://openreview.net/forum?id=tgnTVmRybs

#multimodal #encoders #visual

0 0 0 0
Preview
Headline: Phi-4-Vision-Reasoning—training a multimodal reasoning model Vision-language models improve multimodal systems, but can make them slower, costlier, and harder to deploy. Learn how Phi-4-Vision-Reasoning, a compact multimodal reasoning model, blends strengths…

Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model

ift.tt/ti4Ou1U

#ai #phi4 #reasoning #aimodels #multimodal #msresearch

2 0 0 0
Post image

CBTN scientists are accelerating the pace of research discoveries. Explore how: https://monkeylink.co/dc5229 #UnitedWeCure #WeAreCBTN #MultiModal #MultiOmic #MultiDisciplinary

0 0 0 0
Preview
SleepLM: Natural-Language Intelligence for Human Sleep We present SleepLM, a family of sleep-language foundation models that enable human sleep alignment, interpretation, and interaction with natural language. Despite the critical role of sleep, learning-...

SleepLM points to a new direction for sleep AI🚀. Read all about it!
➡️Paper: arxiv.org/abs/2602.23605

Great work led by my students @ZongzheX2001, @ZitaoShuai, Eideen, and amazing collaborators @AysolaRavi and Rajesh!

More to come🌙

#AI #sleep #sensor #health #multimodal #LLMs

1 0 0 0
Post image

Meet us in Las Vegas next month to demo the latest #Multimodal and #AgenticAI capabilities of our Video Discovery Platform.

It's accelerating at-scale content search, creation, and distribution workflows for video teams 🔥

hubs.la/Q045G1GY0

#NABShow #NABShow2026 #NABShow26

0 0 0 0

Next Monday (9 March) at 3pm GMT, my student Tianxin Li will give a talk at @lancslinguistics.bsky.social, on #multimodal discourses of voluntary #childlessness on Chinese social media.

The talk is in person and on Teams: shorturl.at/2Wgi8

3 1 0 0

Seed 2.0 is a big swing from ByteDance Seed: a multimodal LLM tuned for long documents, complex charts & hour long videos. It ranks near the top of Arena leaderboards, hits SOTA on math/vision benchmarks & is built to power real Agent workflows, not just demos. #Seed2 #LLM #Multimodal #ByteDance

0 0 0 0
Post image Post image

Excited to launch the 1st MULTIMODAL AI FOR SYSTEMS BIOLOGY course with Dr. Himel Mallick!

Over 5 half-days, participants will learn to process, analyze, and integrate multimodal biological data using cutting-edge AI techniques.

#MultiModal #MultiOmics #AI #MachineLearning

1 0 0 0
Post image Post image

🚛 #Rotterdam to #Uzbekistan: 380-ton generator across 8,000km delivered!

Challenges:
🧊 -25°C temps, 15+cm snow
⏳ Tight deadlines
⛰️ Slippery mountain turns

How we did it:
🌡️ Real-time monitoring
🚛 #Multimodal transport
📋 72-hr pre-clearance
🔍 Custom-engineered solutions

0 0 0 0
Preview
Qwen 3.5's Multimodal Abilities Enhance Local LLM Experimentation Qwen 3.5's multimodal capabilities, particularly its ability to process both text and image inputs, are being integrated into local LLM setups using llama.cpp. This allows users to leverage the model's expanded utility in various applications, from single-card setups to multi-GPU rigs. Users ca

📰 Qwen 3.5's Multimodal Abilities Boost Local LLM Use

Qwen 3.5's multimodal capabilities, particularly its ability to process both text and image inputs, are being integrated into lo...

www.clawnews.ai/qwen-3-5s-multimodal-abi...

#AI #LLM #Multimodal

2 0 0 0

My research is about assessments of multimodal texts and relevant across different levels of education. I also introduce innovations for #MultimodalTranscription which may interest the field of #multimodal #research. Stay tuned more will come about #MultimodalAssessment #AcademicSky. #PhD done 👍

4 0 1 0
Ein Foto von zwei Menschen auf zwei E-Scootern. Darunter steht der Text: Sicher, smart und nachhaltig unterwegs - Mobilitätskonferenz 2026 im Zeichen der Multimodalität

Ein Foto von zwei Menschen auf zwei E-Scootern. Darunter steht der Text: Sicher, smart und nachhaltig unterwegs - Mobilitätskonferenz 2026 im Zeichen der Multimodalität

Was muss geschehen, damit Menschen und Güter wirklich #multimodal unterwegs sein können? 👤📦

Findet es heraus bei der #Mobilitätskonferenz 2026!

📆 Montag, 20.04.2026 bis Dienstag, 21.04.2026
📍Roomz; Rothschildplatz 2, 1020 Wien

Foto: © Adobe Stock

0 0 1 0
Video thumbnail

Your cargo. No delays. No worries.
Your cargo doesn’t just move — it moves with precision, protection, and reliability.

Contact us today!

🌐 www.cannata.ae
📞 +971 4 239 7107

#cannatalogistics #doortodoor #shipment #logistics #freightforward #3PL #cargo #airfrieght #seafrieght #multimodal

0 0 0 0
Post image Post image Post image Post image

BEAUTY 💙
🇺🇸 #Multistudio designs towers with "crisp white skin" in Downtown Phoenix
A pair of white towers anchor a #mixeduse #project called #CentralStation which was designed to serve as a model for #multimodal #development" in #Phoenix #Arizona #Architecture #Design #Infrastructure
@dezeen.com

2 0 0 0

Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment

Joanna Hong, Sanjeel Parekh, Honglie Chen, Jacob Donley, Ke Tan, Buye Xu, Anurag Kumar

Action editor: Hanwang Zhang

https://openreview.net/forum?id=5bshBY8RDf

#multimodal #audiovisual

1 0 0 0
Preview
Parking-aware navigation system could prevent frustration and emissions By minimizing the need to drive around looking for a parking spot, this technique can save drivers up to 35 minutes — and give them a realistic estimate of total travel time.

More here: news.mit.edu/2026/parking....

Paper: arxiv.org/abs/2601.00521
Code: github.com/chickert/Pro...

Joint work with lead author Cameron Hickert, Sirui Li, Zhengbing He

#transportation #multimodal #parking #navigation #dynamicprogramming

6 0 0 0
Multi-modal RAG framework that handles text, images, audio, and video in one unified system - finall

Multi-modal RAG framework that handles text, images, audio, and video in one unified system - finall

Multi-modal RAG framework that handles text, images, audio, and video in one unified system - finally, RAG that works with everything, not just documents

https://github.com/HKUDS/RAG-Anything

#RAG #MultiModal #AI

1 0 0 0
Post image

ByteDance Quietly Drops a New Large Language Model With Superior Visual Chops, Escalating the AI Arms Race With OpenAI and Google ByteDance has launched a new large language model with significantl...

#GenAIPro #AI #competition #Artificial #Intelligence […]

[Original post on webpronews.com]

0 0 0 0
Preview
Samskip sells UK and Ireland short sea freight business to CLdN Samskip sells UK and Ireland short sea freight business to CLdN, reshaping #NorthSea logistics. Deal covers Rotterdam UK Ireland routes, boosting #multimodal scale while refocusing Samskip on long distance corridors, says CEO Ólafsson. #shipping #shortsea

Samskip sells UK and Ireland short sea freight business to CLdN, reshaping #NorthSea logistics. Deal covers Rotterdam UK Ireland routes, boosting #multimodal scale while refocusing Samskip on long distance corridors, says CEO Ólafsson. #shipping #shortsea

0 0 0 0

Federated Multimodal Fusion for Action Recognition Leveraging Vision-Language Embeddings and Spat...

Aditi Palit, Kalidas Yeturu

Action editor: Yu-Xiong Wang

https://openreview.net/forum?id=AobzdtqiMe

#cnn #multimodal #cnns

1 0 0 0
Preview
Qwen3.5 Launches 397B Model With Only 17B Active Per Pass Alibaba’s Qwen3.5-397B-A17B launches with 397B total parameters but activates only 17B per pass, signaling a shift toward efficient, agent-ready multimodal AI systems.

397B parameters.
Only 17B active per pass.

Qwen3.5 isn’t just scaling up — it’s scaling smarter.
Sparse MoE + multimodal + 1M context.

The agent race just got more efficient.

#Qwen35 #AI #LLM #Multimodal #OpenWeights #AIInfrastructure
evolutionaihub.com/qwen3-5-397b...

1 0 0 0