Home New Trending Search
About Privacy Terms
#
#AudioAI
Posts tagged #AudioAI on Bluesky
Preview
Sam Audio Large: Isolate Sound With Precision - Ai Adoption Agency Sam Audio Large is an AI model that lets you isolate any sound from complex audio with text, visual or time-based prompts, transforming audio cleaning, music production and content workflows across me...

Sam Audio Large: Isolate ANY sound from recordings.
Text prompts.
Visual prompts.
Time prompts.
General-purpose.
Near real-time.
Target + residual output.

#SamAudioLarge #AudioAI #AITools #AIForBusiness

Read more:
aiadoptionagency.com/sam-audio-la...

0 0 0 0
Post image Post image Post image Post image

From Semantics to Trajectories: Reimagining the Spatial Audio Workflow with Generative SPATAI


A new Sounding Future article by Sinan Bökesoy:
www.soundingfuture.com/en/ar...

#SpatialAudio #ImmersiveAudio #3DAudio #Ambisonics #AudioTech #SoundDesign #ObjectBasedAudio
#AudioAI #SpatAI #sonicLAB

2 0 0 0

Kudos to the authors Xinhao Mei, Varun Nagaraja, et al.
Read the paper: arxiv.org/pdf/2601.12594

#ai #audioai #research #foundationmodels #equalyzai

0 0 0 0
Preview
Nova SR: Clear & Enhance Speech - Ai Adoption Agency Imagine you recorded a friend talking in a noisy kitchen with an old phone. The voice sounds small and cloudy, and you can hear the room more than the person. Nova SR is like a magic cleaner that take...

Nova SR: Turn muffled speech into crystal-clear audio.

16kHz → 48kHz.
Real-world ready.
Fraction of a cent/sec.
Simple API.
Boosts ASR.
Perfect for contact centers, podcasters, e-learning, sales.
#NovaSR #SpeechEnhancement #AudioAI

Read the full article here:

aiadoptionagency.com/nova-sr-clea...

1 0 0 0
Preview
Deepfilternet 3: Noise Suppression - Ai Adoption Agency Deepfilternet 3 is a compact deep learning model that delivers strong real time noise suppression for speech, making calls, streams and recordings clearer without expensive hardware or heavy compute o...

Sam Audio: Isolate ANY sound from recordings.

Text prompts.
Video prompts.
Time prompts.
General-purpose.
Real-time speed.
Target + residual output.

Perfect for podcasters, musicians, gamers, creators.

#SamAudio #AudioAI #AITools

Read the full article here:

aiadoptionagency.com/deepfilterne...

1 0 0 0
Post image

Sam Audio: Isolate ANY sound from recordings.

Text prompts.
Video prompts.
Time prompts.
General-purpose.
Real-time speed.

Perfect for podcasters, musicians, gamers, creators.

sam.audio

#SamAudio #AudioAI #AITools

Read the full article here:

aiadoptionagency.com/sam-audio-th...

1 0 0 0
Preview
OpenAI Lines Up New Audio AI For Early 2026 As Voice Takes Center Stage OpenAI is preparing a new audio-focused AI for early 2026, signaling a major shift toward voice-first interaction and future AI devices.

OpenAI is gearing up for its next audio leap.
A new voice-focused AI, expected in early 2026, hints at a future where talking—not typing—becomes the default interface.
Voice is back in the spotlight.

#OpenAI #AudioAI #VoiceAI #AI2026
evolutionaihub.com/openai-new-a...

1 0 0 0
Preview
What Is OpenAI's Big Bet on Audio as Screens Become Outdated? OpenAI is overhauling its audio AI models as Silicon Valley shifts toward screenless, audio-first devices and interfaces.

OpenAI is rebuilding its audio AI stack as Silicon Valley shifts away from screens. Voice is becoming the main interface.

itmatterss.in/global/opena...

#OpenAI #AudioAI #FutureOfTech #AI

2 0 0 0

#OpenAI is prioritising #audioAI, unifying teams to develop an #audiofirst #personaldevice expected in a year. This aligns with the tech industry’s shift towards #audiointerfaces, with companies like Meta, Google, and Tesla integrating #voiceassistants into various devices. OpenAI’s new model,…

2 0 0 0

Meta dévoile SAM Audio, nouveau modèle d’IA capable d’isoler des sons dans des mélanges audio complexes 🎧🤖 Grâce à des instructions textuelles, visuelles ou temporelles ⏱️👀, il ouvre de nouvelles possibilités pour la post-production sonore 🎶✨"

#AudioAI

Détails : www.prompt9000.com/actus-ia-en-...

0 0 0 0
Post image

If you actually want to see and hear and talk to real people, hit this up.

#AudioAI #AiforAudio #Interactive #GameAudio #IASIG #AIWG #AIWorkingGroup #AIsounddesign #AImusic

www.eventbrite.com/e/ai-for-int...

4 1 0 0
Post image

🎧 Voiser AI converts text into natural-sounding voiceovers — ideal for creators, educators & brands.#AI #VoiserAI #VoiceAI #AudioAI #Automation #Creativity #AItools #Innovation #TechTrends #DigitalAudio

2 0 0 0
Post image

💡 Suno AI transforms your text into songs with vocals, beats & emotion — compose full tracks in minutes!#AI #SunoAI #MusicAI #AudioAI #Creativity #Automation #AItools #Innovation #DigitalMusic #TechTrends

1 0 0 0

Current audio models struggle with human-like nuances: pitch, emotion, and accents. This is due to learning difficulty vs. text, reliance on synthetic data, and even intentional safeguards to prevent misuse. #AudioAI 2/5

0 0 1 0
Post image

🎧 Podcastle AI records, edits, and enhances your podcast with studio-quality sound — all in your browser. Create like a pro, effortlessly!#AI #Podcastle #AudioAI #ContentCreation #Automation #AItools #Innovation #TechTrends #Podcasting

2 0 0 0
Post image

🎧 Descript revolutionizes video & podcast editing. Edit by changing text, remove filler words automatically, and use AI voices to fix recordings seamlessly.
#AI #Descript #VideoEditing #PodcastTools #AudioAI #Productivity #Innovation

2 0 0 0
Audio-Reasoner Boosts Reasoning Skills in Large Audio Language Models

Audio-Reasoner Boosts Reasoning Skills in Large Audio Language Models

Audio-Reasoner, a language model trained on the CoTA dataset of 1.2 million samples, improves benchmarks by +25.42% on MMAU‑mini and +14.57% on AIR‑Bench chat. Read more: getnews.me/audio-reasoner-boosts-re... #audioreasoner #audioai #multimodal

1 0 0 0
Latent Bridge Models Boost Audio Super-Resolution Quality

Latent Bridge Models Boost Audio Super-Resolution Quality

Latent Bridge Models enable audio super‑resolution up to 192 kHz, setting state‑of‑the‑art scores for any‑to‑48 kHz across speech, music and environmental sounds. getnews.me/latent-bridge-models-boo... #latentsuperresolution #audioai

0 0 0 0
Huxe launches audio‑AI app for news briefs and deep‑dive podcasts

Huxe launches audio‑AI app for news briefs and deep‑dive podcasts

Ex‑Google NotebookLM engineers launched Huxe, an audio‑first app that turns emails, calendars and web topics into spoken briefings. It's free on iOS and Android and raised $4.6 million. Read more: getnews.me/huxe-launches-audio-ai-a... #huxe #audioai

0 0 0 0
Benchmark Targets Speech, Scene & Event Understanding in Audio AI

Benchmark Targets Speech, Scene & Event Understanding in Audio AI

SSEU‑Bench tests speech, scene and event understanding with independent, joint and energy‑aware settings; chain‑of‑thought prompts boost joint task performance. Submitted 16 Sep 2025. Read more: getnews.me/benchmark-targets-speech... #audioai #sseubench

0 0 0 0
FCPE Model Offers Fast, Accurate Pitch Estimation for Audio

FCPE Model Offers Fast, Accurate Pitch Estimation for Audio

FCPE reaches 96.79% raw pitch accuracy on the MIR‑1K dataset and runs with a 0.0062 real‑time factor on an RTX 4090, enabling faster‑than‑real‑time processing. Read more: getnews.me/fcpe-model-offers-fast-a... #fcpe #audioai

0 0 0 0
Preview
Voxtral | Mistral AI Introducing frontier open source speech understanding models.

🗣️ Mistral se lanza al audio con Voxtral, su primer modelo de voz open-source. Promete transcripción y Q&A nativo a bajo coste. ¡A probarlo! #Mistral #OpenSource #AudioAI

4 1 0 0
Post image

Voxtral: Open Source AI Audio Model—Capabilities, Features, and How to Access.

See here - techchilli.com/artificial-i...

#Voxtral #AI2025 #AudioAI #MistralAI #OpenSource

1 1 0 0
Preview
Voxtral | Mistral AI Introducing frontier open source speech understanding models.

🚀 Mistral lance Voxtral, un modèle audio IA open source performant et accessible ! 🎙️ Transcription, compréhension, résumé en temps réel, multilingue 🌍. Une alternative économique aux solutions fermées. Découvrez-le ! 👇 #IA #OpenSource #AudioAI #Innovation
mistral.ai/fr/news/voxt...

4 1 0 0
Preview
Assessing the Alignment of Audio Representations with Timbre Similarity Ratings Psychoacoustical so-called "timbre spaces" map perceptual similarity ratings of instrument sounds onto low-dimensional embeddings via multidimensional scaling, but suffer from scalability issues and a...

🎶 New paper alert!
Do AI audio embeddings *hear* timbre like we do?
➡️ Benchmarked 18 reps vs 2.6 K human ratings (21 datasets)
🏅 Style embeddings from CLAP & our sound-matching model are best aligned!
Paper: arxiv.org/abs/2507.07764
#ISMIR2025 #MIR #AudioAI #SonyCSLMusic

3 0 1 1
Preview
Bộ trưởng Giáo dục lên tiếng về Thông tư 29/2024: ‘Oan cho một số địa phương nếu nói dạy thêm, học thêm không hiệu quả’ – #AudioAI #QuốcHội #ChấtVấn #BộTrưởngGDĐT 1 giờ trước1 liên quanGốcBộ trưởng Bộ Giáo dục và Đào tạo Nguyễn Kim Sơn cho rằng nếu nói Thông tư 29/2024 về dạy thêm, học thêm không hiệu quả là oan cho một số tỉnh, thành. Vệ Loan - Bộ Giáo dục và Đào tạoAudio AIThông tư 29/2024dạy thêmhọc thêmNguyễn Kimchất vấnBộ trưởng Bộ Giáo dụcBộ Giáo dục và Đào tạooanQuốc hội Nguồn NLĐ:

Bộ trưởng Giáo dục lên tiếng về Thông tư 29/2024: ‘Oan cho một số địa phương nếu nói dạy thêm, học thêm không hiệu quả’ – #AudioAI #QuốcHội #ChấtVấn #BộTrưởngGDĐT

1 giờ trước1 liên quanGốcBộ trưởng Bộ Giáo dục và Đào tạo Nguyễn Kim Sơn cho rằng nếu nói Thông tư 29/2024 về dạy thêm, học thêm không…

0 0 0 0
Preview
GitHub - Jeremy-Harper/chatterboxPro: audiobook GUI for chatterbox audiobook GUI for chatterbox. Contribute to Jeremy-Harper/chatterboxPro development by creating an account on GitHub.

Audiobook Generator GUI that can clone your voice like 11Labs but hosted locally. Fun little project so I could listen to the books I've written and make sure everything sounded right.

github.com/Jeremy-Harpe...

#chatterbox
#audiobook
#author
#localllama
#audible
#audioAI

1 0 0 0
Bí Ẩn Miền Tây: “Nổ” Giải Đặc Biệt Xổ Số – Âm Thanh AI hé lộ điều gì? #XổSố #MiềnTây #AudioAI #BíẨn #GiảiĐắcBiệt #TinTức Bí Ẩn Miền Tây: "Nổ" Giải Đặc Biệt Xổ Số - Âm Thanh AI hé lộ điều gì? #XổSố #MiềnTây #AudioAI #BíẨn #GiảiĐắcBiệt #TinTức Phát hiện gây chấn động: Sự kiện xổ số miền Tây liên tiếp "gây bão" với giải đặc biệt không chỉ dừng lại ở vé số truyền thống mà còn lan rộng sang xổ số Vietlott, đang thu hút sự chú ý của giới chuyên môn và công chúng.

Bí Ẩn Miền Tây: “Nổ” Giải Đặc Biệt Xổ Số – Âm Thanh AI hé lộ điều gì? #XổSố #MiềnTây #AudioAI #BíẨn #GiảiĐắcBiệt #TinTức

Bí Ẩn Miền Tây: "Nổ" Giải Đặc Biệt Xổ Số - Âm Thanh AI hé lộ điều gì? #XổSố #MiềnTây #AudioAI #BíẨn #GiảiĐắcBiệt #TinTức Phát hiện gây chấn động: Sự kiện xổ số miền Tây liên…

0 0 0 0
Preview
Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds Audio DNNs have demonstrated impressive performance on various machine listening tasks; however, most of their representations are computationally costly and uninterpretable, leaving room for optimiza...

I’m excited to share one of two papers accepted to #Interspeech2025! @interspeech.bsky.social

“Spectrotemporal Modulation: Efficient & Interpretable Feature Representation for Classifying Speech, Music & Environmental Sounds”
📄 Paper: arxiv.org/abs/2505.23509
#NeuroInspiredML #AudioAI

3 1 1 0
Preview
Audio AI: Khám phá câu chuyện đầy mê hoặc giữa mưa – nắng và bí mật đằng sau cầu vồng 🌈 #AudioAI Audio AI: Câu chuyện mưa - nắng và bí mật cầu vồng

Audio AI: Khám phá câu chuyện đầy mê hoặc giữa mưa – nắng và bí mật đằng sau cầu vồng 🌈 #AudioAI

Audio AI: Câu chuyện mưa - nắng và bí mật cầu vồng

0 0 0 0