#SpeechProcessing
Posts tagged #SpeechProcessing on Bluesky
Speech sounds are a blur—here’s how your brain sorts them out Speech blurs together unless you know the language; scientists found the brain signal that separates the words

New research reveals how the brain separates speech into words #Science #Biology #Neurobiology #Neuroscience #SpeechProcessing #BrainResearch

www.scientificamerican.com/article/new-research-rev...

Staring Intently Helps The Brain Process Difficult Speech #Science #HealthandMedicine #Neurology #BrainHealth #SpeechProcessing #CognitiveScience


We can’t fix what we don’t measure.

That’s why I build evaluation frameworks for speech & conversational AI — so we can stress-test systems against real-world variability.

#AIResearch #Evaluation #SpeechProcessing


These insights still apply to anyone working on conversational AI, spoken summarization, or voice-driven interfaces today.

📄 Read more: www.isca-archive.org/interspeech_...

#SpeechProcessing #ConversationalAI #VoiceAI #Disfluency #SpokenLanguage
#INTERSPEECH

Deep Learning Survey Explores Complex Speech Spectrograms

Survey finds complex‑valued neural networks with complex convolutions and phase‑aware activations improve speech enhancement and speaker separation. Read more: getnews.me/deep-learning-survey-exp... #deeplearning #speechprocessing


Speech isn’t perfect.
We restart, repeat, and slip.

For AI, those little disfluencies can cause big problems.
That’s why my research builds methods to make spoken language systems more robust.

#SpeechProcessing #ConversationalAI #NLP #AI


🌱 The takeaway: model selection is critical for real-world conversational AI.

📄 Full paper & code: mariateleki.github.io/pdf/HorrorCo...

#SpeechProcessing #ConversationalAI #VoiceAI #Disfluency #SpokenLanguage
#INTERSPEECH #ICASSP


📄 Paper/code: www.isca-archive.org/interspeech_...

#SpeechProcessing #ConversationalAI #VoiceAI #Disfluency #SpokenLanguage


#INTERSPEECH2025 #ConversationalAI #SpeechProcessing #RecommenderSystems


Curious to hear how others in speech/NLP are thinking about discourse as a bias signal!

#FairnessInAI #SpeechProcessing #OpenScience #ComputationalSocialScience


LLaSO: A Foundational Framework for Reproducible Research in Large Language and Speech Model
Jinghan Yang, Peidong Wei et al.
#ReproducibleResearch #LargeLanguageModels #SpeechProcessing


We have an open position on resource-aware and dynamic speech processing within the IPCEI-CIS project. It is about cutting-edge ML applied to speech processing. It will be fun. Details: https://shorturl.at/E4DXR
DM me for any request. #ASR #speechprocessing #AI #ML

Voxtral – Frontier open source speech understanding models (mistral.ai) 07-15 ↑ 103 HN Points

https://mistral.ai/news/voxtral

#SpeechProcessing #AudioProcessing

Voxtral We present Voxtral Mini and Voxtral Small, two multimodal audio chat models. Voxtral is trained to comprehend both spoken audio and text documents, achieving state-of-the-art performance across a diverse range of audio benchmarks, while preserving strong text capabilities. Voxtral Small outperforms a number of closed-source models, while being small enough to run locally. A 32K context window enables the model to handle audio files up to 40 minutes in duration and long multi-turn conversations. We also contribute three benchmarks for evaluating speech understanding models on knowledge and trivia. Both Voxtral models are released under Apache 2.0 license.

https://arxiv.org/abs/2507.13264

#SpeechProcessing


📢 The Jelinek Summer Workshop on Speech and Language Technology (JSALT 2025) starts today!

👉 More info: eloquenceai.eu/event/jeline...

#ELOQUENCEAI #SpeechProcessing #SpeechTechnology #Workshop


We are pleased to introduce a member of our Editorial Board, Isabel Trancoso. She is a full professor @istecnico.bsky.social and former President of the Scientific Council of INESC ID Lisbon. With her passion and expertise in #speechprocessing, she brings valuable insights to our editorial board.


🎉 Excited to share that our @sarapapi.bsky.social has won the 2024 Best PhD Award from the Information and Engineering Doctoral School for her thesis “Direct Speech Translation in Constrained Contexts: The Simultaneous and Subtitling Scenarios.”

#nlproc #speech #speechprocessing #speechtranslation


Don't miss today's episode with @mdhk.net as we discuss her work on AI and speech processing. She even brought in slides 🧠🗣️🤖🖼️

12:00PM at Echobox Radio

#Linguistics #CognitiveScience #ArtificialIntelligence #SpeechProcessing #AIResearch #UniversityofAmsterdam #NewEpisode #MarianneDeHeerKloots


Our pick of the week by @mgaido91.bsky.social: "OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis" by Luo et al. (2025)

#SpeechProcessing #LLM #SFM #NLProc #speechtech #audio


repo will be up on git soon, but basically ive managed to put together two methods to compute articulation rate:
- with onset detection (Librosa)
- with forced alignment (MFA)
this is a bit... experimental for sure... but exciting nonetheless! #NLP #speechprocessing
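For readers unfamiliar with the metric: articulation rate is usually taken as syllables per second of actual speaking time, with pauses left out. The repo mentioned above is not public yet, so the following is only a rough sketch of how the two approaches could look once the upstream tools have run; it is not the author's implementation. Everything here is an assumption: the function names, the silence labels, the 0.5 s pause threshold, and the idea that the aligner output has already been reduced to one interval per syllable nucleus. Onset times could come from something like `librosa.onset.onset_detect(..., units="time")`, and the intervals from a forced-alignment TextGrid.

```python
from typing import Iterable, Sequence, Tuple

# (label, start_sec, end_sec); assumed to be one interval per syllable nucleus
Interval = Tuple[str, float, float]

# Hypothetical labels treated as silence/pause; adjust to the aligner's output.
SILENCE_LABELS = {"", "sil", "sp", "spn"}


def rate_from_alignment(intervals: Iterable[Interval]) -> float:
    """Syllables per second of speaking time, skipping silence intervals."""
    count = 0
    speaking_time = 0.0
    for label, start, end in intervals:
        if label.strip().lower() in SILENCE_LABELS:
            continue  # pauses do not count toward articulation rate
        if end <= start:
            raise ValueError(f"bad interval for {label!r}: {start}..{end}")
        count += 1
        speaking_time += end - start
    return count / speaking_time if speaking_time > 0 else 0.0


def rate_from_onsets(onsets: Sequence[float], max_gap: float = 0.5) -> float:
    """Approximate rate from onset times (seconds).

    Each gap between consecutive onsets is treated as one syllable;
    gaps longer than max_gap are assumed to contain a pause and are dropped.
    """
    times = sorted(onsets)
    gaps = [b - a for a, b in zip(times, times[1:])]
    kept = [g for g in gaps if 0 < g <= max_gap]
    total = sum(kept)
    return len(kept) / total if total > 0 else 0.0


if __name__ == "__main__":
    aligned = [("ba", 0.0, 0.2), ("sil", 0.2, 1.0),
               ("na", 1.0, 1.25), ("na", 1.25, 1.55)]
    print(rate_from_alignment(aligned))
    print(rate_from_onsets([0.0, 0.25, 0.5, 2.0, 2.25]))
```

The onset-based estimate is cruder because onsets are only a proxy for syllables, so the two numbers would not be expected to agree exactly on real recordings.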


Our pick of the week by @zhihangxie.bsky.social: "Bridging Speech and Text Foundation Models with ReShape Attention" by Takatomo Kano, @wanchichen.bsky.social, @shinjiw.bsky.social, et al. #ICASSP2025

ieeexplore.ieee.org/document/108...

#FoundationModel #SpeechProcessing

New neuroscience research upends traditional cognitive models of reading A new study finds that the left posterior inferior frontal cortex activates within 100 milliseconds during reading, playing a critical, early role in turning text into speech, challenging traditional models that assumed a slower, step-by-step process.

#Neuroscience #CognitiveScience #ReadingResearch #BrainActivation #SpeechProcessing

Brain mapping advances understanding of human speech and hallucinations in schizophrenia Voice experiments in people with epilepsy have helped trace the circuit of electrical signals in the brain that allow its hearing center to sort out background sounds from their own voices.

New research from NYU Langone sheds light on how the brain distinguishes self-generated speech from external sounds. Findings link disruptions in this process to auditory hallucinations in schizophrenia, offering hope for innovative therapies.
#Neuroscience #Schizophrenia #SpeechProcessing
