#SpeechProcessing — Bluesky Posts

1 month ago

Speech sounds are a blur—here’s how your brain sorts them out Speech blurs together unless you know the language; scientists found the brain signal that separates the words

New research reveals how the brain separates speech into words #Science #Biology #Neurobiology #Neuroscience #SpeechProcessing #BrainResearch

www.scientificamerican.com/article/new-research-rev...

1 1 0 0

Pure Science

@purescience.news

3 months ago

Staring Intently Helps The Brain Process Difficult Speech All the science news you can handle in a single feed

Staring Intently Helps The Brain Process Difficult Speech #Science #HealthandMedicine #Neurology #BrainHealth #SpeechProcessing #CognitiveScience

3 2 0 0

Maria Teleki

@mariateleki.bsky.social

4 months ago

We can’t fix what we don’t measure.

That’s why I build evaluation frameworks for speech & conversational AI — so we can stress-test systems against real-world variability.

#AIResearch #Evaluation #SpeechProcessing

0 0 0 0

Maria Teleki

@mariateleki.bsky.social

5 months ago

These insights still apply to anyone working on conversational AI, spoken summarization, or voice-driven interfaces today.

📄 Read more: www.isca-archive.org/interspeech_...

#SpeechProcessing #ConversationalAI #VoiceAI #Disfluency #SpokenLanguage
#INTERSPEECH

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Deep Learning Survey Explores Complex Speech Spectrograms

Survey finds complex‑valued neural networks with complex convolutions and phase‑aware activations improve speech enhancement and speaker separation. Read more: getnews.me/deep-learning-survey-exp... #deeplearning #speechprocessing

1 0 0 0

Maria Teleki

@mariateleki.bsky.social

5 months ago

Speech isn’t perfect.
We restart, repeat, and slip.

For AI, those little disfluencies can cause big problems.
That’s why my research builds methods to make spoken language systems more robust.

#SpeechProcessing #ConversationalAI #NLP #AI

1 0 0 0

Maria Teleki

@mariateleki.bsky.social

5 months ago

🌱 The takeaway: model selection is critical for real-world conversational AI.

📄 Full paper & code: mariateleki.github.io/pdf/HorrorCo...

#SpeechProcessing #ConversationalAI #VoiceAI #Disfluency #SpokenLanguage
#INTERSPEECH #ICASSP

1 0 0 0

Maria Teleki

@mariateleki.bsky.social

5 months ago

📄 Paper/code: www.isca-archive.org/interspeech_...

#SpeechProcessing #ConversationalAI #VoiceAI #Disfluency #SpokenLanguage

2 0 0 0

Maria Teleki

@mariateleki.bsky.social

5 months ago

#INTERSPEECH2025 #ConversationalAI #SpeechProcessing #RecommenderSystems

1 0 0 0

Maria Teleki

@mariateleki.bsky.social

6 months ago

Curious to hear how others in speech/NLP are thinking about discourse as a bias signal!

#FairnessInAI #SpeechProcessing #OpenScience #ComputationalSocialScience

2 0 0 0

@arxivlens.bsky.social

6 months ago

LLaSO: A Foundational Framework for Reproducible Research in Large
Language and Speech Model
Jinghan Yang, Peidong Wei et al.
Paper
Details
#ReproducibleResearch #LargeLanguageModels #SpeechProcessing

0 0 0 0

stek_fbk

@speechtekfbk.bsky.social

7 months ago

We have an open position on resource-aware and dynamic speech processing within the IPCEI-CIS project. It is about cutting edge ML applied to speech processing. It will be fun. Details:https://shorturl.at/E4DXR
DM me for any request. #ASR #speechprocessing #AI #ML

1 2 0 0

Ronan

@ronan.mastodon.ronandev.ovh.ap.brid.gy

7 months ago

Voxtral – 前沿开源语音理解模型 Voxtral – Frontier open source speech understanding models (mistral.ai) 07-15 ↑ 103 HN Points

https://mistral.ai/news/voxtral

#SpeechProcessing #AudioProcessing

1 0 0 0

Ronan

@ronan.mastodon.ronandev.ovh.ap.brid.gy

7 months ago

Voxtral We present Voxtral Mini and Voxtral Small, two multimodal audio chat models. Voxtral is trained to comprehend both spoken audio and text documents, achieving state-of-the-art performance across a diverse range of audio benchmarks, while preserving strong text capabilities. Voxtral Small outperforms a number of closed-source models, while being small enough to run locally. A 32K context window enables the model to handle audio files up to 40 minutes in duration and long multi-turn conversations. We also contribute three benchmarks for evaluating speech understanding models on knowledge and trivia. Both Voxtral models are released under Apache 2.0 license.

https://arxiv.org/abs/2507.13264

#SpeechProcessing

0 0 1 0

ELOQUENCEAI

@eloquenceai.bsky.social

8 months ago

📢 The Jelinek Summer Workshop on Speech and Language Technology (JSALT 2025) starts today!

👉 More info: eloquenceai.eu/event/jeline...

#ELOQUENCEAI #SpeechProcessing #SpeechTechnology #Workshop

2 0 0 0

Proceedings of the IEEE

@proceedingsieee.bsky.social

9 months ago

We are pleased to introduce a member of our Editorial Board, Isabel Trancoso. She is a full professor @istecnico.bsky.social and former President of the Scientific Council of INESC ID Lisbon. With her passion and expertise in #speechprocessing, she brings valuable insights to our editorial board.

1 1 0 0

MT Group at FBK

@fbk-mt.bsky.social

10 months ago

🎉 Excited to share that our @sarapapi.bsky.social has won the 2024 Best PhD Award from the Information and Engineering Doctoral School for her thesis “Direct Speech Translation in Constrained Contexts: The Simultaneous and Subtitling Scenarios.”

#nlproc #speech #speechprocessing #speechtranslation

6 1 0 1

@talkthatscience.bsky.social

10 months ago

Don't miss today's episode with @mdhk.net as we discuss her work on AI and speech processing. She even brought in slides 🧠🗣️🤖🖼️

12:00PM at Echobox Radio

#Linguistics #CognitiveScience #ArtificialIntelligence #SpeechProcessing #AIResearch #UniversityofAmsterdam #NewEpisode #MarianneDeHeerKloots

5 1 1 0

MT Group at FBK

@fbk-mt.bsky.social

11 months ago

Our pick of the week by @mgaido91.bsky.social: "OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis" by Luo et al. (2025)

#SpeechProcessing #LLM #SFM #NLProc #speechtech #audio

3 0 0 0

kit

@kittog.bsky.social

11 months ago

repo will be up on git soon, but basically ive managed to put together two methods to compute articulation rate:
- with onset detection (Librosa)
- with forced alignment (MFA)
this is a bit... experimental for sure... but exciting nonetheless! #NLP #speechprocessing

0 0 0 0

MT Group at FBK

@fbk-mt.bsky.social

11 months ago

Our pick of the week by @zhihangxie.bsky.social: "Bridging Speech and Text Foundation Models with ReShape Attention" by Takatomo Kano, @wanchichen.bsky.social, @shinjiw.bsky.social, et al. #ICASSP2025

ieeexplore.ieee.org/document/108...

#FoundationModel #SpeechProcessing

3 0 0 0

PsyPost

@psypost.bsky.social

1 year ago

New neuroscience research upends traditional cognitive models of reading A new study finds that the left posterior inferior frontal cortex activates within 100 milliseconds during reading, playing a critical, early role in turning text into speech, challenging traditional models that assumed a slower, step-by-step process.

A new study finds that the left posterior inferior frontal cortex activates within 100 milliseconds during reading, playing a critical, early role in turning text into speech, challenging traditional models that… #Neuroscience #CognitiveScience #ReadingResearch #BrainActivation #SpeechProcessing

17 5 0 0

Benjamin Getenet

@benjamingetenet.fr

1 year ago

Brain mapping advances understanding of human speech and hallucinations in schizophrenia Voice experiments in people with epilepsy have helped trace the circuit of electrical signals in the brain that allow its hearing center to sort out background sounds from their own voices.

New research from NYU Langone sheds light on how the brain distinguishes self-generated speech from external sounds. Findings link disruptions in this process to auditory hallucinations in schizophrenia, offering hope for innovative therapies.
#Neuroscience #Schizophrenia #SpeechProcessing

11 2 1 2