Artificial Intelligence

Speech-to-Text

AI technology that converts spoken audio into written text (also called automatic speech recognition or ASR). Modern systems are built to cope with varied accents, background noise, and multiple speakers, although accuracy still depends on recording quality.

Why It Matters

Speech-to-text enables voice commands, meeting transcription, voice search, and accessibility features. It is a core component of voice-first interfaces.

Example

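Whisper (by OpenAI) transcribing a one-hour meeting recording into punctuated text, even with some background noise. Whisper does not label speakers on its own, so speaker labels come from a separate diarization step. Accuracy can be high on clear recordings, but it varies with audio quality, language, and accent.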
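Accuracy for speech-to-text is commonly reported as word error rate (WER): the number of substituted, deleted, and inserted words divided by the number of words in a human reference transcript. The snippet below is a minimal sketch of that calculation, assuming simple lowercasing and whitespace tokenization. The function name `word_error_rate` and the sample sentences are illustrative and are not taken from any particular toolkit.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    # Assumption: lowercase + whitespace split is enough normalization for this sketch.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the word-level edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Illustrative sentences: one substitution ("the" -> "a") and one deletion ("morning").
reference = "please schedule the meeting for nine tomorrow morning"
hypothesis = "please schedule a meeting for nine tomorrow"
wer = word_error_rate(reference, hypothesis)
print(f"WER: {wer:.3f}")  # prints "WER: 0.250"
```

Here two errors against an eight-word reference give a WER of 0.25. An accuracy figure such as 95% is often used informally to mean a WER of about 0.05, that is, roughly one wrong word in twenty.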

Think of it like...

Like having a skilled stenographer who can transcribe a conversation as it happens, across many accents and in less-than-ideal audio.

Related Terms

Text-to-Speech

Natural Language Processing

Whisper