News

Google Translate introduces real-time live translation in 70 languages and launches AI-powered language practice for users ...
While browsers are marching toward supporting speech recognition and more futuristic capabilities, web application developers are typically constrained to the keyboard and mouse. But what if we ...
It notes that smaller organizations have a distinct disadvantage when trying to develop models for speech recognition, because the most comprehensive datasets available have always had high ...
OpenAI has released Whisper, a robust speech recognition model that can understand and transcribe multiple languages.
Closed captioning to convey the speech of TV programs by text is becoming a useful means of providing information for elderly people and the hearing impaired, and real-time captioning of live ...
Meta has created an AI language model that (in a refreshing change of pace) isn’t a ChatGPT clone. The company’s Massively Multilingual Speech (MMS) project can recognize over 4,000 spoken ...
Mistral's open-source speech model Voxtral can recognize multiple languages, understand spoken instructions and also offer enterprise security.
Google has open-sourced the speech engine that powers its Android speech recognition transcription tool Live Transcribe on GitHub.