Speech Recognition API

The secret to powering web apps with full speech recognition

A few months ago, I wrote an article on web speech recognition using TensorflowJS. Even though it was super interesting to implement, it was cumbersome for many of you to extend. The reason was pretty ...

Geeky Gadgets

Universal 2: Next Generation AI Speech-to-Text Technology Demonstrated

Universal 2 represents a major advancement in AI speech-to-text technology, offering unmatched accuracy and flexibility across a broad array of audio processing tasks. Trained on an extensive dataset ...

InfoQ

OpenAI Launches Public Beta of Realtime API for Low-Latency Speech Interactions

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

Yahoo Finance

OpenAI launches DALL-E 3 API, new text-to-speech models

OpenAI launched a slew of new APIs during its first-ever developer day. The DALL-E 3 API offers different format and quality options and resolutions ranging from 1024x1024 to 1792x1024, with prices ...

InfoWorld

OpenAI previews Realtime API for speech-to-speech apps

Realtime API supports multi-model text and speech experiences including natural speech-to-speech conversations using preset voices already supported in the API. OpenAI has introduced a public beta of ...

Digit

Microsoft gives developers access to its image, speech recognition APIs

Microsoft has released new machine-learning APIs in beta, which can calculate a person's age based on their photograph. Microsoft How-Old.net demo under its Project Oxford program went viral a day ...

EDN

IoT: GenAI voice helps generate speech recognition models

A new generative AI feature brings voice recognition to tiny devices with a text-to-speech (TTS) synthetic dataset generation capability. It enables developers to generate synthetic speech data with ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results