This repository contains the code for the Open ASR Leaderboard. The leaderboard is a Gradio Space that allows users to compare the accuracy of ASR models on a variety ...
This repository is a fork of the Open ASR Leaderboard repository from Hugging Face. The idea is to contain the benchmarks in a Dockerized environment while caching the models and datasets in a ...
Nvidia has entered the open-source speech recognition arena with Parakeet-TDT-0.6B-v2, an automatic speech recognition (ASR) model now hosted on Hugging Face. Beyond its accuracy ranking, Nvidia ...
At the heart of Parakeet TDT 0.6B’s appeal is its unmatched speed and transcription quality. The model can transcribe 60 minutes of audio in just one second, a performance that’s over 50x faster than ...
The Allen Institute for AI (AI2) has released OLMoASR, a suite of open automatic speech recognition (ASR) models that rival closed-source systems such as OpenAI’s Whisper. Beyond just releasing model ...
Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Nvidia has become one of the most valuable ...