
This demo compares the transcription performance of several automatic speech recognition (ASR) models. Users can select age, gender, and accent to generate diverse English audio samples, and each model is evaluated on how accurately it transcribes them. The data comes from 249 validated entries in the Common Voice English Delta Segment 21.0 release.
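The description does not say which metric the demo uses to score transcriptions. A common choice for ASR comparisons is word error rate (WER), the word-level edit distance between a reference transcript and a model's output, divided by the number of reference words. The sketch below is a minimal illustration under that assumption; the function name `word_error_rate`, the lowercase-and-split normalization, and the model labels in the usage example are all hypothetical and not taken from the demo.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: text is normalized only by lowercasing and whitespace
    splitting. The demo may normalize differently or use another metric.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in the output
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical outputs from two models for the same audio sample.
reference = "the quick brown fox"
outputs = {
    "model_a": "the quick brown fox",
    "model_b": "the quick brown box",
}
scores = {name: word_error_rate(reference, text) for name, text in outputs.items()}
```

A lower score means a closer match to the reference. Because insertions count as errors, WER can exceed 1.0 when a model outputs many extra words.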

Comparing ASR Models on Diverse English Speech Samples