VietMed: A Large-Scale Vietnamese Medical Speech Recognition Dataset and Benchmark
VietMed is the world's largest public medical speech recognition dataset for Vietnamese, comprising 16 hours of labeled medical speech, 1000 hours of unlabeled medical speech, and 1200 hours of unlabeled general-domain speech. It covers a wide range of medical conditions, recording conditions, speaker roles, and accents, enabling comprehensive research on Vietnamese medical speech recognition.