ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition • Paper • 2506.04635 • Published Jun 5, 2025