Related Videos
42:53
Google Researcher's In-Depth Analysis on End-to-End Speech Recognition, Part 1: Overview & Modeling
1:40:04
Olewave's most detailed illustration of RNN-T: Sequence Transduction with Recurrent Neural Networks
1:33:00
[Olewave's Review] CLIP (3/3): Learning Transferable Visual Models From Natural Language Supervision
1:38:05
[Olewave's Review] CLIP (2/3): Learning Transferable Visual Models From Natural Language Supervision
44:26
[Olewave's Review] OpenAI's Whisper ASR: Robust Speech Recognition via Large-Scale Weak Supervision
55:00
[Olewave's Review] CLIP (1/3): Learning Transferable Visual Models From Natural Language Supervision
1:16:59
[Detailed Paper Reading] Zipformer: A faster and better encoder for automatic speech recognition
39:12