13:53
Residual Vector Quantization for Audio and Speech Embeddings
1.6K views • 2 weeks ago
0:30
Introducing Voice Writer
542 views • 2 months ago
8:41
Can Whisper be used for real-time streaming ASR?
2.6K views • 2 months ago
11:04
Top 10 most cited and influential papers in the history of NLP
971 views • 3 months ago
28:10
Fine-tuning Whisper to learn my Chinese dialect (Teochew)
4.2K views • 4 months ago
22:28
A better Hugging Face model search with OpenAI, RAG, pgvector
1.1K views • 7 months ago
12:46
Speculative Decoding: When Two LLMs are Faster than One
8.3K views • 8 months ago
29:56
Exploring the 24 Areas of Natural Language Processing Research
1.8K views • 9 months ago
11:17
Rotary Positional Embeddings: Combining Absolute and Relative
24K views • 10 months ago
8:33
The KV Cache: Memory Usage in Transformers
28K views • 10 months ago
19:46
Quantization vs Pruning vs Distillation: Optimizing NNs for Inference
13K views • 11 months ago
8:15
How is Beam Search Really Implemented?
9.2K views • 1 year ago
8:22
Non-Autoregressive and Shallow Decoding: Speeding up Translation
1.1K views • 1 year ago
7:38
Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only models
18K views • 1 year ago
End of Videos