Channel - Efficient NLP

Residual Vector Quantization for Audio and Speech Embeddings

Efficient NLP
1.7K views • 3 weeks ago

Introducing Voice Writer

Efficient NLP
559 views • 2 months ago

Can Whisper be used for real-time streaming ASR?

Efficient NLP
2.8K views • 2 months ago

Top 10 most cited and influential papers in the history of NLP

Efficient NLP
977 views • 3 months ago

Fine-tuning Whisper to learn my Chinese dialect (Teochew)

Efficient NLP
4.2K views • 4 months ago

A better Hugging Face model search with OpenAI, RAG, pgvector

Efficient NLP
1.1K views • 7 months ago

Speculative Decoding: When Two LLMs are Faster than One

Efficient NLP
8.4K views • 8 months ago

Exploring the 24 Areas of Natural Language Processing Research

Efficient NLP
1.9K views • 9 months ago

Rotary Positional Embeddings: Combining Absolute and Relative

Efficient NLP
24K views • 10 months ago

The KV Cache: Memory Usage in Transformers

Efficient NLP
29K views • 10 months ago

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Efficient NLP
13K views • 11 months ago

How is Beam Search Really Implemented?

Efficient NLP
9.3K views • 1 year ago

Non-Autoregressive and Shallow Decoding: Speeding up Translation

Efficient NLP
1.1K views • 1 year ago

Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only models

Efficient NLP
19K views • 1 year ago

End of Videos