Argilla
629 subscribers
0:35
Synthetic Data Generator - Build Datasets Using Natural Language
Argilla
3.8K views • 6 months ago
8:02
FineWeb2 collaborative sprint: how to annotate
Argilla
257 views • 6 months ago
51:39
Start a token classification project on the Hugging Face Hub with Argilla, GLiNER and NuExtract LLM
Argilla
737 views • 7 months ago
49:15
Start a text classification project on the Hugging Face Hub with Argilla and SetFit
Argilla
984 views • 8 months ago
44:03
Image projects on Hugging Face: from fine-tuning CLIP models to synthetic image datasets
Argilla
355 views • 9 months ago
0:59
What is distilabel? A brief feature overview.
Argilla
564 views • 10 months ago
31:19
Generating and cleaning a preference dataset for DPO / ORPO with LLMs and distilabel
Argilla
817 views • 10 months ago
45:54
Optimizing RAG Pipelines by fine-tuning custom embedding models on synthetic data with ZenML
Argilla
520 views • 10 months ago
0:32
ZenML a way to streamline your complex projects with ease
Argilla
92 views • 10 months ago
0:32
cosine similarity as proxy for quality of sentence pair data
Argilla
102 views • 10 months ago
0:39
optimizing RAG by choosing the right model
Argilla
92 views • 10 months ago
0:36
model pooling for diverse synthetic data generation
Argilla
61 views • 10 months ago
30:12
Ellamind on synthetic data generation with distilabel for pipelining and LLM finetuning
Argilla
324 views • 11 months ago
4:45
Scaling Synthetic Data Creation with 1 Billion Personas | PersonaHub Dataset Explained
Argilla
1.1K views • 11 months ago
50:02
Javier Alonso on lead optimisation at Idealista
Argilla
125 views • 1 year ago
6:29
Exploring the PRISM Dataset: Conversations, Insights, and Model Performance
Argilla
240 views • 1 year ago
39:06
Ben Burtenshaw on the Argilla 2.0 SDK refactor
Argilla
123 views • 1 year ago
1:02:59
Louis Guitton on NER with Argilla
Argilla
200 views • 1 year ago
54:39
Weights & Biases on Wandbot
Argilla
46 views • 1 year ago
32:18
Datamaran on using Argilla in MLOps workflows for ESG governance
Argilla
120 views • 1 year ago
39:00
Understanding and reproducing DEITA with MantisNLP using distilabel=1.0.0
Argilla
201 views • 1 year ago
38:11
Elad Levi on AutoPrompt and intent-based prompt calibration and prompt engineering
Argilla
686 views • 1 year ago
55:51
Daniel van Strien on the Hugging Face hub and synthetic creation of a DPO dataset for Haiku
Argilla
291 views • 1 year ago
51:10
Seth Levine on the usage of SetFit and BERTopic for unsupervised clustering
Argilla
613 views • 1 year ago
45:40
Red Cross 510 on NLP for good with SetFit for chat message classification
Argilla
359 views • 1 year ago
49:29
Prolific on workload distribution, LLM preference data annotation and Phi2 fine-tune Colab
Argilla
506 views • 1 year ago
1:03:58
Pitching AI to your boss, SLMs vs LLMs and contributing to open source projects
Argilla
104 views • 1 year ago
43:59
Kickstart NLP with synthetic data and running LLMs on Google Colab using vLLM
Argilla
341 views • 1 year ago
36:03
How we cleaned OpenBMB UltraFeedback and Notus
Argilla
220 views • 1 year ago
36:40
An introduction to distilabel for AI feedback and synthetic data generation
Argilla
1.1K views • 1 year ago