Andrea Soria

asoria

AI & ML interests

Maintainer of 🤗 Datasets: Data processing

asoria's activity

upvoted an article 16 days ago

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
upvoted 2 articles about 1 month ago

Synthetic dataset generation techniques: generating custom sentence similarity data

12

Synthetic data: save money, time and carbon with open source

32
upvoted an article about 2 months ago

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

63
upvoted an article 2 months ago

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

22
upvoted 2 articles 3 months ago

It's raining diffusion personalization techniques☔️🎭🖼️

By linoyts
16

DuckDB: run SQL queries on 50,000+ datasets on the Hugging Face Hub

3