Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗Datasets: NLP, Multimodal data processing and sharing

Articles

Organizations

lhoestq's activity

New activity in datasets-maintainers/test-smart-update 24 minutes ago
New activity in Cnam-LMSSC/vibravox about 4 hours ago

Clean refs/convert/duckdb

6
#4 opened 15 days ago by zinc75
New activity in nyu-visionx/Cambrian-Alignment about 5 hours ago
New activity in shareAI/CodeChat 2 days ago
New activity in HuggingFaceFW/fineweb 8 days ago
New activity in openslr/librispeech_asr 9 days ago

Enable Dataset Viewer

1
#6 opened 9 days ago by sanchit-gandhi
New activity in mteb/neuclir-2023 9 days ago

Convert dataset to Parquet

1
#1 opened 9 days ago by lhoestq
New activity in mteb/neuclir-2022 9 days ago

Convert dataset to Parquet

1
#1 opened 9 days ago by lhoestq
New activity in mteb/amazon_counterfactual 9 days ago

Convert dataset to Parquet

#2 opened 9 days ago by lhoestq
New activity in hf-internal-testing/fill10 9 days ago

Convert dataset to Parquet

#1 opened 9 days ago by lhoestq
New activity in apple/DataCompDR-1B 13 days ago
New activity in mwalmsley/gz_desi about 1 month ago
New activity in imageomics/rare-species about 1 month ago
New activity in common-canvas/commoncatalog-cc-by-sa about 1 month ago

Maximum queue size reached

6
#1 opened about 1 month ago by alfredplpl
New activity in monology/pile-uncopyrighted about 1 month ago

Streaming broken for Pile

4
#5 opened about 1 month ago by Dahoas
New activity in TIGER-Lab/MMLU-Pro about 1 month ago

Fix the Dataset Viewer

1
#10 opened about 1 month ago by lhoestq
New activity in m-a-p/Matrix about 1 month ago
New activity in bigai-nlco/LooGLE about 2 months ago
New activity in ivrit-ai/jpress-demo about 2 months ago
New activity in arnaudstiegler/synthetic_us_passports_hard about 2 months ago
New activity in naver-clova-ix/cord-v2 about 2 months ago

Add image-to-text task tag

#11 opened about 2 months ago by lhoestq
New activity in tbone5563/tar_images about 2 months ago
New activity in lhoestq/presidio-dataset-scanner about 2 months ago

Update app.py

#2 opened about 2 months ago by lhoestq

Update app.py

#1 opened about 2 months ago by lhoestq
New activity in PleIAs/Post-OCR-Correction 2 months ago

Configure the Dataset Viewer

#3 opened 2 months ago by lhoestq
New activity in bop-benchmark/datasets 2 months ago
New activity in nroggendorff/nebulae 2 months ago
New activity in Timbrt/SciOL-CI 2 months ago

Enable the Dataset Viewer

1
#1 opened 2 months ago by lhoestq
New activity in m-a-p/MAP-CC 2 months ago
New activity in LanguageBind/Open-Sora-Plan-v1.0.0 2 months ago

Documentation on how to use

#2 opened 2 months ago by lhoestq
New activity in Bastao/VeraCruz_PT-BR 3 months ago

Update README.md

#17 opened 3 months ago by lhoestq
New activity in chaoyi-wu/PMC-Inline 3 months ago

Dataset Viewer issue

2
#1 opened 8 months ago by wahid028
New activity in lhoestq/LLM_DataGen 3 months ago

Not working

1
#2 opened 3 months ago by mrfakename

Update README.md

1
#1 opened 3 months ago by victor

Update requirements.txt

#4 opened 3 months ago by lhoestq
New activity in ai4privacy/pii-masking-300k 3 months ago
New activity in mozilla-foundation/common_voice_6_1 3 months ago
New activity in Equall/Saul-7B-Base 3 months ago

Easily Accesible?

5
#1 opened 3 months ago by aibasics
New activity in ForzaJuve1/UEFA_Euro_2020_Data 3 months ago
New activity in mlfoundations/datacomp_xlarge 3 months ago

Enable the Dataset Viewer

#1 opened 3 months ago by lhoestq
New activity in bigbio/pubmed_qa 3 months ago
New activity in shachardon/ShareLM 3 months ago