Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗Datasets: NLP, Multimodal data processing and sharing

Articles

Organizations

lhoestq's activity

New activity in openslr/librispeech_asr about 19 hours ago

Enable Dataset Viewer

1
#6 opened about 22 hours ago by sanchit-gandhi
New activity in mteb/neuclir-2023 about 19 hours ago

Convert dataset to Parquet

1
#1 opened about 19 hours ago by lhoestq
New activity in mteb/neuclir-2022 about 19 hours ago

Convert dataset to Parquet

1
#1 opened about 19 hours ago by lhoestq
New activity in mteb/amazon_counterfactual about 19 hours ago

Convert dataset to Parquet

#2 opened about 19 hours ago by lhoestq
New activity in hf-internal-testing/fill10 about 19 hours ago

Convert dataset to Parquet

#1 opened about 19 hours ago by lhoestq
New activity in google-research-datasets/conceptual_captions about 20 hours ago

Upload dataset + remove script

1
#3 opened about 20 hours ago by lhoestq
New activity in apple/DataCompDR-1B 5 days ago
New activity in mwalmsley/gz_desi 22 days ago
New activity in imageomics/rare-species 27 days ago
New activity in common-canvas/commoncatalog-cc-by-sa 29 days ago

Maximum queue size reached

6
#1 opened about 1 month ago by alfredplpl
New activity in monology/pile-uncopyrighted about 1 month ago

Streaming broken for Pile

3
#5 opened about 1 month ago by Dahoas
New activity in TIGER-Lab/MMLU-Pro about 1 month ago

Fix the Dataset Viewer

1
#10 opened about 1 month ago by lhoestq
New activity in m-a-p/Matrix about 1 month ago
New activity in bigainlco/LooGLE about 1 month ago
New activity in ivrit-ai/jpress-demo about 1 month ago
New activity in arnaudstiegler/synthetic_us_passports_hard about 1 month ago
New activity in naver-clova-ix/cord-v2 about 1 month ago

Add image-to-text task tag

#11 opened about 1 month ago by lhoestq
New activity in tbone5563/tar_images about 1 month ago
New activity in lhoestq/presidio-dataset-scanner about 2 months ago

Update app.py

#2 opened about 2 months ago by lhoestq

Update app.py

#1 opened about 2 months ago by lhoestq
New activity in PleIAs/Post-OCR-Correction about 2 months ago

Configure the Dataset Viewer

#3 opened about 2 months ago by lhoestq
New activity in bop-benchmark/datasets about 2 months ago
New activity in amanrangapur/Fin-Fact about 2 months ago
New activity in nroggendorff/nebulae about 2 months ago
New activity in Timbrt/SciOL-CI 2 months ago

Enable the Dataset Viewer

1
#1 opened 2 months ago by lhoestq
New activity in m-a-p/MAP-CC 2 months ago
New activity in LanguageBind/Open-Sora-Plan-v1.0.0 2 months ago

Documentation on how to use

#2 opened 2 months ago by lhoestq
New activity in Bastao/VeraCruz_PT-BR 2 months ago

Update README.md

#17 opened 2 months ago by lhoestq
New activity in chaoyi-wu/PMC-Inline 2 months ago

Dataset Viewer issue

2
#1 opened 8 months ago by wahid028
New activity in lhoestq/LLM_DataGen 2 months ago

Not working

1
#2 opened 2 months ago by mrfakename

Update README.md

1
#1 opened 2 months ago by victor

Update requirements.txt

#4 opened 2 months ago by lhoestq
New activity in ai4privacy/pii-masking-300k 3 months ago
New activity in mozilla-foundation/common_voice_6_1 3 months ago
New activity in Equall/Saul-7B-Base 3 months ago

Easily Accesible?

5
#1 opened 3 months ago by aibasics
New activity in ForzaJuve1/UEFA_Euro_2020_Data 3 months ago
New activity in mlfoundations/datacomp_xlarge 3 months ago

Enable the Dataset Viewer

#1 opened 3 months ago by lhoestq
New activity in bigbio/pubmed_qa 3 months ago
New activity in shachardon/ShareLM 3 months ago
New activity in Wikimedians/wikidata-all 3 months ago
New activity in cornell-movie-review-data/rotten_tomatoes 3 months ago

Convert dataset to Parquet

1
#4 opened 5 months ago by davzoku

Remove dataset script

#6 opened 3 months ago by lhoestq
New activity in WenhaoWang/VidProM 3 months ago

Configure the Dataset Viewer

1
#3 opened 3 months ago by lhoestq
New activity in FreedomIntelligence/ApolloCorpus 3 months ago

Configure the Dataset Viewer

1
#1 opened 3 months ago by lhoestq
New activity in facebook/wiki_dpr 3 months ago

Host data files in the repo

4
#14 opened 4 months ago by mariosasko
New activity in google/wiki40b 3 months ago