Pierre-Carl Langlais

Pclanglais

AI & ML interests

Open data & open LLMs

Recent Activity

updated a dataset about 12 hours ago
PleIAs/post-ocr
updated a dataset 1 day ago
PleIAs/post-ocr

Articles

Organizations

Pclanglais's activity

upvoted an article 9 days ago
view article
Article

Releasing the largest multilingual open pretraining dataset

94
upvoted an article 21 days ago
upvoted an article about 1 month ago
view article
Article

OCR Processing and Text in Image Analysis with Florence-2-base and Qwen2-VL-2B

13
upvoted 2 articles about 2 months ago
upvoted 2 articles 3 months ago
view article
Article

SmolLM - blazingly fast and remarkably powerful

265
view article
Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

26