Catherine Arnett

catherinearnett

AI & ML interests

multilingual NLP, tokenization

Recent Activity

updated a model 30 days ago
PleIAs/Pleias-Nano
updated a model 30 days ago
PleIAs/Pleias-1.2b-Preview
View all activity

Articles

Organizations

Blog-explorers's profile picture Language and Cognition Lab (UCSD)'s profile picture PleIAs's profile picture

catherinearnett's activity

upvoted an article 29 days ago
upvoted an article about 2 months ago
view article
Article

Releasing the largest multilingual open pretraining dataset

98