Luca Soldaini

soldni

AI & ML interests

question answering, information retrieval, scientific document processing

Recent Activity

updated a collection about 10 hours ago
Tulu 3 Datasets
updated a dataset about 10 hours ago
allenai/tulu-3-sft-personas-math
updated a dataset about 10 hours ago
allenai/llama-3.1-tulu-3-70b-preference-mixture

Organizations

Posts 1

view post
Post
release day release day! OLMo 1b + 7b out today 🥳 and 65b coming soon...

With OLMo, we are really focused on advancing the study of LLMs. We release **everything**, from toolkit to create its training dataset (dolma) to training & inference code:

- OLMo paper: https://allenai.org/olmo/olmo-paper.pdf
- OLMo train code: https://github.com/allenai/OLMo
- OLMo eval code: https://github.com/allenai/OLMo-Eval
- OLMo 7b: allenai/OLMo-7B
- OLMo 1b: allenai/OLMo-1B
- Dolma paper: https://allenai.org/olmo/dolma-paper.pdf
- Dolma dataset v1.6: allenai/dolma
- Dolma toolkit v1.0: https://github.com/allenai/dolma