OLMo 2 Collection Artifacts for the second set of OLMo models. • 26 items • Updated about 6 hours ago • 86
DeepHermes Collection Preview models of the hybrid-reasoner Hermes series • 6 items • Updated about 6 hours ago • 13
Jamba 1.6 Collection The AI21 Jamba family comprises hybrid SSM-Transformer foundation models that outperform open-model competitors on quality and speed. • 2 items • Updated 7 days ago • 11
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 9 days ago • 63
Phi-4 Collection Phi-4 family of small language and multi-modal models. • 7 items • Updated 10 days ago • 109
Foundation Text-Generation Models Below 360M Parameters Collection Strong candidates for fine-tuning targeting Wllama and Transformers.js on mobile devices, ordered by parameter count (see the Transformers.js sketch after this list). • 34 items • Updated 4 days ago • 28
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated about 10 hours ago • 55
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 21 days ago • 97
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 24 days ago • 30
Hamanasu Collection A brand-new series of models from yours truly, designed for intelligence, creativity, and roleplay. • 16 items • Updated 3 days ago • 5
OLMoE (January 2025) Collection Improved OLMoE for the iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated about 7 hours ago • 9
SFTvsRL Models & Data Collection This collection contains four initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and the data needed for V-IRL training. • 7 items • Updated about 17 hours ago • 8
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 108
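The "Foundation Text-Generation Models Below 360M Parameters" collection above targets Wllama and Transformers.js. As a point of reference, here is a minimal Transformers.js sketch for running a model of that size in the browser or Node. The `pipeline` import follows the published Transformers.js API; the model id `Xenova/gpt2` is an illustrative stand-in (any small model with ONNX weights on the Hub works), not a prescribed pick from the collection.

```typescript
// Minimal sketch: text generation with a sub-360M-parameter model via Transformers.js.
// Assumes the @huggingface/transformers package (Transformers.js v3) and a model repo
// that ships ONNX weights; Xenova/gpt2 is used here purely as an illustrative stand-in.
import { pipeline } from "@huggingface/transformers";

async function main() {
  // Download (and cache) the model, then build a text-generation pipeline.
  const generator = await pipeline("text-generation", "Xenova/gpt2");

  // Generate a short continuation for a prompt.
  const output = await generator("Small on-device language models are useful because", {
    max_new_tokens: 40,
  });

  // The pipeline returns an array of { generated_text } objects.
  console.log(output);
}

main();
```

Wllama takes a different route (GGUF weights run through llama.cpp compiled to WebAssembly), so the same models would need GGUF conversions to be served that way.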