Jamba-1.5 Collection • The AI21 Jamba family of state-of-the-art, hybrid SSM-Transformer instruction-following foundation models • 2 items • Updated Aug 22
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion • Paper 2407.01392 • Published Jul 1
Models to Evaluate Collection • Models to evaluate on shadereval-task2 (https://github.com/bigcode-project/bigcode-evaluation-harness/pull/173) at fp16 • 39 items • Updated 11 days ago
Code Evaluation Collection • Papers on evaluating code generated by language models • 45 items • Updated about 1 month ago
Eurus Collection • Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models • Paper 2401.06066 • Published Jan 11
Retentive Network: A Successor to Transformer for Large Language Models • Paper 2307.08621 • Published Jul 17, 2023