wangrui

varuy322

AI & ML interests

None yet

Recent Activity

liked a dataset about 6 hours ago

WebOrganizer/Corpus-200B

liked a model about 6 hours ago

facebook/MobileLLM-125M

liked a dataset about 7 hours ago

microsoft/orca-agentinstruct-1M-v1

Organizations

None yet

varuy322's activity

upvoted a collection 1 day ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 14 days ago • 244

upvoted a paper 27 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 30 days ago • 198

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28 • 789

upvoted an article about 2 months ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Jan 3 • 36

upvoted a paper 2 months ago

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published Dec 21, 2024 • 22

upvoted a collection 2 months ago

Agents

94 items • Updated 12 days ago • 3

upvoted a paper 4 months ago

Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 50

upvoted an article 5 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31, 2024 • 58

upvoted 2 papers 5 months ago

Erasing Conceptual Knowledge from Language Models

Paper • 2410.02760 • Published Oct 3, 2024 • 14

LML: Language Model Learning a Dataset for Data-Augmented Prediction

Paper • 2409.18957 • Published Sep 27, 2024 • 10

upvoted 2 collections 5 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 574

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 119

upvoted an article 6 months ago

Article

An Introduction to Deep Reinforcement Learning

May 4, 2022 • 3

upvoted an article 8 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024 • 327

upvoted a paper 8 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 93

upvoted an article 8 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024 • 80

upvoted 3 collections 9 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 359

LLMs

412 items • Updated about 5 hours ago • 30

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 37

upvoted an article 10 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19, 2024 • 137