cheng

zhoujun

BlankCheng

Recent Activity

liked a Space 21 days ago

nanotron/ultrascale-playbook

liked a dataset 25 days ago

agentica-org/DeepScaleR-Preview-Dataset

upvoted a paper about 1 month ago

Fast Video Generation with Sliding Tile Attention

zhoujun's activity

liked a Space 21 days ago

2.24k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 25 days ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 3.75k • 79

liked a model about 1 month ago

Qwen/Qwen2.5-Math-7B-Instruct

Text Generation • Updated Sep 23, 2024 • 88.6k • 63

liked a Space 5 months ago

Decentralized Arena Leaderboard

🥇

Display model leaderboard evaluations

liked a dataset 5 months ago

LLM360/TxT360

Preview • Updated 2 minutes ago • 482k • 224

liked a Space 5 months ago

105

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

liked a dataset 11 months ago

minimario/FOLIO

Viewer • Updated Jan 2, 2024 • 1.21k • 221 • 1

liked a dataset 12 months ago

bigcode/the-stack-v2

Viewer • Updated Apr 23, 2024 • 5.45B • 4.39k • 341

liked 2 models 12 months ago

deepseek-ai/deepseek-coder-7b-instruct-v1.5

Text Generation • Updated Feb 5, 2024 • 13.6k • 128

deepseek-ai/deepseek-coder-1.3b-instruct

Text Generation • Updated Mar 7, 2024 • 67.6k • 116

liked a model over 1 year ago

meta-llama/Llama-2-7b-chat-hf

Text Generation • Updated Apr 17, 2024 • 1.19M • 4.3k

liked a dataset almost 2 years ago

bigcode/ta-prompt

Viewer • Updated May 4, 2023 • 650 • 320 • 197

liked 2 Spaces over 2 years ago

Binder

🔗

238

Code generation with 🤗

✨

Generate code snippets using language models