Ming Kong

hitalex

AI & ML interests

None yet

Recent Activity

liked a Space 27 days ago

nanotron/ultrascale-playbook

upvoted an article about 1 month ago

Introducing smolagents: simple agents that write actions in code.

upvoted an article about 1 month ago

Open-source DeepResearch – Freeing our search agents

Organizations

None yet

hitalex's activity

liked a Space 27 days ago

2.29k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLMs on large GPU clusters

upvoted 2 articles about 1 month ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 884

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.17k

liked a dataset about 1 month ago

AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 3.85k • 120

liked a model 2 months ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 15 days ago • 266k • 1.05k

liked a Space 2 months ago

106

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

upvoted a collection 4 months ago

OpenCulture

Collection

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 124

liked a dataset 5 months ago

princeton-nlp/TextbookChapters

Viewer • Updated Jan 16 • 77.9k • 136 • 9

liked a model 5 months ago

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6, 2024 • 302k • 1.49k

liked a dataset 6 months ago

argilla/FinePersonas-v0.1

Viewer • Updated Dec 11, 2024 • 42.1M • 2.66k • 399

liked 2 models 7 months ago

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated 11 days ago • 43.8k • 556

internlm/internlm2_5-7b

Text Generation • Updated 6 days ago • 4.55k • 17

liked a Space 10 months ago

883

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a model 11 months ago

gradientai/Llama-3-8B-Instruct-Gradient-1048k

Text Generation • Updated Oct 29, 2024 • 5.17k • 682

liked a model 12 months ago

databricks/dbrx-base

Text Generation • Updated Apr 19, 2024 • 24 • 556

liked a model about 1 year ago

Qwen/Qwen1.5-0.5B

Text Generation • Updated Apr 5, 2024 • 101k • 154

liked 2 datasets about 1 year ago

yaofu/slimpajama-per-source-length-upsample

Viewer • Updated Feb 15, 2024 • 84.7k • 523 • 17

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 1.53k • 719