Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
57
258
Dokyoon
leeloolee
Follow
kaki-paper's profile picture
ddobokki's profile picture
victor's profile picture
9 followers
Ā·
20 following
Eruly
AI & ML interests
ai
Recent Activity
upvoted
a
paper
9 days ago
GUI Agents: A Survey
reacted
to
m-ric
's
post
with š
15 days ago
šš®š š š¢š§š š ššš š«šš„ššš¬šš¬ šš¢ššØšš«šØš§, š š¦š¢šš«šØš¬ššØš©š¢š š„š¢š šš”šš š¬šØš„šÆšš¬ ššš šš«šš¢š§š¢š§š šš š©šš«šš„š„šš„š¢š³ššš¢šØš§ š„³ š°ļø Llama-3.1-405B took 39 million GPU-hours to train, i.e. about 4.5 thousand years. š“š» If they had needed all this time, we would have GPU stories from the time of Pharaoh š: "Alas, Lord of Two Lands, the shipment of counting-stones arriving from Cathay was lost to pirates, this shall delay the building of your computing temple by many moons " š ļø But instead, they just parallelized the training on 24k H100s, which made it take just a few months. This required parallelizing across 4 dimensions: data, tensor, context, pipeline. And it is infamously hard to do, making for bloated code repos that hold together only by magic. š¤ ššš š»š¼š šš² š±š¼š»'š š»š²š²š± šµšš“š² šæš²š½š¼š š®š»ššŗš¼šæš²! Instead of building mega-training codes, Hugging Face colleagues cooked in the other direction, towards tiny 4D parallelism libs. A team has built Nanotron, already widely used in industry. And now a team releases Picotron, a radical approach to code 4D Parallelism in just a few hundred lines of code, a real engineering prowess, making it much easier to understand what's actually happening! ā” šš'š šš¶š»š, šš²š š½š¼šš²šæš³šš¹: Counting in MFU (Model FLOPs Utilization, how much the model actually uses all the compute potential), this lib reaches ~50% on SmolLM-1.7B model with 8 H100 GPUs, which is really close to what huge libs would reach. (Caution: the team is leading further benchmarks to verify this) Go take a look š https://github.com/huggingface/picotron/tree/main/picotron
reacted
to
alimotahharynia
's
post
with š„
15 days ago
Here's the space for our new article that leverages LLMs with reinforcement learning to design high-quality small molecules. Check it out at https://huggingface.co/spaces/alimotahharynia/GPT-2-Drug-Generator. You can also access the article here: https://arxiv.org/abs/2411.14157. I would be happy to receive your feedback.
View all activity
Organizations
leeloolee
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
16 days ago
echo840/OCRBench
Viewer
ā¢
Updated
16 days ago
ā¢
1k
ā¢
5.2k
ā¢
11
liked
a model
17 days ago
U4R/StructTable-InternVL2-1B
Image-to-Text
ā¢
Updated
22 days ago
ā¢
1.12k
ā¢
28
liked
a model
18 days ago
google/Gemma-Embeddings-v1.0
Updated
18 days ago
ā¢
654
ā¢
111
liked
a model
23 days ago
TIGER-Lab/VLM2Vec-Full
Text Generation
ā¢
Updated
14 days ago
ā¢
22.6k
ā¢
21
liked
a Space
24 days ago
Running
77
š»
Vision Papers
All paper summaries read by Merve
liked
a dataset
about 1 month ago
NCSOFT/K-SEED
Viewer
ā¢
Updated
28 days ago
ā¢
2.97k
ā¢
225
ā¢
14
liked
2 Spaces
about 1 month ago
Running
97
š„
Vidore Leaderboard
Running
640
š
PR Puppet Sora
liked
a model
about 1 month ago
zjunlp/HalDet-llava-7b
Text Generation
ā¢
Updated
Apr 24, 2024
ā¢
27
ā¢
2
liked
5 datasets
2 months ago
SakanaAI/JA-VG-VQA-500
Viewer
ā¢
Updated
May 14, 2024
ā¢
1.5k
ā¢
214
ā¢
13
jahyungu/llava_instruct
Viewer
ā¢
Updated
Oct 5, 2024
ā¢
238k
ā¢
6
ā¢
2
jxu124/llava_complex_reasoning_77k
Viewer
ā¢
Updated
May 20, 2023
ā¢
76.6k
ā¢
36
ā¢
10
sujet-ai/Sujet-Finance-Instruct-177k
Viewer
ā¢
Updated
Apr 5, 2024
ā¢
178k
ā¢
127
ā¢
71
HuggingFaceM4/LLaVAR-Instruct-16K
Viewer
ā¢
Updated
Jul 28, 2023
ā¢
15.5k
ā¢
37
ā¢
17
liked
3 datasets
3 months ago
YiyangAiLab/POVID_preference_data_for_VLLMs
Viewer
ā¢
Updated
Apr 1, 2024
ā¢
17.2k
ā¢
44
ā¢
7
Salesforce/blip3-grounding-50m
Viewer
ā¢
Updated
Sep 19, 2024
ā¢
52.4M
ā¢
902
ā¢
20
nvidia/HelpSteer2
Viewer
ā¢
Updated
16 days ago
ā¢
21.4k
ā¢
15.2k
ā¢
392
liked
a model
3 months ago
google/gemma-2-2b-jpn-it
Text Generation
ā¢
Updated
Oct 2, 2024
ā¢
24.6k
ā¢
149
liked
2 datasets
3 months ago
HuggingFaceH4/10k_prompts_ranked
Viewer
ā¢
Updated
Sep 30, 2024
ā¢
10.3k
ā¢
46
ā¢
3
HuggingFaceH4/llava-instruct-mix-vsft
Viewer
ā¢
Updated
Apr 11, 2024
ā¢
273k
ā¢
1.06k
ā¢
36
Load more