Hugging Face
Ricardo Malagon Jerez
rjmalagon
2 followers · 16 following
AI & ML interests
None yet
Recent Activity
reacted to mkurman's post with 🔥 27 days ago
We built a new small language model SmolLM2-MedIT-Upscale-2B, based on SmolLM2-1.7B-Instruct from Hugging Face. The premise was simple - increasing the vector in attention layers would positively impact the model's capabilities.

What did we prove? In total, not much really, since we don't have the original trained under the same conditions as our upscale. However...
1. We scaled up the model without losing its quality
2. We confirmed that the method we devised works
3. After extremely short fine-tuning, the model achieved much better results in IFEval compared to the original (53.68 vs 64.29) and a higher overall average score in Open LLM Leaderboard (14.75 vs 15.17)

I consider this a big success 😇, since surpassing the original in metrics is often very time-consuming, generates high costs, and doesn't always work out.

Meanwhile, we're moving forward, training SmolLM2 400M Instruct as an upscale of 136M. We're curious about how increasing the base and intermediate vectors will affect the model's quality. We'll compare it to the original and the 360M Instruct version released by Hugging Face.

License: Apache 2.0
https://huggingface.co/meditsolutions/SmolLM2-MedIT-Upscale-2B
reacted to clem's post with 👍 29 days ago
Hugging Face is becoming the best place to share the most viral AI apps with spaces. Kolors Virtual Try-on just crossed 6,000,000 unique visitors & is now the #5 most popular space. Congrats to the Kwai Kolors team! https://huggingface.co/spaces/Kwai-Kolors/Kolors-Virtual-Try-On
Organizations
None yet
rjmalagon's activity
liked 2 models 4 months ago
Danielbrdz/Barcenas-14b-Juridico-Mexicano • Text Generation • Updated Aug 7 • 13 • 1
Danielbrdz/Barcenas-Llama3-8b-ORPO • Text Generation • Updated Apr 29 • 13.9k • 7
liked 3 models 6 months ago
cognitivecomputations/dolphin-2.9.2-qwen2-7b • Text Generation • Updated Jun 18 • 7.27k • 63
mlabonne/Beyonder-4x7B-v3 • Text Generation • Updated Mar 28 • 7.22k • 58
mlabonne/NeuralDaredevil-8B-abliterated • Text Generation • Updated Aug 27 • 20.6k • 166
liked a model 10 months ago
yunconglong/Truthful_DPO_TomGrc_FusionNet_7Bx2_MoE_13B • Text Generation • Updated Feb 28 • 10.5k • 53