Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
267.6
TFLOPS
46
10
130
Daniel Han-Chen
danielhanchen
Follow
mehakk-lunkar's profile picture
21world's profile picture
Orenguteng's profile picture
187 followers
·
110 following
https://unsloth.ai/
danielhanchen
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 hours ago
unsloth/Llama-3.1-Tulu-3-70B-GGUF
updated
a model
about 2 hours ago
unsloth/Llama-3.1-Tulu-3-70B-bnb-4bit
updated
a model
about 4 hours ago
unsloth/Llama-3.1-Tulu-3-70B
View all activity
Articles
Faster fine-tuning using TRL & Unsloth
Jan 10
•
37
Organizations
danielhanchen
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
unsloth/gemma-2-27b-it-bnb-4bit
2 months ago
Aphrodite/VLLM/SGLang all refuse to load this model
2
#5 opened 2 months ago by
fullstack
New activity in
unsloth/gemma-7b-bnb-4bit
2 months ago
No module named 'triton'
1
#3 opened 2 months ago by
NeelM0906
New activity in
unsloth/Hermes-3-Llama-3.1-8B-bnb-4bit
3 months ago
update base_model
#1 opened 3 months ago by
davanstrien
New activity in
unsloth/mistral-7b-instruct-v0.3
3 months ago
ValueError: The following `model_kwargs` are not used by the model: ['num_logits_to_keep'] (note: typos in the generate arguments will also show up in this list)
2
#1 opened 3 months ago by
NeelM0906
New activity in
unsloth/Phi-3-mini-4k-instruct-v0-bnb-4bit
3 months ago
Cant use the tokenizer using Unsloth Fastmodel
2
#2 opened 3 months ago by
aryarishit
New activity in
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
4 months ago
RuntimeError: Unsloth: `unsloth/Meta-Llama-3.1-8B-bnb-4bit` is not a base model or a PEFT model.
6
#3 opened 4 months ago by
yorickdejong
New activity in
unsloth/Mistral-Nemo-Base-2407
4 months ago
difference
3
#1 opened 4 months ago by
ehartford
New activity in
google/gemma-2-9b-it
4 months ago
9B - query_pre_attn_scalar = 256 not 224
#26 opened 4 months ago by
danielhanchen
New activity in
google/gemma-2-9b
4 months ago
9B - query_pre_attn_scalar = 256 not 224
#22 opened 4 months ago by
danielhanchen
New activity in
unsloth/llama-3-8b
6 months ago
is this the llama-3-8b model clone?
13
#1 opened 7 months ago by
malhajar
New activity in
unsloth/gemma-2b-bnb-4bit
6 months ago
Model seems to be not PEFT model
1
#1 opened 6 months ago by
neuralresearcher
New activity in
unsloth/mistral-7b-v0.2-bnb-4bit
6 months ago
full disk on colab
2
#2 opened 6 months ago by
Dav22
New activity in
unsloth/Phi-3-mini-4k-instruct-bnb-4bit
6 months ago
TGI - RuntimeError: mat1 and mat2 shapes cannot be multiplied (4145x3072 and 1x14155776)
4
#3 opened 6 months ago by
turjo4nis
New activity in
unsloth/llama-3-8b-bnb-4bit
6 months ago
34 hour for file tunning ?
4
#7 opened 6 months ago by
dad1909
New activity in
unsloth/llama-3-70b-Instruct-bnb-4bit
6 months ago
Update config.json
#1 opened 6 months ago by
huseink
New activity in
unsloth/llama-3-8b-Instruct
6 months ago
Update config.json
2
#3 opened 6 months ago by
huseink
New activity in
unsloth/llama-3-8b-Instruct-bnb-4bit
6 months ago
Update config.json
1
#2 opened 6 months ago by
huseink
New activity in
unsloth/Phi-3-mini-4k-instruct-bnb-4bit
6 months ago
No package metadata was found for bitsandbytes
1
#1 opened 7 months ago by
halilbabacan
New activity in
unsloth/llama-3-8b-Instruct-bnb-4bit
6 months ago
BitsAndBytesConfig error
3
#1 opened 7 months ago by
vdavidr
New activity in
unsloth/llama-3-8b-bnb-4bit
6 months ago
Error: pull model manifest: file does not exist
2
#6 opened 6 months ago by
wesleyhk
Load more