Hugging Face
J22
1 follower · 0 following
AI & ML interests
None yet
Recent Activity
new
activity
about 2 months ago
ibm-granite/granite-3.0-3b-a800m-instruct:
Upload tokenizer.json
updated
a model
about 2 months ago
ibm-granite/granite-3.0-3b-a800m-instruct
new
activity
about 2 months ago
facebook/MobileLLM-1B:
a horrible function in `modeling_mobilellm.py`
Organizations
None yet
J22's activity
New activity in
ibm-granite/granite-3.0-3b-a800m-instruct
about 2 months ago
Upload tokenizer.json
1
#1 opened about 2 months ago by
J22
updated
a model
about 2 months ago
ibm-granite/granite-3.0-3b-a800m-instruct
Text Generation • Updated 2 days ago • 4.01k • 16
New activity in
facebook/MobileLLM-1B
about 2 months ago
a horrible function in `modeling_mobilellm.py`
1
#5 opened about 2 months ago by
J22
New activity in
allenai/OLMoE-1B-7B-0924-Instruct
3 months ago
Run this on CPU
#6 opened 3 months ago by
J22
New activity in
openbmb/MiniCPM3-4B
3 months ago
Run on CPU
1
#13 opened 3 months ago by
J22
New activity in
microsoft/Phi-3.5-MoE-instruct
4 months ago
need gguf
18
#4 opened 4 months ago by
windkkk
New activity in
meta-llama/Llama-3.1-8B-Instruct
5 months ago
Best practice for tool calling with meta-llama/Meta-Llama-3.1-8B-Instruct
1
#33 opened 5 months ago by
zzclynn
Run this on CPU and use tool calling
1
#38 opened 5 months ago by
J22
New activity in
AI-MO/NuminaMath-7B-TIR
5 months ago
My alternative quantizations.
5
#5 opened 5 months ago by
ZeroWw
New activity in
mistralai/Mistral-7B-Instruct-v0.3
6 months ago
Tool calling is supported by ChatLLM.cpp
#36 opened 6 months ago by
J22
New activity in
mistralai/Mistral-7B-Instruct-v0.3
7 months ago
can't say hello
1
#9 opened 7 months ago by
J22
no system message?
8
#14 opened 7 months ago by
mclassHF2023
New activity in
microsoft/Phi-3-small-8k-instruct
7 months ago
"small" is so different from "mini" and "medium"
1
#8 opened 7 months ago by
J22
New activity in
nvidia/Llama3-ChatQA-1.5-8B
8 months ago
how to set context in multi-turn QA?
6
#14 opened 8 months ago by
J22
New activity in
microsoft/Phi-3-mini-128k-instruct
8 months ago
clarification on the usage of `short_factor` and `long_factor`?
1
#49 opened 8 months ago by
J22
Continue the discussion: `long_factor` and `short_factor`
2
#32 opened 8 months ago by
J22
New activity in
microsoft/Phi-3-mini-4k-instruct
8 months ago
is the '\n' after `'<|end|>'`?
1
#43 opened 8 months ago by
J22
Is sliding window used or not?
1
#25 opened 8 months ago by
J22
New activity in
microsoft/Phi-3-mini-128k-instruct
8 months ago
`long_factor` is never used?
2
#22 opened 8 months ago by
J22
generate +6 min, +20GB V-ram
2
#17 opened 8 months ago by
NickyNicky