JuncaiL/llama-265m
Tags: Text Generation, Transformers, PyTorch, wikipedia, allenai/c4, English, llama_moe, custom_code, arxiv:2305.09781
Commit History
Update README.md (1f3f5eb, verified), JuncaiL committed on Mar 25, 2024
Upload README.md (af43b70, verified), JuncaiL committed on Mar 25, 2024
fix state_dict loading in MoE model (3240d88, verified), JuncaiL committed on Mar 25, 2024
update config.json (0b1dfd4, verified), JuncaiL committed on Mar 25, 2024
upload llama-265m model checkpoint (e567dee, verified), JuncaiL committed on Mar 24, 2024
initial commit (6dda61f, verified), JuncaiL committed on Mar 24, 2024