Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
JuncaiL
/
llama-265m
like
1
Text Generation
Transformers
PyTorch
wikipedia
allenai/c4
English
llama_moe
custom_code
arxiv:
2305.09781
Model card
Files
Files and versions
Community
1
Train
Use this model
main
llama-265m
/
modeling_llama_moe_hf.py
Commit History
fix state_dict loading in MoE model
3240d88
verified
JuncaiL
commited on
Mar 25
upload llama-265m model checkpoint
e567dee
verified
JuncaiL
commited on
Mar 24