A Llama version for Nanbeige/Nanbeige-16B-Base, which could be loaded by LlamaForCausalLM.
Nanbeige-16B is a 16 billion parameter language model developed by Nanbeige LLM Lab. It uses 2.5T Tokens for pre-training. The training data includes a large amount of high-quality internet corpus, various books, code, etc. It has achieved good results on various authoritative evaluation data sets.
- Downloads last month
- 1,787
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.