Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
rinna
/
nekomata-14b
like
19
Follow
rinna Co., Ltd.
88
Text Generation
Transformers
PyTorch
Safetensors
5 datasets
Japanese
English
qwen
custom_code
arxiv:
2309.16609
arxiv:
2404.01657
License:
tongyi-qianwen-license-agreement
Model card
Files
Files and versions
Community
2
Train
Use this model
d1e319f
nekomata-14b
3 contributors
History:
3 commits
tianyuz
upload model
d1e319f
11 months ago
.gitattributes
1.52 kB
initial commit
11 months ago
LICENSE
6.9 kB
first commit
11 months ago
NOTICE
3.85 kB
first commit
11 months ago
README.md
5.45 kB
first commit
11 months ago
cache_autogptq_cuda_256.cpp
8.4 kB
first commit
11 months ago
cache_autogptq_cuda_kernel_256.cu
52 kB
first commit
11 months ago
configuration_qwen.py
2.35 kB
first commit
11 months ago
cpp_kernels.py
1.92 kB
first commit
11 months ago
modeling_qwen.py
55.6 kB
first commit
11 months ago
pytorch_model-00001-of-00003.bin
9.96 GB
LFS
upload model
11 months ago
pytorch_model-00002-of-00003.bin
9.88 GB
LFS
upload model
11 months ago
pytorch_model-00003-of-00003.bin
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
8.49 GB
LFS
upload model
11 months ago
pytorch_model.bin.index.json
24.4 kB
upload model
11 months ago
qwen.tiktoken
2.56 MB
first commit
11 months ago
qwen_generation_utils.py
14.6 kB
first commit
11 months ago
rinna.png
60.3 kB
first commit
11 months ago
tokenization_qwen.py
9.62 kB
first commit
11 months ago
tokenizer_config.json
270 Bytes
first commit
11 months ago