Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Azurro
/
APT3-1B-Base
like
14
Text Generation
Transformers
Safetensors
chrisociepa/wikipedia-pl-20230401
Polish
llama
ALLaMo
text-generation-inference
License:
cc-by-nc-4.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
a9dbe5e
APT3-1B-Base
1 contributor
History:
7 commits
chrisociepa
Update README.md
a9dbe5e
verified
6 days ago
.gitattributes
1.52 kB
initial commit
9 months ago
README.md
9.57 kB
Update README.md
6 days ago
allamo_config_ckpt.pt
pickle
Detected Pickle imports (4)
"collections.OrderedDict"
,
"model.AllamoTransformerConfig"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
How to fix it?
3.5 kB
LFS
Upload 9 files
9 months ago
allamo_model_ckpt.pt
4.17 GB
LFS
Upload 9 files
9 months ago
allamo_optimizer_ckpt.pt
8.33 GB
LFS
Upload 9 files
9 months ago
apt3-1b-base-eval.jpg
117 kB
Upload 2 files
9 months ago
apt3-1b-base-train.jpg
179 kB
Upload 2 files
9 months ago
config.json
608 Bytes
Upload 9 files
9 months ago
generation_config.json
111 Bytes
Upload 9 files
9 months ago
model.safetensors
4.17 GB
LFS
Upload 9 files
9 months ago
special_tokens_map.json
96 Bytes
Upload 9 files
9 months ago
tokenizer.json
1.42 MB
Upload 9 files
9 months ago
tokenizer_config.json
281 Bytes
Upload 9 files
9 months ago