microsoft/phi-1
Tags: Text Generation, Transformers, Safetensors, English, phi, code, text-generation-inference, Inference Endpoints
License: mit
phi-1 · 3 contributors · History: 25 commits
Latest commit: gugarosa, Upload README.md (769684a, 10 months ago)
File                                    Size            Last commit (age)
.gitattributes                          1.52 kB         initial commit (10 months ago)
README.md                               7.34 kB         Upload README.md (10 months ago)
Research License.docx                   38.9 kB         Upload Research License.docx (10 months ago)
added_tokens.json                       1.08 kB         Upload tokenizer (10 months ago)
config.json                             705 Bytes       Support for `attention_mask` in forward pass. (10 months ago)
configuration_mixformer_sequential.py   1.86 kB         Support for `attention_mask` in forward pass. (10 months ago)
generation_config.json                  69 Bytes        Update generation_config.json (10 months ago)
merges.txt                              456 kB          Upload tokenizer (10 months ago)
modeling_mixformer_sequential.py        28.7 kB         fix(phi-1): Checks length of `attention_mask`if it is passed as direct tensor. (10 months ago)
pytorch_model.bin                       2.84 GB (LFS)   Upload MixFormerSequentialForCausalLM (10 months ago)
    pickle; detected pickle imports (3): "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.HalfStorage"
special_tokens_map.json                 99 Bytes        Upload tokenizer (10 months ago)
tokenizer.json                          2.11 MB         Upload tokenizer (10 months ago)
tokenizer_config.json                   237 Bytes       Upload tokenizer (10 months ago)
vocab.json                              798 kB          Upload tokenizer (10 months ago)
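
The pickle imports reported for pytorch_model.bin are the module-level names a pickle stream asks Python to import when it is loaded. The sketch below illustrates the general idea using only the standard library; it is not the Hub's scanner, and the function name `list_pickle_imports` is hypothetical. It assumes the input is a raw pickle byte stream (a PyTorch checkpoint may wrap its pickle in another container, which this sketch does not unpack), and it resolves `STACK_GLOBAL` with a simple heuristic that covers ordinary pickles rather than every possible stream.

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the "module.name" globals referenced by a pickle stream.

    The stream is only disassembled, never loaded, so nothing is imported
    or executed. Hypothetical helper written for illustration.
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", space separated.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack.
            # Heuristic: assume they are the two most recent string arguments.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Usage: an OrderedDict pickle references exactly one global.
payload = pickle.dumps(collections.OrderedDict(weight=1.0))
print(sorted(list_pickle_imports(payload)))  # ['collections.OrderedDict']
```

Disassembling instead of unpickling is the point of the design: the names can be inspected without running any code the file might trigger on load.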