steerapi/Llama-2-7b-chat-hf-onnx
Tags: Text Generation, Transformers, ONNX, llama, Inference Endpoints
Path: Llama-2-7b-chat-hf-onnx/onnx (revision bd15347)
1 contributor. History: 8 commits
Latest commit: steerapi, "Upload folder using huggingface_hub" (bd15347), over 1 year ago
Name                                       Size       LFS  Last commit                          Updated
.ipynb_checkpoints                         -          -    Upload folder using huggingface_hub  over 1 year ago
q1                                         -          -    Upload folder using huggingface_hub  over 1 year ago
decoder_model.onnx                         5.44 MB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model.onnx_data                    27 GB      LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_fp16.onnx                    3.7 MB     LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_fp16.onnx_data               13.5 GB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged.onnx                  10.9 MB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged.onnx_data             27 GB      LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged_fp16.onnx             6.73 MB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged_fp16.onnx_data        13.5 GB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged_quantized.onnx        12.1 MB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged_quantized.onnx.data   6.74 GB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_with_past_model.onnx               5.47 MB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_with_past_model.onnx_data          27 GB      LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_with_past_model_fp16.onnx          3.75 MB    LFS  Upload folder using huggingface_hub  over 1 year ago
decoder_with_past_model_fp16.onnx_data     13.5 GB    LFS  Upload folder using huggingface_hub  over 1 year ago
quantize_config.json                       993 Bytes  -    Upload folder using huggingface_hub  over 1 year ago