Edit model card

MoMonir/gte-Qwen1.5-7B-instruct-GGUF

This model was converted to GGUF format from Alibaba-NLP/gte-Qwen1.5-7B-instruct using llama.cpp
Refer to the original model card for more details on the model.

Note: This is an Embedding Model

For more information about Embedding check OpenAI Embedding Document

Downloads last month
77
GGUF
Model size
7.72B params
Architecture
qwen2
Inference API
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.