GGUF
Inference Endpoints
gemma / README.md
van-qa's picture
Update README.md
ae7c019 verified
metadata
license: gemma

Overview

The Gemma, state-of-the-art open model trained with the Gemma datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties. The model belongs to the Gemma family with the 4B, 7B version in two variants 8K and 128K which is the context length (in tokens) that it can support.

Variants

No Variant Cortex CLI command
1 7b-gguf cortex run gemma:7b-gguf
2 7b-onnx cortex run gemma:7b-onnx

Use it with Jan (UI)

  1. Install Jan using Quickstart
  2. Use in Jan model Hub:
    cortexso/gemma
    

Use it with Cortex (CLI)

  1. Install Cortex using Quickstart
  2. Run the model with command:
    cortex run gemma
    

Credits