metadata
license: apache-2.0
pipeline_tag: text-generation
language:
- fr
- en
- it
- de
- es
tags:
- pretrained
- llama-3
- openllm-france
- mlx
datasets:
- OpenLLM-France/Lucie-Training-Dataset
widget:
- text: |-
    Quelle est la capitale de l'Espagne ? Madrid.
    Quelle est la capitale de la France ?
  example_title: Capital cities in French
  group: 1-shot Question Answering
training_progress:
  num_steps: 756291
  num_tokens: 3131736326144
  context_length: 32000
base_model: OpenLLM-France/Lucie-7B
alexgusevski/Lucie-7B-q6-mlx
The model alexgusevski/Lucie-7B-q6-mlx was converted to MLX format from OpenLLM-France/Lucie-7B using mlx-lm version 0.21.4.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate

# Load the converted weights and the tokenizer.
model, tokenizer = load("alexgusevski/Lucie-7B-q6-mlx")

prompt = "hello"

# Apply the chat template only if the tokenizer defines one.
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
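
Because the model is tagged `pretrained`, the tokenizer may not define a chat template, in which case the prompt is passed through as plain text. The following is a minimal sketch of few-shot prompting in the style of the widget example from the metadata. The helper names `build_few_shot_prompt` and `complete` are hypothetical (they are not part of mlx-lm), and `max_tokens=20` is an arbitrary value chosen for a short answer.

```python
def build_few_shot_prompt(examples, question):
    """Join solved (question, answer) pairs and a final open question, one per line."""
    lines = [f"{q} {a}" for q, a in examples]
    lines.append(question)
    return "\n".join(lines)


def complete(prompt, max_tokens=20):
    """Run a plain-text completion. mlx-lm is imported lazily so that the
    prompt builder above can be used without it installed."""
    from mlx_lm import load, generate

    model, tokenizer = load("alexgusevski/Lucie-7B-q6-mlx")
    return generate(model, tokenizer, prompt=prompt, max_tokens=max_tokens)


# One solved example followed by the question to complete (1-shot).
prompt = build_few_shot_prompt(
    [("Quelle est la capitale de l'Espagne ?", "Madrid.")],
    "Quelle est la capitale de la France ?",
)
# print(complete(prompt))  # uncomment on a machine with mlx-lm installed
```

Keeping the prompt construction separate from the model call makes it easy to add more solved examples without reloading the model.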