meta-llama
/

Llama-3.1-8B-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (32)

BUG Chat template doesn't respect `add_generation_prompt`flag from transformers tokenizer

#44 opened 4 months ago by

How to use the ASR on LLama3.1

#43 opened 4 months ago by

Tokenizer 'apply_chat_template' issue

#42 opened 4 months ago by

Function Calling Evaluation bench Nexus (0-shot)

#41 opened 4 months ago by

Error: json: cannot unmarshal array into Go struct field Params.eos_token_id of type int

#40 opened 4 months ago by

ValueError: Pipeline with tokenizer without pad_token cannot do batching. You can try to set it with `pipe.tokenizer.pad_token_id = model.config.eos_token_id`.

#39 opened 4 months ago by

Run this on CPU and use tool calling

#38 opened 4 months ago by

J22

!!Access Problem

#37 opened 4 months ago by

LLama-3.1-8B generates way to long answers!

#36 opened 4 months ago by

Tokenizer error and/or 'rope_scaling' problem

#35 opened 4 months ago by

Deployment to Inference Endpoints

#34 opened 4 months ago by

Best practice for tool calling with meta-llama/Meta-Llama-3.1-8B-Instruct

#33 opened 4 months ago by

The model often enters infinite generation loops

#32 opened 4 months ago by

unable to load 4-bit quantized varient with llama.cpp

#31 opened 4 months ago by

Garbage output ?

#30 opened 4 months ago by

Question about chat template and fine-tuning

#23 opened 4 months ago by

Issues loading model with ooabooga textgenwebui

#20 opened 4 months ago by

what is the right tokenizer should I use for llama 3.1 8B?

#19 opened 4 months ago by

The sample code on the model card page is not right

#18 opened 4 months ago by

My alternative quantizations.

#16 opened 4 months ago by

ValueError: `rope_scaling` must be a dictionary with two fields

#15 opened 4 months ago by

Independently Benchmarked Humaneval and Evalplus scores

#13 opened 4 months ago by

DO NOT MERGE v2 make sure vllm and transformers work

#12 opened 4 months ago by

DO NOT MERGE test for vllm

#11 opened 4 months ago by

Please do not include original PTH files.

#10 opened 4 months ago by

Utterly based

#9 opened 4 months ago by