New discussion

Requirements

#52 opened 4 months ago by
sneakybeaky

Adding Evaluation Results

#48 opened about 1 year ago by
leaderboard-pr-bot

Finetuning llama2

#47 opened about 1 year ago by
zuhashaik

Any example of batch inference?

#46 opened about 1 year ago by
PrintScr

How to set max_split_size_mb?

1
#30 opened over 1 year ago by
neo-benjamin

max_position_embeddings = 2048?

1
#29 opened over 1 year ago by
zzzac

Load into 2 GPUs

3
#28 opened over 1 year ago by
sauravm8

Load model into TGI

#27 opened over 1 year ago by
schauppi

Perplexity

#22 opened over 1 year ago by
gsaivinay

70TB with multiple A5000

6
#21 opened over 1 year ago by
nashid

Inference time with TGI

1
#15 opened over 1 year ago by
jacktenyx

Can't launch with TGI

6
#14 opened over 1 year ago by
yekta

text-generation-inference error

7
#5 opened over 1 year ago by
msteele

Output always 0 tokens

11
#4 opened over 1 year ago by
sterogn