LLMCalc / src /v1.6_latency.csv
carbonnnnn's picture
update main data df, minor changes in combined json
54d8c8e
raw
history blame
2.59 kB
model,latency
Phi-3-mini-128k-instruct,0.6615315832127351
Meta-Llama-3.1-70B-Instruct,0.8105055275945272
Meta-Llama-3.1-405B-Instruct-Turbo,0.7886103946545832
mistral-medium-2312,3.3167870515212137
gemma-2-2b-it,0.3139821517919896
Meta-Llama-3-8B-Instruct-hf,2.116447543256806
Qwen1.5-1.8B-Chat,0.5502705450000004
CodeLlama-34b-Instruct-hf,3.851887315425931
Qwen2-72B-Instruct,0.9480584860151365
Meta-Llama-3.1-8B-Instruct,0.20630574840608434
gemma-1.1-7b-it,0.1782953878345513
gemma-2-27b-it,0.9922771009345804
openchat-3.5-1210,0.28049827691029705
codegemma-7b-it,0.30489740508652785
dolphin-2.5-mixtral-8x7b,1.100729847312237
sheep-duck-llama-2-70b-v1.1,5.524607914346896
gpt-4-0613,0.6484411465828742
Meta-Llama-3-70B-Instruct-hf,3.348256158662245
openchat_3.5,0.31728768684620623
Yi-34B-Chat,1.2871676207135385
gpt-4o-2024-05-13,0.5570990732389735
openchat-3.5-0106,0.2920951450556654
Mixtral-8x7B-Instruct-v0.1,0.9392967660636317
gpt-4,0.9930876319528372
Qwen1.5-14B-Chat,0.37289333481152975
mistral-large-2402,0.3967416598893948
claude-3-5-sonnet-20240620,0.879929281888541
gemma-7b-it,0.6112263564356414
gpt-4-0125-preview,1.0418927523113601
gpt-4-1106-preview,0.7767265743542753
gemini-1.5-pro-latest,82.90283880578006
Nous-Hermes-2-Mixtral-8x7B-SFT,0.860892263303385
claude-2.1,1.6836316221022491
gemma-1.1-2b-it,0.11925699461279458
claude-3-opus-20240229,2.955845827917778
command-r,0.18832414914586068
gpt-4o-mini-2024-07-18,0.5342684150088973
sheep-duck-llama-2-13b,2.9462099794520453
SUS-Chat-34B,2.27951476106911
claude-3-haiku-20240307,0.869549739619107
gpt-4o-2024-08-06,0.5169587531894035
Yi-1.5-34B-Chat,9.232551962889337
Qwen2-7B-Instruct,0.3589407217948713
Mixtral-8x22B-Instruct-v0.1,1.0759354563573875
vicuna-13b-v1.5,1.4753938719676492
gemma-2-9b-it,0.3692553324432539
Mistral-7B-Instruct-v0.2,0.7635151196709047
Qwen1.5-32B-Chat,0.5092292557397938
Qwen1.5-0.5B-Chat,0.173469139375476
Starling-LM-7B-beta,1.365002297029707
gemini-1.0-pro,0.6346876567117742
gemini-1.5-flash-latest,37.2042672100488
aya-23-35B,0.5755088395104249
llama-2-70b-chat-hf,4.724659620079609
Yi-1.5-9B-Chat,8.02352422018858
WizardLM-70b-v1.0,3.9249772038834863
Yi-1.5-6B-Chat,6.496741103848931
tulu-2-dpo-70b,7.848597339328551
Qwen1.5-72B-Chat,12.689668927658234
Mistral-7B-Instruct-v0.1,0.2828647550771723
gpt-3.5,0.6829601016193744
aya-23-8B,0.4818848185613409
Qwen1.5-7B-Chat,0.38989076908838965
claude-3-sonnet-20240229,1.419486012822594
vicuna-33b-v1.3,0.8235025152162343
WizardLM-13b-v1.2,3.565436762576296
Mistral-Large-Instruct-2407,1.2444667688634197
command-r-plus,0.3104016019283745