nvidia
/

Llama3-ChatQA-2-70B

Text Generation


root committed on Sep 9, 2024

Commit

7bc6d2d

·

1 Parent(s): 56da4d9

update README

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -20,7 +20,7 @@ We introduce Llama3-ChatQA-2, which bridges the gap between open-source LLMs and
 ## Overview of Benchmark Results
 <!-- Results in [ChatRAG Bench](https://huggingface.co/datasets/nvidia/ChatRAG-Bench) are as follows: -->
 ![Example Image](overview.png)
 <!-- | | ChatQA-2-70B | GPT-4-Turbo-2024-04-09 | Qwen2-72B-Instruct | Llama3.1-70B-Instruct |

 ## Overview of Benchmark Results
 <!-- Results in [ChatRAG Bench](https://huggingface.co/datasets/nvidia/ChatRAG-Bench) are as follows: -->
+We evaluate ChatQA 2 on a short-context RAG benchmark (ChatRAG) (within 4K tokens), long-context tasks from SCROLLS and LongBench (within 32K tokens), and ultra-long-context tasks from InfiniteBench (beyond 100K tokens). Results are shown below.
 ![Example Image](overview.png)
 <!-- | | ChatQA-2-70B | GPT-4-Turbo-2024-04-09 | Qwen2-72B-Instruct | Llama3.1-70B-Instruct |