LLMLingua

Runtime error

iofu728 commited on Oct 9, 2023

Commit

26923cb

•

1 Parent(s): 9afe260

Feature(LLMLingua): add exaplain

Files changed (1) hide show

app.py CHANGED Viewed

@@ -4,7 +4,9 @@ from llmlingua import PromptCompressor
 llm_lingua = PromptCompressor("lgaalves/gpt2-dolly", device_map="cpu")
 INTRO = """
-# LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
 This is an early demo of the prompt compression method LLMLingua.
 It should be noted that due to limited resources, we only provide the **GPT2-Small** size language model in this demo. Using the **LLaMA2-7B** as a small language model would result in a significant performance improvement, especially at high compression ratios.

 llm_lingua = PromptCompressor("lgaalves/gpt2-dolly", device_map="cpu")
 INTRO = """
+# LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models (EMNLP 2023) [paper]()
+_Huiqiang Jiang, Qianhui Wu, Chin-Yew Lin, Yuqing Yang and Lili Qiu_
 This is an early demo of the prompt compression method LLMLingua.
 It should be noted that due to limited resources, we only provide the **GPT2-Small** size language model in this demo. Using the **LLaMA2-7B** as a small language model would result in a significant performance improvement, especially at high compression ratios.