parambharat committed · Commit 85bfd70 · Parent: 049ff35

chore: fix citations and response format

rag/rag.py CHANGED (+14 -11)
@@ -29,11 +29,7 @@ Here are the relevant snippets from the Llama 3 405B model research paper:
 {context_str}
 </snippets>
 
-<question>
-{query_str}
-</question>
-
-To answer this question:
+To answer the question:
 
 1. Carefully read and analyze the provided snippets.
 2. Identify information that is directly relevant to the user's question.
@@ -50,11 +46,14 @@ Guidelines for your answer:
 6. Cite the relevant sentences from the snippets and their page numbers to support your answer.
 7. Answer in MFAQ format (Minimal Facts Answerable Question), providing the most concise and accurate response possible.
 8. Use Markdown to format your response and include citations to indicate the snippets and the page number used to derive your answer.
+9. Your answer must only have two headings: 'Answer' and 'Citations'.
 
 Here's an example of a question and an answer. You must use this as a template to format your response:
 
 <example>
-
+<question>
+What was the main mix of the training data ? How much data was used to train the model ?
+</question>
 
 ### Answer
 The main mix of the training data for the Llama 3 405 billion parameter model is as follows:
@@ -66,16 +65,20 @@ The main mix of the training data for the Llama 3 405 billion parameter model is
 
 Regarding the amount of data used to train the model, the snippets do not provide a specific total volume of data in terms of tokens or bytes. However, they do mention that the model was pre-trained on a large dataset containing knowledge until the end of 2023[^2^]. Additionally, the training process involved pre-training on 2.87 trillion tokens before further adjustments[^3^].
 
-###
+### Citations
 
-[^1^]: "Scaling Laws for Data Mix," page 6.
-[^2^]: "Pre-Training Data," page 4.
-[^3^]: "Initial Pre-Training," page 14.
+- [^1^]: "Scaling Laws for Data Mix," page 6.
+- [^2^]: "Pre-Training Data," page 4.
+- [^3^]: "Initial Pre-Training," page 14.
 
 </example>
 
 Remember, your role is to accurately convey the information from the research paper snippets, not to speculate or provide information from other sources.
 
+<question>
+{query_str}
+</question>
+
 Answer:
 """
 
@@ -113,7 +116,7 @@ class SimpleRAGPipeline(weave.Model):
             nodes,
             embed_model=self._get_embedding_model(),
             show_progress=True,
-            insert_batch_size=
+            insert_batch_size=512,
         )
 
         return index
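The net effect of the three prompt hunks is that `{query_str}` now appears exactly once, wrapped in `<question>` tags at the very end of the template, immediately before the `Answer:` cue. Below is a minimal sketch of how such a template gets filled; `SYSTEM_PROMPT_TEMPLATE` and `build_prompt` are illustrative names, not identifiers from rag/rag.py, and the template body is abridged.

```python
# Illustrative only: rag/rag.py may fill its prompt differently
# (e.g. via a llama_index PromptTemplate rather than str.format).
SYSTEM_PROMPT_TEMPLATE = """\
Here are the relevant snippets from the Llama 3 405B model research paper:

<snippets>
{context_str}
</snippets>

To answer the question:
[... guidelines and the worked example, elided ...]

<question>
{query_str}
</question>

Answer:
"""

def build_prompt(snippets: str, question: str) -> str:
    # Both placeholders are plain str.format fields, so filling the
    # template is a single call.
    return SYSTEM_PROMPT_TEMPLATE.format(context_str=snippets, query_str=question)
```

Placing the question after the long instruction block, right next to `Answer:`, is a common way to keep it from being buried by a large retrieved context.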
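New guideline 9 pins the response to exactly two headings, which is what makes the `### Citations` section in the example machine-checkable downstream. A sketch of how a caller could split such a response; `split_response` is a hypothetical helper, not a function in rag/rag.py.

```python
import re

def split_response(markdown: str) -> tuple[str, str]:
    """Split a model response into its 'Answer' and 'Citations' sections.

    Assumes the model followed guideline 9 and emitted exactly the two
    '###' headings shown in the prompt's example.
    """
    match = re.search(
        r"### Answer\s*(?P<answer>.*?)### Citations\s*(?P<citations>.*)",
        markdown,
        flags=re.DOTALL,
    )
    if match is None:
        raise ValueError("response did not follow the two-heading format")
    return match.group("answer").strip(), match.group("citations").strip()
```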
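The final hunk pins `insert_batch_size` to 512 in what the context lines suggest is a LlamaIndex `VectorStoreIndex` construction (the removed line appears truncated in this view, so the old value is not recoverable). A sketch of the surrounding call under that assumption, using the llama_index >= 0.10 import path; `build_index` is a standalone stand-in for the `SimpleRAGPipeline` method this hunk edits.

```python
from llama_index.core import VectorStoreIndex
from llama_index.core.embeddings import BaseEmbedding
from llama_index.core.schema import BaseNode

def build_index(nodes: list[BaseNode], embed_model: BaseEmbedding) -> VectorStoreIndex:
    # Standalone variant of the call in the diff; in rag/rag.py the embed
    # model comes from self._get_embedding_model().
    return VectorStoreIndex(
        nodes,
        embed_model=embed_model,
        show_progress=True,
        # Nodes are embedded and inserted in batches of 512; llama_index
        # defaults to 2048 when the argument is omitted.
        insert_batch_size=512,
    )
```

Smaller batches mean more round trips to the embedding backend but smaller per-request payloads, which can matter when a hosted embedding API caps request size.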