Commit 335902b · Parent(s): 6c5fa24
Update notes.md
notes.md CHANGED
@@ -1,4 +1,5 @@
-
+# Description:
+In the app, a query is first “related” to chunks of text from the documents in my embedding model pkl file (using cosine similarity). Those results (i.e., the chunks most closely associated with the query) are then fed into a new ChatGPT prompt as “context”, along with the query. ChatGPT then uses context + query to respond. Thus the chain is (a) query my model, and (b) use those results to query ChatGPT. Chunk size is therefore an important variable: large chunks (e.g., 2000 tokens) pass bigger context blocks to ChatGPT, but also use up more of the limited token budget. For ChatGPT the default length is fixed at 2048 tokens, while the maximum can be set at 4096 tokens. The point being: consider what chunk-sizing strategy you want to use for each project you are working on. More on token limits can be found here: https://medium.com/@russkohn/mastering-ai-token-limits-and-memory-ce920630349a
 
 
 # Templates for App:
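
A minimal sketch of the two-step chain the note describes: (a) rank pre-computed chunk embeddings from a pkl file against the query by cosine similarity, then (b) pass the top chunks to ChatGPT as context alongside the query. The file name `embeddings.pkl`, the `{"chunks": [...], "embeddings": [...]}` pkl layout, the embedding model, and the helper names are assumptions for illustration, not the app's actual code.

```python
import pickle

import numpy as np
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment


def top_chunks(query: str, store_path: str = "embeddings.pkl", k: int = 3) -> list[str]:
    """Step (a): return the k chunks whose embeddings are most similar to the query."""
    with open(store_path, "rb") as f:
        # Assumed pkl layout: {"chunks": list[str], "embeddings": 2-D array-like}
        store = pickle.load(f)
    emb = client.embeddings.create(model="text-embedding-ada-002", input=query)
    q = np.array(emb.data[0].embedding)
    m = np.array(store["embeddings"])
    # Cosine similarity: dot products divided by the vectors' L2 norms.
    sims = (m @ q) / (np.linalg.norm(m, axis=1) * np.linalg.norm(q))
    best = np.argsort(sims)[::-1][:k]
    return [store["chunks"][i] for i in best]


def answer(query: str) -> str:
    """Step (b): feed the retrieved chunks to ChatGPT as context along with the query."""
    context = "\n\n".join(top_chunks(query))
    resp = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system", "content": "Answer using only the provided context."},
            {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {query}"},
        ],
        # Completion cap; with large chunks (e.g., 2000 tokens each), fewer fit
        # before the model's total token limit is exhausted.
        max_tokens=1024,
    )
    return resp.choices[0].message.content
```

This is where the chunk-sizing trade-off shows up concretely: `k` chunks times the chunk size, plus the prompt and `max_tokens`, all have to fit inside the model's total token limit.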