madhuroopa committed on
Commit
0c9a4dd
·
1 Parent(s): 234534d

added new application files

Files changed (3)
  1. config/.dummy_env +5 -0
  2. config/README.md +46 -0
  3. config/config.json +26 -0
config/.dummy_env ADDED
@@ -0,0 +1,5 @@
+ ## Create a .env file and fill in the values for the following keys
+ OPENAI_API_KEY = ""
+ PINECONE_API_KEY = ""
+ AWS_ACCESS_KEY = ""
+ AWS_SECRET_ACCESS_KEY = ""
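Since `.dummy_env` is only a template, the real `.env` has to be parsed at application startup. Projects typically use `python-dotenv` for this; the stdlib-only sketch below (the `load_env` helper name is ours, not part of this commit) shows the equivalent parsing of `KEY = "value"` lines.

```python
def load_env(text: str) -> dict:
    """Parse KEY = "value" lines from .env-style text, skipping comments and blanks."""
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip().strip('"')
    return env

# Example with the template's keys filled in (dummy values):
sample = '''
## Create a .env file and fill in the values for the following keys
OPENAI_API_KEY = "sk-test"
PINECONE_API_KEY = "pc-test"
'''
print(load_env(sample)["OPENAI_API_KEY"])  # sk-test
```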
config/README.md ADDED
@@ -0,0 +1,46 @@
+ # Project Configuration
+
+ ## .env File
+
+ This file holds the API keys the application needs to run. Obtain them from the following sources:
+
+ - [OPENAI_API_KEY](https://platform.openai.com/api-keys)
+ - [PINECONE_API_KEY](https://app.pinecone.io/)
+ - [AWS_ACCESS_KEY](https://console.aws.amazon.com/)
+ - [AWS_SECRET_ACCESS_KEY](https://console.aws.amazon.com/)
+
+ ## config.json
+
+ This JSON file holds the configuration values for the entire application. Please refer to the documentation before modifying any of them.
+
+ ### Pinecone Configuration
+
+ - **PINECONE_INDEX_NAME**: Name of the index, the highest-level organizational unit of vector data in Pinecone.
+ - **PINECONE_VECTOR_DIMENSION**: Dimensionality of the vectors produced by the embedding model.
+ - **PINECONE_UPSERT_BATCH_LIMIT**: Maximum number of transcript rows upserted to Pinecone Serverless in a single batch.
+ - **PINECONE_TOP_K_RESULTS**: Number of results Pinecone returns for a query.
+ - **PINECONE_DELTA_WINDOW**: Size of the conversation window fetched around each of the top-k results.
+ - **PINECONE_CLOUD_PROVIDER**: Cloud provider hosting the Pinecone DB.
+ - **PINECONE_REGION**: Region of the Pinecone cloud provider.
+ - **PINECONE_METRIC**: Distance metric Pinecone uses to calculate similarity.
+ - **PINECONE_NAMESPACE**: Logical separation inside the Pinecone index.
+
+ ### Embedding Provider Configuration
+
+ - **EMBEDDING_PROVIDER**: Provider of the embedding model used for text-to-vector conversion.
+ - **EMBEDDING_MODEL_NAME**: Name of the embedding model offered by that provider.
+
+ ### AWS Configuration
+
+ - **AWS_INPUT_BUCKET**: Bucket storing audio files for AWS Transcribe.
+ - **AWS_OUTPUT_BUCKET**: Bucket collecting the transcribed files.
+ - **AWS_REGION**: AWS region in use.
+ - **AWS_TRANSCRIBE_JOB_NAME**: Default name for the Transcribe job.
+
+ ### LangChain Configuration
+
+ - **LC_LLM_TEMPERATURE**: Temperature value for the Large Language Model.
+ - **LC_CONV_BUFFER_MEMORY_WINDOW**: Conversation memory window limit (reserved for future use).
+ - **LC_LLM_SUMMARY_MAX_TOKEN_LIMIT**: Maximum tokens allowed for the summary in the memory buffer.
+ - **LC_LLM_MODEL**: Large Language Model used for inference.
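The README enumerates the keys each provider section requires; a small validation helper (hypothetical, not part of this commit) can fail fast at startup when `config.json` is missing any of them:

```python
# Required keys per README section; the grouping below is an assumption
# based on the documented sections, not code shipped in this commit.
REQUIRED_KEYS = {
    "Pinecone": ["PINECONE_INDEX_NAME", "PINECONE_VECTOR_DIMENSION", "PINECONE_METRIC"],
    "Embedding": ["EMBEDDING_PROVIDER", "EMBEDDING_MODEL_NAME"],
    "AWS": ["AWS_INPUT_BUCKET", "AWS_OUTPUT_BUCKET", "AWS_REGION"],
    "LangChain": ["LC_LLM_MODEL", "LC_LLM_TEMPERATURE"],
}

def missing_keys(config: dict) -> dict:
    """Return {section: [absent keys]} for every section with gaps."""
    gaps = {}
    for section, keys in REQUIRED_KEYS.items():
        absent = [k for k in keys if k not in config]
        if absent:
            gaps[section] = absent
    return gaps
```

Calling `missing_keys(config)` right after loading the JSON gives a per-section report instead of a `KeyError` deep inside the application.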
config/config.json ADDED
@@ -0,0 +1,26 @@
+ {
+ "PINECONE_INDEX_NAME": "langchain-retrieval-transcript",
+ "PINECONE_VECTOR_DIMENSION": 3072,
+ "PINECONE_UPSERT_BATCH_LIMIT": 90,
+ "PINECONE_TOP_K_RESULTS": 10,
+ "PINECONE_DELTA_WINDOW": 2,
+ "PINECONE_CLOUD_PROVIDER": "aws",
+ "PINECONE_REGION": "us-west-2",
+ "PINECONE_METRIC": "cosine",
+ "PINECONE_NAMESPACE": "default_namespace",
+
+ "EMBEDDING_PROVIDER": "OpenAI",
+ "EMBEDDING_MODEL_NAME": "text-embedding-3-large",
+
+ "MASTER_JSON_FILENAME": "master_meeting_details",
+
+ "AWS_INPUT_BUCKET": "input-bucket",
+ "AWS_OUTPUT_BUCKET": "output-bucket",
+ "AWS_REGION": "us-east-2",
+ "AWS_TRANSCRIBE_JOB_NAME": "transcribe-job",
+
+ "LC_LLM_TEMPERATURE": 0.01,
+ "LC_CONV_BUFFER_MEMORY_WINDOW": 1,
+ "LC_LLM_SUMMARY_MAX_TOKEN_LIMIT": 650,
+ "LC_LLM_MODEL": "gpt-3.5-turbo"
+ }
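Loading this file at runtime is a one-liner with the standard `json` module. The sketch below embeds an excerpt of the config as a string so it is self-contained; in the application you would read the file from disk (the `config/config.json` path is assumed from this commit's layout).

```python
import json

# In the app this would be:
#   with open("config/config.json") as f:
#       config = json.load(f)
config = json.loads('''{
    "PINECONE_INDEX_NAME": "langchain-retrieval-transcript",
    "PINECONE_VECTOR_DIMENSION": 3072,
    "LC_LLM_TEMPERATURE": 0.01,
    "LC_LLM_MODEL": "gpt-3.5-turbo"
}''')

print(config["PINECONE_VECTOR_DIMENSION"])  # 3072
print(config["LC_LLM_MODEL"])               # gpt-3.5-turbo
```

Note that JSON preserves numeric types, so `PINECONE_VECTOR_DIMENSION` comes back as an `int` and `LC_LLM_TEMPERATURE` as a `float` with no extra parsing.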