# Project Configuration

## .env File

This file contains the necessary API keys required for the application to function properly. Obtain the API keys from the following sources:

- [OPENAI_API_KEY](https://platform.openai.com/api-keys)
- [PINECONE_API_KEY](https://app.pinecone.io/)
- [AWS_ACCESS_KEY](https://console.aws.amazon.com/)
- [AWS_SECRET_ACCESS_KEY](https://console.aws.amazon.com/)

## config.json

This JSON file holds crucial configuration values for the entire application. Please refer to the documentation before modifying any configurations.

### Pinecone Configuration

- **PINECONE_INDEX_NAME**: The name of the index, the highest-level organizational unit of vector data in Pinecone.
- **PINECONE_VECTOR_DIMENSION**: Dimensionality of the embedding model's vectors.
- **PINECONE_UPSERT_BATCH_LIMIT**: Number of transcript rows inserted into Pinecone Serverless in parallel.
- **PINECONE_TOP_K_RESULTS**: Number of results fetched by Pinecone for a query.
- **PINECONE_DELTA_WINDOW**: Conversation window size fetched for TOP_K results.
- **PINECONE_CLOUD_PROVIDER**: Cloud provider for Pinecone DB.
- **PINECONE_REGION**: Region of the Pinecone Cloud provider.
- **PINECONE_METRIC**: Distance metric used by Pinecone to calculate similarity.
- **PINECONE_NAMESPACE**: Logical separation inside the Pinecone Index.

### Embedding Provider Configuration

- **EMBEDDING_PROVIDER**: Provider of the embedding model for text-to-vector conversion.
- **EMBEDDING_MODEL_NAME**: Name of the embedding model provided by the provider.

### AWS Configuration

- **AWS_INPUT_BUCKET**: Bucket for storing audio files for AWS Transcribe.
- **AWS_OUTPUT_BUCKET**: Bucket collecting transcribed files.
- **AWS_REGION**: AWS region in use.
- **AWS_TRANSCRIBE_JOB_NAME**: Default name for Transcribe job.

### LangChain Configuration

- **LC_LLM_TEMPERATURE**: Temperature value for the Large Language Model.
- **LC_CONV_BUFFER_MEMORY_WINDOW**: Conversation memory window limit. (Future Use)
- **LC_LLM_SUMMARY_MAX_TOKEN_LIMIT**: Maximum tokens allowed for summary in the memory buffer.
- **LC_LLM_MODEL**: Large Language Model used for inference.