Spaces: Manoj21k/Fin_Analyst

Manoj21k committed 24 days ago

Commit bc6b22b • 1 Parent(s): 20f5da8

Upload 13 files

Files changed (13)

README.md +80 -13
Rule_Based_Sample_Response.png +0 -0
Smart_Chatbot.png +0 -0
Smart_Chatbot2.png +0 -0
app.py +48 -0
data/Financial_data.csv +10 -0
requirements.txt +75 -0
src/__init__.py +0 -0
src/__pycache__/__init__.cpython-311.pyc +0 -0
src/__pycache__/main.cpython-311.pyc +0 -0
src/__pycache__/rule_based.cpython-311.pyc +0 -0
src/main.py +209 -0
src/rule_based.py +13 -0

README.md CHANGED

@@ -1,13 +1,80 @@
----
-title: Fin Analyst
-emoji: 🏆
-colorFrom: yellow
-colorTo: yellow
-sdk: streamlit
-sdk_version: 1.35.0
-app_file: app.py
-pinned: false
-license: apache-2.0
----
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# FinBuddy Project
+## Overview
+FinBuddy is an AI Assistant designed to help users with questions related to financial metrics of Microsoft (MSFT), Tesla (TSLA), and Apple (AAPL). The project uses a combination of rule-based logic and machine learning models to provide answers in different formats such as text responses, tables, and charts. The application is built using Streamlit for the user interface and integrates with LangChain and Groq for the underlying AI functionalities.
+## Project Structure
+The project consists of three main components:
+1. **main.py**: This script handles the core functionalities of the AI assistant, including initializing the language models, creating agents, and defining the conversation handling logic.
+2. **rule_based.py**: This script contains the rule-based logic for providing predefined answers to specific queries.
+3. **app.py**: This script sets up the Streamlit interface and integrates the rule-based and AI-driven responses.
+## File Descriptions
+### `main.py`
+- **Imports and Initialization**: Sets up necessary imports, initializes the Groq client and model, and defines chat history.
+- **Agent Creation**: Creates a CSV agent using LangChain's `create_csv_agent` function to handle queries related to financial metrics.
+- **Functions**:
+  - `convo_agent`: Handles simple conversational queries.
+  - `csv_agent`: Processes financial queries and formats responses as tables, bar charts, or line charts.
+  - `run_conversation`: Manages the flow of the conversation by determining which function to use based on the user query.
+  - `get_response`: Processes the user query, interacts with the agents, and returns the appropriate response.
+  - `write_answer`: Formats and displays the response in Streamlit, including rendering tables and charts.
+### `rule_based.py`
+- **simple_chatbot**: A simple rule-based chatbot that returns predefined answers for exact-match queries about total revenue: MSFT, AAPL, and TSLA for 2023, and AAPL and TSLA for 2022.
+### `app.py`
+- **Streamlit Setup**: Initializes the Streamlit app, sets the title, and displays chat history.
+- **Message Handling**: Captures user input, retrieves responses from the AI assistant or rule-based bot, and displays the responses.
+- **Integration**: Integrates `get_response` from `main.py` and `simple_chatbot` from `rule_based.py` to handle user queries.
+## Project Flow
+1. **User Interaction**:
+   - The user interacts with the Streamlit app by typing a query into the chat input.
+2. **Message Capture**:
+   - The app captures the user's input and appends it to the chat history.
+3. **Response Generation**:
+   - The `get_response` function is called with the user query.
+   - `get_response` decides whether to use the `convo_agent` or `csv_agent` based on the nature of the query.
+   - The appropriate agent processes the query and returns the response.
+4. **Rule-Based Check**:
+   - Optionally, the response can be generated by the `simple_chatbot` for predefined queries if the rule-based option is enabled.
+5. **Response Display**:
+   - The response is formatted and displayed in the Streamlit app.
+   - If the response includes a table or chart, it is rendered accordingly.
+6. **Chat History Update**:
+   - The chat history is updated with the new user query and the AI response.
+## How to Run
+**Configuring the Groq API**: Create an API key in the [Groq Console](https://console.groq.com/docs/api-reference#chat) and set it where `src/main.py` initializes the Groq client and `ChatGroq`.
+1. Ensure you have the necessary dependencies installed:
+   ```bash
+   pip install -r requirements.txt
+   ```
+2. Run the Streamlit app:
+   ```bash
+   streamlit run app.py
+   ```
+3. Interact with the FinBuddy assistant through the Streamlit interface by typing queries related to the financial metrics of MSFT, TSLA, and AAPL.
+Currently, the app does not render visualizations or tables; support will be added in the next iteration.
+## Conclusion
+FinBuddy is an AI assistant for financial queries about MSFT, TSLA, and AAPL. It combines rule-based logic with LLM-backed agents to return text answers, and Streamlit provides the chat interface.
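
Reviewer note: the routing step in the README's Project Flow can be tried without any network call. The snippet below is a minimal sketch, not the code in `src/main.py`. `dispatch_tool_call`, `convo_stub` and `csv_stub` are hypothetical names, and it assumes the tool-call arguments arrive as a JSON string carrying a `question` field, which is what the tool schemas in `src/main.py` declare.

```python
import json

# Hypothetical stand-ins for convo_agent / csv_agent in src/main.py.
def convo_stub(question, chat_history):
    return {"answer": "I answer questions about MSFT, TSLA and AAPL financials."}

def csv_stub(question, chat_history):
    return {"answer": f"(would query the CSV agent for: {question})"}

FUNCTION_MAP = {"convo_agent": convo_stub, "csv_agent": csv_stub}

def dispatch_tool_call(name, arguments_json, chat_history, function_map=FUNCTION_MAP):
    """Route one tool call to its handler and always return a dict."""
    handler = function_map.get(name)
    if handler is None:
        return {"answer": f"Unknown tool: {name}"}
    try:
        # Assumption: arguments are a JSON object with a "question" key.
        question = json.loads(arguments_json)["question"]
    except (json.JSONDecodeError, KeyError, TypeError):
        return {"answer": "Could not read the question from the tool call."}
    return handler(question, chat_history)

print(dispatch_tool_call("csv_agent", '{"question": "MSFT revenue 2023?"}', ""))
```

Returning a dict on every path keeps the caller simple: the UI layer only has to look for an `answer` key instead of handling both strings and dicts.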

Rule_Based_Sample_Response.png ADDED

Smart_Chatbot.png ADDED

Smart_Chatbot2.png ADDED

app.py ADDED

@@ -0,0 +1,48 @@

+import time
+import warnings
+import pandas as pd
+import streamlit as st
+from langchain_community.chat_message_histories import StreamlitChatMessageHistory
+from src.main import get_response
+from src.rule_based import simple_chatbot
+from src.main import write_answer
+warnings.filterwarnings("ignore")
+# Streamlit app setup
+st.title("FinBuddy")
+st.write("I am an AI Assistant helping with answering questions related to financial metrics of MSFT, TSLA, and AAPL.")
+msgs = StreamlitChatMessageHistory(key="special_app_key")
+for msg in msgs.messages:
+    st.chat_message(msg.type).write(msg.content)
+if prompt := st.chat_input():
+    start_time = time.time()
+    st.chat_message("human").write(prompt)
+    msgs.add_user_message(prompt)
+    with st.spinner("Waiting for response..."):
+        # Get response from chatbot
+        response_text = get_response(prompt)
+        # enable below one to get the plots or table
+        #response_text = write_answer(response_text)
+        # Rule based bot
+        #response_text = simple_chatbot(prompt)
+    if response_text:
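+        if isinstance(response_text, dict) and 'answer' in response_text:
+            st.chat_message("ai").write(response_text['answer'])
+            msgs.add_ai_message(response_text['answer'])
+        else:
+            # Table/chart payloads and plain strings: show and store once as text
+            st.chat_message("ai").write(str(response_text))
+            msgs.add_ai_message(str(response_text))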
+        end_time = time.time()
+        st.write(f"Total time {end_time - start_time}")
+    else:
+        st.error("No valid response received from the AI.")
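
Reviewer note: `get_response` can return a dict or a plain string (its error paths return strings), and `simple_chatbot` always returns a string, so the display branch has to cope with both. A minimal sketch of one way to normalize the value before rendering follows. `normalize_response` is a hypothetical helper, not part of this commit, and the payload keys are assumed to be the ones named in the `csv_agent` prompt.

```python
def normalize_response(response):
    """Return (kind, text) for whatever the chatbot produced.

    kind is "answer" for plain text, the payload key for table/bar/line
    dicts, and "error" when nothing usable came back.
    """
    if isinstance(response, dict):
        if "answer" in response:
            return "answer", str(response["answer"])
        # Assumed payload keys, taken from the csv_agent prompt.
        for kind in ("table", "bar", "line"):
            if kind in response:
                return kind, str(response[kind])
        return "error", "Unrecognized response payload."
    if isinstance(response, str) and response.strip():
        return "answer", response
    return "error", "No valid response received from the AI."

print(normalize_response({"answer": "hello"}))
print(normalize_response("Sorry, I can only provide information on predefined queries."))
```

With a helper like this, the Streamlit code could branch on `kind` once and call `st.error` only for the `"error"` case.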

data/Financial_data.csv ADDED

@@ -0,0 +1,10 @@

+Company,Year,Total Revenue(In Millions),Net Income(In Millions),Total Assets(In Millions),Total Liabilities(In Millions),Cash Flow from Operating Activities(In Millions)
+Microsoft,2023,211915.00,72361.00,411976.00,205753.00,87582.00
+Tesla,2023,96773.00,14974.00,106618.00,43009.00,13256.00
+Apple,2023,383285.00,96995.00,352583.00,290437.00,110543.00
+Microsoft,2022,198270.00,72738.00,364840.00,198298.00,89035.00
+Tesla,2022,81462.00,12587.00,82338.00,36440.00,14724.00
+Apple,2022,394328.00,99803.00,352755.00,302083.00,122151.00
+Microsoft,2021,168088.00,61271.00,333779.00,191791.00,76740.00
+Tesla,2021,53823.00,5644.00,62131.00,30548.00,11497.00
+Apple,2021,365817.00,94680.00,351002.00,287912.00,104038.00
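
Reviewer note: the CSV is small enough to sanity-check by hand. The sketch below uses only the standard library and a three-row excerpt of the file (first three columns only) to look up revenue and compute year-over-year growth. `load_revenue` and `yoy_growth_pct` are hypothetical helper names, and the app itself reads the file through the LangChain CSV agent rather than like this.

```python
import csv
import io

# Excerpt of data/Financial_data.csv: Microsoft rows, first three columns only.
SAMPLE = """Company,Year,Total Revenue(In Millions)
Microsoft,2023,211915.00
Microsoft,2022,198270.00
Microsoft,2021,168088.00
"""

def load_revenue(text):
    """Map (company, year) -> total revenue in millions."""
    rows = csv.DictReader(io.StringIO(text))
    return {
        (row["Company"], int(row["Year"])): float(row["Total Revenue(In Millions)"])
        for row in rows
    }

def yoy_growth_pct(revenue, company, year):
    """Year-over-year revenue growth in percent, rounded to two decimals."""
    current = revenue[(company, year)]
    previous = revenue[(company, year - 1)]
    return round((current - previous) / previous * 100, 2)

revenue = load_revenue(SAMPLE)
print(revenue[("Microsoft", 2023)])
print(yoy_growth_pct(revenue, "Microsoft", 2023))
```

Values computed this way can serve as ground truth when spot-checking the agent's answers.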

requirements.txt ADDED

@@ -0,0 +1,75 @@

+aiohttp==3.9.5
+aiosignal==1.3.1
+altair==5.3.0
+annotated-types==0.7.0
+anyio==4.4.0
+attrs==23.2.0
+blinker==1.8.2
+cachetools==5.3.3
+certifi==2024.6.2
+charset-normalizer==3.3.2
+click==8.1.7
+colorama==0.4.6
+dataclasses-json==0.6.6
+distro==1.9.0
+frozenlist==1.4.1
+gitdb==4.0.11
+GitPython==3.1.43
+greenlet==3.0.3
+groq==0.8.0
+h11==0.14.0
+httpcore==1.0.5
+httpx==0.27.0
+idna==3.7
+Jinja2==3.1.4
+jsonpatch==1.33
+jsonpointer==2.4
+jsonschema==4.22.0
+jsonschema-specifications==2023.12.1
+langchain==0.2.1
+langchain-community==0.2.1
+langchain-core==0.2.3
+langchain-experimental==0.0.59
+langchain-groq==0.1.4
+langchain-text-splitters==0.2.0
+langsmith==0.1.68
+markdown-it-py==3.0.0
+MarkupSafe==2.1.5
+marshmallow==3.21.2
+mdurl==0.1.2
+multidict==6.0.5
+mypy-extensions==1.0.0
+numpy==1.26.4
+orjson==3.10.3
+packaging==23.2
+pandas==2.2.2
+pillow==10.3.0
+protobuf==4.25.3
+pyarrow==16.1.0
+pydantic==2.7.3
+pydantic_core==2.18.4
+pydeck==0.9.1
+Pygments==2.18.0
+python-dateutil==2.9.0.post0
+pytz==2024.1
+PyYAML==6.0.1
+referencing==0.35.1
+requests==2.32.3
+rich==13.7.1
+rpds-py==0.18.1
+six==1.16.0
+smmap==5.0.1
+sniffio==1.3.1
+SQLAlchemy==2.0.30
+streamlit==1.35.0
+tabulate==0.9.0
+tenacity==8.3.0
+toml==0.10.2
+toolz==0.12.1
+tornado==6.4
+typing-inspect==0.9.0
+typing_extensions==4.12.1
+tzdata==2024.1
+urllib3==2.2.1
+watchdog==4.0.1
+yarl==1.9.4

src/__init__.py ADDED

File without changes

src/__pycache__/__init__.cpython-311.pyc ADDED

Binary file (241 Bytes)

src/__pycache__/main.cpython-311.pyc ADDED

Binary file (10.1 kB)

src/__pycache__/rule_based.cpython-311.pyc ADDED

Binary file (1.03 kB)

src/main.py ADDED

@@ -0,0 +1,209 @@

+import ast
+import json
+import streamlit as st
+import pandas as pd
+from langchain.agents.agent_types import AgentType
+from langchain_experimental.agents import create_csv_agent
+from langchain_groq import ChatGroq
+from langchain.memory import ChatMessageHistory
+from groq import Groq
+# Initialize Groq client and model
+client = Groq(api_key='gsk')
+MODEL = 'llama3-70b-8192'
+# Initialize chat history
+history = ChatMessageHistory()
+history.add_user_message("hi!")
+history.add_ai_message("whats up?")
+# Initialize language model
+llm = ChatGroq(
+    temperature=0,
+    groq_api_key='gsk...',
+    model_name='llama3-70b-8192'
+)
+# Create CSV agent
+agent = create_csv_agent(
+    llm,
+    r"data/Financial_data.csv",
+    verbose=True,
+    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,
+    max_iterations=5,
+    handle_parsing_errors=True
+)
+# Functions to handle conversations
+def convo_agent(question, chat_history):
+    response = 'I was built to answer questions related to the financials of MSFT, TSLA and AAPL. Let me know if you have any questions on these.'
+    return {'answer': response}
+def csv_agent(question, chat_history):
+    prompt = (
+        """
+        Let's decode the way to respond to the queries. The responses depend on the type of information requested in the query.
+        Return just the data, don't take effort of creating plots, prints and all.
+        No explanation needed. Return just the dict
+        Always include units in response .
+        1. If the query requires a table, format your answer like this:
+           {"table": {"columns": ["column1", "column2", ...], "data": [[value1, value2, ...], [value1, value2, ...], ...]}}
+        2. For a bar chart, respond like this:
+           {"bar": {"columns": ["A", "B", "C", ...], "data": [25, 24, 10, ...]}}
+        3. If a line chart is more appropriate, your reply should look like this:
+           {"line": {"columns": ["A", "B", "C", ...], "data": [25, 24, 10, ...]}}
+        Note: We only accommodate two types of charts: "bar" and "line".
+        4. For a plain question that doesn't need a chart or table, your response should be:
+           {"answer": "Your answer goes here"}
+        For example:
+           {"answer": "The Product with the highest Orders is '15143Exfo'"}
+        5. If the answer is not known or available, respond with:
+           {"answer": "I do not know."}
+        Return all output as a string. Remember to encase all strings in the "columns" list and data list in double quotes.
+        For example: {"columns": ["Products", "Orders"], "data": [["51993Masc", 191], ["49631Foun", 152]]}
+        Return all the numerical values in int format only.
+        Now, let's tackle the query step by step. Here's the query for you to work on:"""
+        +
+        question
+    )
+    response = agent.run(prompt)
+    return ast.literal_eval(response)
+# Define tools and function mapping
+tool_convo_agent = {
+    "type": "function",
+    "function": {
+        "name": "convo_agent",
+        "description": "Answers questions like chit chat or simple friendly messages",
+        "parameters": {
+            "type": "object",
+            "properties": {
+                "question": {"type": "string", "description": "The user question"}
+            },
+            "required": ["question"],
+        },
+    },
+}
+tool_fin_agent = {
+    "type": "function",
+    "function": {
+        "name": "csv_agent",
+        "description": "Answers questions related to financial metrics of Apple, Microsoft and Tesla.",
+        "parameters": {
+            "type": "object",
+            "properties": {
+                "question": {"type": "string", "description": "The user question"}
+            },
+            "required": ["question"],
+        },
+    },
+}
+tools = [tool_convo_agent, tool_fin_agent]
+function_map = {
+    "csv_agent": csv_agent,
+    "convo_agent": convo_agent
+}
+# Conversation handling
+def run_conversation(chat_history, user_prompt, tools):
+    final_prompt = {'chat_history': chat_history, 'question': user_prompt}
+    messages = [
+        {"role": "system", "content": "You are an efficient agent that determines which function to use in order to answer user question."},
+        {"role": "user", "content": str(final_prompt)},
+    ]
+    response = client.chat.completions.create(
+        model=MODEL,
+        messages=messages,
+        tools=tools,
+        tool_choice="auto",
+        max_tokens=4096
+    )
+    response_message = response.choices[0].message
+    tool_calls = response_message.tool_calls
+    return tool_calls
+def get_response(question):
+    try:
+        history.add_user_message(question)
+        chat_history = str(history.messages)
+        agents = run_conversation(chat_history, question, tools)
+        func_to_call = agents[0].function.name
+        if func_to_call in function_map:
+            question_to_run = json.loads(agents[0].function.arguments)['question']
+            result = function_map[func_to_call](question_to_run, chat_history)
+        else:
+            result = {"error": "Something went Wrong"}
+        if 'error' in result:
+            return "Something went wrong"
+        print(result)
+        history.add_ai_message(str(result))
+        return result
+    except Exception as e:
+        return f"Something went wrong: {e}"
+# Response writing for Streamlit
+def write_answer(response_dict):
+    if not isinstance(response_dict, dict):
+        return "Invalid response format received."
+    if "answer" in response_dict:
+        return response_dict
+    if "bar" in response_dict:
+        data = response_dict["bar"]
+        try:
+            df_data = {col: [x[i] if isinstance(x, list) else x for x in data['data']] for i, col in enumerate(data['columns'])}
+            df = pd.DataFrame(df_data)
+            df.set_index("Year", inplace=True)
+            st.bar_chart(df)
+            return {'bar': ''}
+        except ValueError:
+            st.error(f"Couldn't create DataFrame from data: {data}")
+    if "line" in response_dict:
+        data = response_dict["line"]
+        try:
+            df_data = {col: [x[i] for x in data['data']] for i, col in enumerate(data['columns'])}
+            df = pd.DataFrame(df_data)
+            df.set_index("Year", inplace=True)
+            st.line_chart(df)
+            return {'line': ''}
+        except ValueError:
+            st.error(f"Couldn't create DataFrame from data: {data}")
+    if "table" in response_dict:
+        data = response_dict["table"]
+        try:
+            clean_data = [
+                [int(x.replace(',', '')) if isinstance(x, str) and x.replace(',', '').isdigit() else x for x in row]
+                for row in data["data"]
+            ]
+            df = pd.DataFrame(clean_data, columns=data["columns"])
+            st.table(df)
+            return {'table': ''}
+        except ValueError as e:
+            st.error(f"Couldn't create DataFrame from data: {data}. Error: {e}")
+    return "No valid response type found."
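
Reviewer note: the `csv_agent` prompt asks for chart payloads in a flat form such as `{"columns": ["A", "B", "C"], "data": [25, 24, 10]}`, while `write_answer` indexes into each data item and sets a `"Year"` index that the payload may not contain. A minimal sketch of a converter for the flat form is shown below. `chart_payload_to_series` is a hypothetical helper, and it assumes the model actually follows the layout given in the prompt.

```python
def chart_payload_to_series(payload):
    """Pair each label in "columns" with its value in "data".

    Assumes the flat layout shown in the csv_agent prompt, e.g.
    {"columns": ["A", "B"], "data": [25, 24]}.
    """
    columns = payload.get("columns")
    data = payload.get("data")
    if not isinstance(columns, list) or not isinstance(data, list):
        raise ValueError("payload needs list-valued 'columns' and 'data'")
    if len(columns) != len(data):
        raise ValueError("'columns' and 'data' must have the same length")
    series = {}
    for label, value in zip(columns, data):
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            raise ValueError(f"non-numeric value for {label!r}: {value!r}")
        series[str(label)] = value
    return series

print(chart_payload_to_series({"columns": ["2021", "2022"], "data": [168088, 198270]}))
```

The resulting mapping could be wrapped as `pd.DataFrame({"value": series})` before being handed to `st.bar_chart` or `st.line_chart`, which uses the labels as the index. Because every failure is raised as `ValueError`, the existing `except ValueError` branches would catch malformed payloads.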

src/rule_based.py ADDED

@@ -0,0 +1,13 @@

+def simple_chatbot(user_query):
+   if user_query == "What is the total revenue for MSFT 2023?":
+       return "The total revenue is $211,915 million"
+   elif user_query == "What is the total revenue for AAPL 2023?":
+       return "The total revenue is $383,285 million"
+   elif user_query == "What is the total revenue for TSLA 2023?":
+       return "The total revenue is $96,773 million"
+   elif user_query == "What is the total revenue for AAPL 2022?":
+       return "The total revenue is $394,328 million"
+   elif user_query == "What is the total revenue for TSLA 2022?":
+       return "The total revenue is $81,462 million"
+   else:
+       return "Sorry, I can only provide information on predefined queries."
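
Reviewer note: the exact-string comparisons above miss queries that differ only in case or surrounding whitespace. A table-driven variant is sketched below as one possible alternative. `table_chatbot`, `REVENUE_MILLIONS` and `FALLBACK` are hypothetical names, and the figures are the five (ticker, year) pairs already covered by the if/elif chain.

```python
# Revenue in millions for the (ticker, year) pairs the rule-based bot covers.
REVENUE_MILLIONS = {
    ("MSFT", 2023): 211915,
    ("AAPL", 2023): 383285,
    ("TSLA", 2023): 96773,
    ("AAPL", 2022): 394328,
    ("TSLA", 2022): 81462,
}

FALLBACK = "Sorry, I can only provide information on predefined queries."

def table_chatbot(user_query):
    """Answer 'What is the total revenue for <TICKER> <YEAR>?' from a table."""
    words = user_query.strip().rstrip("?").split()
    prefix = ["what", "is", "the", "total", "revenue", "for"]
    if len(words) != 8 or [w.lower() for w in words[:6]] != prefix:
        return FALLBACK
    ticker, year = words[6].upper(), words[7]
    if not year.isdecimal() or (ticker, int(year)) not in REVENUE_MILLIONS:
        return FALLBACK
    return f"The total revenue is ${REVENUE_MILLIONS[(ticker, int(year))]:,} million"

print(table_chatbot("what is the total revenue for msft 2023?"))
```

Adding a new company or year then means adding one dictionary entry rather than another `elif` branch.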