lillybak commited on
Commit
dfd3714
β€’
1 Parent(s): 98d7a92

Update README.md

Browse files

The parser for the PDF is the LlamaParse which is one of the best for table extraction.
The original file has 96 pages. We are choosing pages 12 and 64 only to test how well the tables are parsed and how well the LLM can extract the info from the tables.
Observe how the LLM "reads" the tables. It is not perfect where the table is ambiguous at the separation of each column.
Example questions:
1. What is the Accumulated Amortization of the Acquisition-related intangible assets in Jan 28 2024?
2. Who is the CEO?
3. What is the gross carrying amount of Total Amortizable Intangible Assets for Jan 29, 2023?
4. What is the Accumulated Amortization of the Acquisition-related intangible assets in Jan 28 2024?
5. Who are the E-VP of Operations and how old are they?

Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -1,11 +1,11 @@
1
  ---
2
  title: NVIDIA RAG
3
  emoji: 🌍
4
- colorFrom: yellow
5
- colorTo: red
6
  sdk: docker
7
  pinned: false
8
  license: openrail
9
  ---
10
 
11
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
1
  ---
2
  title: NVIDIA RAG
3
  emoji: 🌍
4
+ colorFrom: green
5
+ colorTo: blue
6
  sdk: docker
7
  pinned: false
8
  license: openrail
9
  ---
10
 
11
+ Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference