Matching pdf with keywords

#57
by DevSelego - opened

Hey !

I have an issue with my project : I have a bunch of medical drug file and I'm trying to find specific file for specific type of drugs

What I did is search for keywords in every pdf with a retriever retriever = db.as_retriever(search_kwargs={'k': 2}) and embedding, it returns me paragraph, using sentence transformers.

Then I compare with the specificity of the drug I am looking for by comparing embedding => so I'm comparing paragraphs with keywords,

The result are not the good files, do you have any ideas how to improve it, I was thinking to summarize the paragraphe before doing the comparison.

Thanks fo the help :))))

Sign up or log in to comment