streamlit
PyPDF2
nltk
requests
sqlalchemy
pandas
openai
spacy