python-dotenv langchain torch PyPDF2 faiss-cpu pypdf tiktoken InstructorEmbedding sentence-transformers chromadb