sentence_transformers gradio openai langchain pypdf unstructured pinecone-client docx2txt InstructorEmbedding