🚀 RAGoon is now available on PyPI and GitHub, and as a Hugging Face Space for batched embedding generation 🤗
RAGoon is a set of NLP utilities for multi-model embedding production and high-dimensional vector visualization. It aims to improve language model performance by providing contextually relevant information through search-based querying, web scraping, and data augmentation techniques.
At this stage, five major classes are available via RAGoon to facilitate:

- the production of chain embeddings for several models, to simplify a continuous deployment process;
- the production of LLM requests for web querying and content retrieval via the Google API;
- recursive chunking via tokens;
- data visualization, including loading embeddings from a FAISS index, reducing their dimensionality with PCA and/or t-SNE, and displaying them in an interactive 3D graph;
- the creation of binary indexes for search with scalar (int8) rescoring.
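
To make the last item more concrete, here is a minimal NumPy-only sketch of the general technique: retrieve candidates with a binary (sign-bit) index using Hamming distance, then rescore them with int8-quantized vectors. This is not RAGoon's API. The function names (`quantize_binary`, `quantize_int8`, `search`), the `oversample` parameter, and the per-dimension min/max quantization scheme are all assumptions chosen for illustration.

```python
import numpy as np


def quantize_binary(embeddings):
    """Keep only the sign of each dimension, packed 8 dimensions per byte."""
    return np.packbits(embeddings > 0, axis=1)


def quantize_int8(embeddings):
    """Map each dimension linearly onto int8 using its observed min/max (assumed scheme)."""
    lo = embeddings.min(axis=0)
    hi = embeddings.max(axis=0)
    scale = np.where(hi > lo, (hi - lo) / 255.0, 1.0)
    quantized = np.round((embeddings - lo) / scale) - 128
    return quantized.astype(np.int8), lo, scale


def search(query, binary_index, int8_index, lo, scale, top_k=3, oversample=4):
    """Hamming-distance retrieval on the binary index, then int8 rescoring."""
    query_bits = np.packbits(query > 0)
    # XOR marks differing bits; counting them gives the Hamming distance.
    hamming = np.unpackbits(binary_index ^ query_bits, axis=1).sum(axis=1)
    n_candidates = min(len(binary_index), top_k * oversample)
    candidates = np.argsort(hamming, kind="stable")[:n_candidates]
    # Dequantize only the candidates and score them against the float query.
    approx = (int8_index[candidates].astype(np.float32) + 128.0) * scale + lo
    scores = approx @ query
    order = np.argsort(-scores, kind="stable")[:top_k]
    return candidates[order], scores[order]


# Synthetic data standing in for real embeddings.
rng = np.random.default_rng(0)
docs = rng.standard_normal((100, 128)).astype(np.float32)
binary_index = quantize_binary(docs)
int8_index, lo, scale = quantize_int8(docs)

query = docs[7] + 0.01 * rng.standard_normal(128).astype(np.float32)
ids, scores = search(query, binary_index, int8_index, lo, scale)
print(ids, scores)
```

The binary index is 32 times smaller than the float32 vectors, so the first pass is cheap; oversampling and then rescoring with the more precise int8 vectors recovers some of the ranking quality lost to the 1-bit quantization.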