metadata
title: README
emoji: π
colorFrom: red
colorTo: yellow
sdk: static
pinned: false
Hello, we're minish!
We're a two-person (@pringled and @stephantul) open-source company, with a focus on Natural Language Processing.
We believe that if you make models fast enough, you unlock new possibilities.
Using our software, you can:
- Ingest the entire English Wikipedia in 5 minutes
- Classify tens of thousands of documents per second on CPU
- Approximately deduplicate extremely large datasets in minutes
- Build the fastest RAG application in the world
- Easily evaluate which ANN algorithm works best for your data
Our projects:
- model2vec: make tiny models that are still really really good.
- potion: the best small model in the world. 100-500x faster than a sentence-transformer, and almost as good.
- vicinity: consistent interfaces to many approximate nearest neighbor algorithms.
- semhash: lightning-fast, super accuracte, approximate deduplication for your text datasets.