Update README.md
README.md
CHANGED
@@ -10,8 +10,8 @@ tags:
 datasets:
 - burgerbee/wikipedia-en-20240320
 ---
-# Wikipedia txtai embeddings index
-This is a [txtai](https://github.com/neuml/txtai) embeddings index (
+# Wikipedia txtai embeddings index
+This is a [txtai](https://github.com/neuml/txtai) embeddings index (5GB embeddings + 25GB documents) for the [english edition of Wikipedia](https://en.wikipedia.org/).
 
 Embeddings is the engine that delivers semantic search. Data is transformed into embeddings vectors where similar concepts will produce similar vectors.
 An embeddings index generated by txtai is a fully encapsulated index format. It dosen't require a database server.
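
The README text in this change says that similar concepts produce similar vectors. A minimal sketch of that idea follows. It uses plain Python rather than txtai, and the three-dimensional vectors, the `vectors` table, and the `cosine` and `search` helpers are all hypothetical names and values hand-picked for illustration. Real embeddings come from a trained model and have hundreds of dimensions.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical toy "embeddings": made-up values, not output of any model.
vectors = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "bridge": [0.0, 0.2, 0.9],
}


def search(query, k=1):
    """Return the k stored items most similar to the query vector."""
    scores = [(name, cosine(query, vec)) for name, vec in vectors.items()]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:k]


if __name__ == "__main__":
    # A query vector close to "cat" and "kitten" ranks those two first.
    for name, score in search([0.85, 0.15, 0.05], k=2):
        print(name, round(score, 3))
```

Ranking by cosine similarity is one common choice for semantic search. Whether this particular index uses it depends on its configuration, which the diff does not show.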