pdelobelle
commited on
Commit
•
607f97c
1
Parent(s):
f3af836
Update README.md
Browse files
README.md
CHANGED
@@ -12,8 +12,11 @@ license: apache-2.0
|
|
12 |
|
13 |
# BübleLM
|
14 |
|
15 |
-
|
16 |
-
|
|
|
|
|
|
|
17 |
</div>
|
18 |
|
19 |
BübleLM is a German language model based on Gemma-2B, adapted using [trans-tokenization](https://pieter.ai/trans-tokenization/) with a custom German SentencePiece tokenizer. The model demonstrates how language-specific tokenization can significantly improve performance while maintaining the base model's capabilities.
|
|
|
12 |
|
13 |
# BübleLM
|
14 |
|
15 |
+
|
16 |
+
<div align="center" style="margin-bottom: 2rem; margin-top: 2rem">
|
17 |
+
<img src="https://pieter.ai/resources/buble-logo.png" alt="BübleLM Logo" style="max-height: 450px; width: auto;"/>
|
18 |
+
<h1 style="margin-top: 1rem;">BübleLM</h1>
|
19 |
+
<p><em>A small German LM</em></p>
|
20 |
</div>
|
21 |
|
22 |
BübleLM is a German language model based on Gemma-2B, adapted using [trans-tokenization](https://pieter.ai/trans-tokenization/) with a custom German SentencePiece tokenizer. The model demonstrates how language-specific tokenization can significantly improve performance while maintaining the base model's capabilities.
|