wjbmattingly committed
Commit c3d811a
1 parent: 1fe831f
Update README.md
README.md
CHANGED
@@ -7,4 +7,28 @@ sdk: static
 pinned: false
 ---
 
-
+# Medieval Data
+
+Welcome to the **Medieval Data** organization, which hosts datasets curated for training machine learning models on tasks specific to medieval sources.
+
+These datasets and models are maintained by [William J.B. Mattingly](https://wjbmattingly.com/).
+
+## Datasets
+
+Here's a quick overview of our available datasets:
+
+1. **MGH Critical Edition Dataset**: 100 annotated pages of an MGH (Monumenta Germaniae Historica) critical edition, labeled to separate the main body text and titles from marginalia and footers.
+
+## Models
+
+1. **MGH Object Detection YOLOv8**: Detects the main body text and titles on pages of an MGH critical edition so they can be extracted automatically, which helps downstream OCR with Tesseract.
+
+Replace `dataset_name` with the specific name of the dataset you're interested in.
+
+## Contribute
+
+We welcome contributions! If you have a medieval-specific dataset or annotations that can be added, please reach out.
+
+## License
+
+All datasets in this organization are released under the [CC BY 4.0 License](https://creativecommons.org/licenses/by/4.0/) unless specified otherwise. Please cite the original sources and the Medieval Data organization when using the datasets.
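The README line about replacing `dataset_name` reads as though it accompanies a loading snippet, but none appears in this diff. Below is a minimal sketch of what such a snippet might look like, assuming the datasets are hosted on the Hugging Face Hub and can be loaded with the `datasets` library. The `medieval-data` namespace and both helper names are hypothetical and are not taken from the README.

```python
from typing import Optional

# Hypothetical Hub namespace: the README does not state the organization's slug.
DEFAULT_ORG = "medieval-data"


def build_repo_id(dataset_name: str, org: str = DEFAULT_ORG) -> str:
    """Join an organization slug and a bare dataset name into a Hub repo id."""
    name = dataset_name.strip()
    if not name or "/" in name:
        raise ValueError(f"expected a bare dataset name, got {dataset_name!r}")
    return f"{org}/{name}"


def load_medieval_dataset(
    dataset_name: str, org: str = DEFAULT_ORG, split: Optional[str] = None
):
    """Download a dataset from the Hub (needs `pip install datasets` and network access)."""
    # Imported lazily so build_repo_id stays usable without the library installed.
    from datasets import load_dataset

    return load_dataset(build_repo_id(dataset_name, org), split=split)
```

Under these assumptions, calling `load_medieval_dataset` with a real dataset name in place of `dataset_name` would fetch that dataset. The actual repository ids should be checked on the organization's Hub page.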