Update README.md

555c94c verified about 1 month ago

4.7 kB

	---
	license: mit
	language:
	- he
	tags:
	- languages
	- manuscripts
	- hebrew
	- ocr
	- letters
	- manuscript
	- digital-humanities
	datasets:
	- bsesic/HebrewManuscripts
	---

	# Hebrew Letter Recognition Model

	## Model Description

	This is a Convolutional Neural Network (CNN) model trained to recognize Hebrew letters and a stop symbols in images. The model can identify individual letters from a provided image, outputting their respective class along with probabilities.

	## Model Details:
	* Model Type: Convolutional Neural Network (CNN)
	* Framework: TensorFlow 2.x / Keras
	* Input Size: 64x64 grayscale images of isolated letters.
	* Output Classes: 28 Hebrew letters + 1 stop symbol (.)
	* Use Case: Recognizing handwritten or printed Hebrew letters and punctuation in scanned images or photos of documents.

	## Intended Use

	This model is designed for the automatic recognition of Hebrew letters from images. The model can be used in applications such as:

	* Optical character recognition (OCR) systems for Hebrew text.
	* Educational tools to help learners read Hebrew text.
	* Historical document digitization of Hebrew manuscripts.

	## How to Use:
	```python
	from tensorflow.keras.models import load_model
	import numpy as np
	import cv2

	# Load the model
	model = load_model('path_to_model.hebrew_letter_model.keras')

	# Preprocess an input image (example for one letter)
	img = cv2.imread('path_to_image.jpg', cv2.IMREAD_GRAYSCALE)
	img_resized = cv2.resize(img, (64, 64)) / 255.0
	img_array = np.expand_dims(img_resized, axis=0)

	# Predict
	predictions = model.predict(img_array)
	predicted_class = np.argmax(predictions, axis=1)[0]

	# Class names for Hebrew letters
	class_names = ['stop', 'א', 'ב', 'ג', 'ד', 'ה', 'ו', 'ז', 'ח', 'ט', 'י', 'ך', 'כ', 'ל', 'ם', 'מ', 'ן', 'נ', 'ס', 'ע', 'ף', 'פ', 'ץ', 'צ', 'ק', 'ר', 'ש', 'ת']

	print("Predicted letter:", class_names[predicted_class])

	```

	## Example:
	If given an image with the Hebrew word "אברם" (Abram), the model can detect and classify the letters and stop symbols with probabilities.

	## Limitations:

	* Font Variations: The model performs best on specific fonts (e.g., square Hebrew letters). Performance may degrade with highly stylized or cursive fonts.
	* Noise Sensitivity: Images with a lot of noise, artifacts, or low resolution may lead to incorrect predictions.
	* Stop Symbol: The stop symbol is particularly recognized by detecting three vertical dots. However, false positives can occur if letters with similar shapes are present.

	## Training Data:

	The model was trained on a dataset containing Hebrew letters and stop symbols. The training dataset includes:

	* 28 Hebrew letters.
	* 1 stop symbol representing three vertical dots (.).

	## Training Procedure:
	* Optimizer: Adam
	* Loss function: Categorical Crossentropy
	* Batch size: 32
	* Epochs: 10

	Data augmentation was applied to reduce overfitting and increase the model's generalizability to unseen data. This includes random rotations, zooms, and horizontal flips.

	## Model Performance

	# Metrics:
	* Accuracy: 95% on the validation dataset.
	* Precision: 94%
	* Recall: 93%
	*
	Performance may vary depending on the quality of the input images, noise levels, and whether the letters are handwritten or printed.

	## Known Issues:
	* False Positives for Stop Symbols: The model sometimes incorrectly identifies letters that resemble three vertical dots as stop symbols.
	* Overfitting to Specific Fonts: Performance can degrade on handwritten texts or cursive fonts not represented well in the training set.

	## Ethical Considerations

	* Bias: The model was trained on a specific set of Hebrew fonts and may not perform equally well across all types of Hebrew texts, particularly historical or handwritten documents.
	Fairness: The model may produce varying results depending on font style, quality of input images, and preprocessing applied.

	## Future Work:

	* Improving Generalization: Future work will focus on improving the model's robustness to different fonts, handwriting styles, and noisy inputs.
	Multilingual Expansion: Adding support for other Semitic scripts or expanding the model for multilingual OCR tasks.
	Citation:

	If you use this model in your work, please cite it as follows:

	```bibtex
	@misc{hebrew-letter-recognition,
	title={Hebrew Manuscripts Letter Recognition Model},
	author={Benjamin Schnabel},
	year={2024},
	howpublished={\url{https://huggingface.co/bsesic/HebrewManuscriptsMNIST}},
	}
	```

	License:

	This model is licensed under [MIT License](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md).