---
library_name: transformers
tags:
- cryptology
- cipher
datasets:
- asimokby/Turkish-OSCAR-GEC
language:
- tr
base_model:
- google-t5/t5-base
license: apache-2.0
---

This project contains a text-to-text model that decrypts Turkish text encoded with a substitution cipher.
In a substitution cipher, each plaintext letter is replaced by a fixed, unique ciphertext letter.
The model uses the statistical and linguistic properties of Turkish to infer the letter substitutions and recover the original plaintext.

The model targets monoalphabetic substitution ciphers over Turkish text and outputs the decoded text.

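To make the cipher concrete, here is a minimal sketch of a monoalphabetic substitution over Turkish letters. It is illustrative only: the 29-letter alphabet, the `make_key`, `substitute`, and `invert` helpers, and the random key are assumptions, not the procedure used to build this model's training data.

```python
import random

# Assumption: the 29-letter Turkish alphabet. The alphabet actually used
# to produce this model's training ciphertexts is not documented here.
LOWER = "abcçdefgğhıijklmnoöprsştuüvyz"
UPPER = "ABCÇDEFGĞHIİJKLMNOÖPRSŞTUÜVYZ"  # same order, so ı↔I and i↔İ line up


def make_key(seed=None):
    """Return a random one-to-one mapping between lowercase letters."""
    rng = random.Random(seed)
    shuffled = list(LOWER)
    rng.shuffle(shuffled)
    return dict(zip(LOWER, shuffled))


def substitute(text, key):
    """Apply the key letter by letter, preserving case.

    Characters outside the alphabet (spaces, punctuation) pass through.
    """
    table = {}
    for src, dst in key.items():
        table[ord(src)] = dst
        table[ord(UPPER[LOWER.index(src)])] = UPPER[LOWER.index(dst)]
    return text.translate(table)


def invert(key):
    """Swap the direction of the key (encrypt <-> decrypt)."""
    return {dst: src for src, dst in key.items()}


key = make_key(seed=0)
plain = "Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum."
cipher = substitute(plain, key)
restored = substitute(cipher, invert(key))
```

Because the key is one-to-one, applying the inverted key restores the plaintext exactly. The model has to infer that inverse from the ciphertext alone.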
Example:

Encoded text:

**Wn cqwqp fqpöqgbatqj ckjxq çmxmçmvy rafhqça zgqjgaçkxnh.**

Decoded text:

**Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum.**

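The pair above can be checked with a short sketch: reading the two strings side by side yields the letter mapping that this particular key implies. The variable names are illustrative, and the check only covers letters that occur in this sentence.

```python
cipher = "Wn cqwqp fqpöqgbatqj ckjxq çmxmçmvy rafhqça zgqjgaçkxnh."
plain = "Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum."

# Map each ciphertext letter to the plaintext letter at the same position.
decrypt_map = {}
for c, p in zip(cipher.lower(), plain.lower()):
    if c.isalpha():
        # setdefault stores the first pairing; later pairings must agree with it.
        assert decrypt_map.setdefault(c, p) == p, f"inconsistent mapping for {c!r}"

# A valid key is one-to-one: no two ciphertext letters decode to the same letter.
assert len(set(decrypt_map.values())) == len(decrypt_map)

# Applying the recovered mapping reproduces the (lowercased) plaintext.
recovered = "".join(decrypt_map.get(ch, ch) for ch in cipher.lower())
print(recovered)
```

Every repeated letter maps the same way throughout the sentence, which is the regularity a decryption model can exploit.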
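
Suggested Usage:

```py
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_id = ""  # Hub ID or local path of this model here!
device = "cuda" if torch.cuda.is_available() else "cpu"

# Load the model and tokenizer (a T5-style seq2seq checkpoint, per base_model)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id).to(device)

cipher_text = ""  # Encoded text here!

# Tokenize, generate, and decode; **inputs also passes the attention mask
inputs = tokenizer(cipher_text, return_tensors="pt", padding=True, truncation=True, max_length=256).to(device)
outputs = model.generate(**inputs, max_length=256)
decoded_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
```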