This project contains a text-to-text model designed to decrypt Turkish text encoded using a substitution cipher. In a substitution cipher, each letter in the plaintext is replaced by a corresponding, unique letter to form the ciphertext. The model leverages statistical and linguistic properties of Turkish to make educated guesses about the letter substitutions, aiming to recover the original plaintext message.

This model is for monoalphabetic Turkish substitution ciphers and it outputs decoded text.

Example:

Encoded text: Wn cqwqp fqpöqgbatqj ckjxq çmxmçmvy rafhqça zgqjgaçkxnh.

Decoded text: Bu sabah kahvaltıdan sonra yürüyüşe çıkmayı planlıyorum.

Suggested Usage:

#Load the model and tokenizer
cipher_text = "" #Encoded text here!
inputs = tokenizer(cipher_text, return_tensors="pt", padding=True, truncation=True, max_length=256).to(device)
outputs = model.generate(inputs["input_ids"], max_length=256)
decoded_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
Downloads last month
46
Safetensors
Model size
223M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Cipher-AI/Substitution-Cipher-Text-Turkish

Base model

google-t5/t5-base
Finetuned
(428)
this model
Quantizations
1 model

Dataset used to train Cipher-AI/Substitution-Cipher-Text-Turkish

Collection including Cipher-AI/Substitution-Cipher-Text-Turkish