serdarcaglar's picture
Create README.md
f746b70 verified
---
language:
- tr
---
# Model Card: Turkish Scientific RoBERTa ONNX
## Model Description
ONNX version of roberta-base-turkish-scientific-cased, specialized for Turkish scientific text analysis.
## Intended Use
- Scientific text analysis in Turkish
- Text comprehension
- Fill-mask predictions
- Scientific text summarization
## Training Data
- Source: Turkish scientific article abstracts from trdizin, yöktez, and t.k.
- Training Duration: 3+ days
- Steps: 2M
- Built from scratch, no fine-tuning
## Technical Specifications
- Base Architecture: RoBERTa
- Tokenizer: BPE (Byte Pair Encoding)
- Format: ONNX
- Original Model: serdarcaglar/roberta-base-turkish-scientific-cased
## Performance and Limitations
- Optimized for scientific domain in Turkish
- Not tested for general domain text
- ONNX format optimized for inference
## Requirements
- onnxruntime
- transformers
- torch
## License and Usage
- Follow original model license
- Users responsible for compliance
## Citation
```bibtex
@misc{caglar2024roberta,
author = {Çağlar, Serdar},
title = {Roberta-base-turkish-scientific-cased},
year = {2024},
publisher = {HuggingFace},
url = {https://huggingface.co/serdarcaglar/roberta-base-turkish-scientific-cased}
}
```
## Contact
Serdar ÇAĞLAR (serdarildercaglar@gmail.com)