	---
	license: apache-2.0
	language:
	- en
	pipeline_tag: text-classification
	---

	# DeTexD-RoBERTa-base delicate text detection

	This is a baseline RoBERTa-base model for the delicate text detection task.

	* Paper: [DeTexD: A Benchmark Dataset for Delicate Text Detection](https://aclanthology.org/2023.woah-1.2)
	* [GitHub repository](https://github.com/grammarly/detexd)

	The model outputs six labels. Their meanings, according to the paper:
	- LABEL_0 -> non-delicate (0)
	- LABEL_1 -> very low risk (1)
	- LABEL_2 -> low risk (2)
	- LABEL_3 -> medium risk (3)
	- LABEL_4 -> high risk (4)
	- LABEL_5 -> very high risk (5)
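For readable output, the label list above can be written as a lookup table. This is a minimal sketch: the names `LABEL_NAMES` and `describe_label` are illustrative helpers and are not part of the model or the DeTexD repository.

```python
# Human-readable names for the model's output labels, copied from the list above.
# LABEL_NAMES and describe_label are hypothetical helpers for illustration only.
LABEL_NAMES = {
    "LABEL_0": "non-delicate",
    "LABEL_1": "very low risk",
    "LABEL_2": "low risk",
    "LABEL_3": "medium risk",
    "LABEL_4": "high risk",
    "LABEL_5": "very high risk",
}


def describe_label(label: str) -> str:
    """Return a readable description such as 'medium risk (3)' for 'LABEL_3'."""
    risk_level = int(label.rsplit("_", 1)[1])  # numeric suffix of the label
    return f"{LABEL_NAMES[label]} ({risk_level})"


print(describe_label("LABEL_3"))  # medium risk (3)
```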

	## Classification example code

	Here's a short usage example with the `transformers` library that reduces the six-class output to a binary delicate / non-delicate decision:

	```python
	from transformers import pipeline

	classifier = pipeline("text-classification", model="grammarly/detexd-roberta-base")

	def predict_binary_score(text: str):
	    # get multiclass probability scores (one entry per label)
	    scores = classifier(text, top_k=None)

	    # convert to a single score by summing the probability scores
	    # for the higher-index classes (medium risk and above)
	    return sum(score['score']
	               for score in scores
	               if score['label'] in ('LABEL_3', 'LABEL_4', 'LABEL_5'))

	def predict_delicate(text: str, threshold=0.72496545):
	    # a text is flagged as delicate when its binary score exceeds the threshold
	    return predict_binary_score(text) > threshold

	print(predict_delicate("Time flies like an arrow. Fruit flies like a banana."))
	```

	Expected output:

	```
	False
	```
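To make the aggregation step concrete without downloading the model, the sketch below applies the same summation to a hand-written score list. The probabilities in `example_scores` are invented for illustration and are not real model output, and `binary_score_from_scores` is a hypothetical helper. It assumes the classifier returns a list of `{'label', 'score'}` dictionaries, as the example above does.

```python
# Labels counted as delicate in the binary setting (medium risk and above).
DELICATE_LABELS = ("LABEL_3", "LABEL_4", "LABEL_5")


def binary_score_from_scores(scores):
    """Sum the probabilities assigned to the delicate labels."""
    return sum(s["score"] for s in scores if s["label"] in DELICATE_LABELS)


# Invented probabilities for illustration only; NOT real model output.
example_scores = [
    {"label": "LABEL_0", "score": 0.50},
    {"label": "LABEL_1", "score": 0.15},
    {"label": "LABEL_2", "score": 0.05},
    {"label": "LABEL_3", "score": 0.15},
    {"label": "LABEL_4", "score": 0.10},
    {"label": "LABEL_5", "score": 0.05},
]

score = binary_score_from_scores(example_scores)
print(round(score, 2))     # 0.3
print(score > 0.72496545)  # False: below the threshold used above
```

Because the score sums three class probabilities, it stays between 0 and 1, so the threshold can be raised or lowered to trade precision against recall.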

	## Citation Information

	```
	@inproceedings{chernodub-etal-2023-detexd,
	title = "{D}e{T}ex{D}: A Benchmark Dataset for Delicate Text Detection",
	author = "Yavnyi, Serhii and Sliusarenko, Oleksii and Razzaghi, Jade and Mo, Yichen and Hovakimyan, Knar and Chernodub, Artem",
	booktitle = "The 7th Workshop on Online Abuse and Harms (WOAH)",
	month = jul,
	year = "2023",
	address = "Toronto, Canada",
	publisher = "Association for Computational Linguistics",
	url = "https://aclanthology.org/2023.woah-1.2",
	pages = "14--28",
	abstract = "Over the past few years, much research has been conducted to identify and regulate toxic language. However, few studies have addressed a broader range of sensitive texts that are not necessarily overtly toxic. In this paper, we introduce and define a new category of sensitive text called {``}delicate text.{''} We provide the taxonomy of delicate text and present a detailed annotation scheme. We annotate DeTexD, the first benchmark dataset for delicate text detection. The significance of the difference in the definitions is highlighted by the relative performance deltas between models trained each definitions and corpora and evaluated on the other. We make publicly available the DeTexD Benchmark dataset, annotation guidelines, and baseline model for delicate text detection.",
	}
	```