Daniel De Leon

daniel-de-leon

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Intel's profile picture Blog-explorers's profile picture test 's profile picture

daniel-de-leon's activity

posted an update 2 months ago
view post
Post
2403
As the rapid adoption of chat bots and QandA models continues, so do the concerns for their reliability and safety. In response to this, many state-of-the-art models are being tuned to act as Safety Guardrails to protect against malicious usage and avoid undesired, harmful output. I published a Hugging Face blog introducing a simple, proof-of-concept, RoBERTa-based LLM that my team and I finetuned to detect toxic prompt inputs into chat-style LLMs. The article explores some of the tradeoffs of fine-tuning larger decoder vs. smaller encoder models and asks the question if "simpler is better" in the arena of toxic prompt detection.

๐Ÿ”— to blog: https://huggingface.co/blog/daniel-de-leon/toxic-prompt-roberta
๐Ÿ”— to model: Intel/toxic-prompt-roberta
๐Ÿ”— to OPEA microservice: https://github.com/opea-project/GenAIComps/tree/main/comps/guardrails/toxicity_detection

A huge thank you to my colleagues that helped contribute: @qgao007 , @mitalipo , @ashahba and Fahim Mohammad
upvoted an article 2 months ago
view article
Article

ยกLanzamiento de la Comunidad Latinoamericana de NLP en Hugging Face! ๐ŸŒŸ

By prudant โ€ข
โ€ข 7
published an article 2 months ago
New activity in Intel/toxic-prompt-roberta 3 months ago
upvoted 2 articles 3 months ago
view article
Article

Fine Tuning a LLM Using Kubernetes with Intelยฎ Gaudiยฎ Accelerator

By omarkhleif โ€ข
โ€ข 7
view article
Article

Fine Tuning a LLM Using Kubernetes with Intelยฎ Xeonยฎ Scalable Processors

By dmsuehir โ€ข
โ€ข 5
upvoted an article 3 months ago
view article
Article

Model Card Generator Interface: Crafting Clear Insights into AI Models

By mitalipo โ€ข
โ€ข 4
New activity in Intel/toxic-prompt-roberta 3 months ago
updated a Space over 1 year ago
New activity in daniel-de-leon/test-docker over 1 year ago