AI & ML interests
Research Demos and Tools for Trustworthy and Safe AI Development and Deployment
Organization Card
- Welcome to TrustSafeAI! We are a research group focusing on evaluating and improving AI safety.
- If you are interested in joining us, please reach out to Pin-Yu Chen.
- Team Members and Projects:
Member | Project | Webpage |
---|---|---|
Xiaomeng Hu | RADAR (NeurIPS'23), Gradient Cuff (NeurIPS'24) | webpage |
Lei Hsiung | NeuralFuse (NeurIPS'24), NCTV (TMLR; IJCAI'23) | webpage |
Zhi-Yi Chin | P4D (ICML'24) | webpage |
Barry Xiong | DPP | - |
Johnson Hung | AttentionTracker | - |
Zaitang Li | GREAT Score (NeurIPS'24) | - |
Yung-Chen Tang | NCTV (TMLR; IJCAI'23), LLM-Physical-Safety | webpage |
Zhiyuan He | BEYOND (ICML'24) | - |
Yujun Zhou | LLM LabSafety | - |
Xiangyu Qi | LLM Finetuning Safety (ICLR'24) | webpage |
Pin-Yu Chen | All (research supervisor) | webpage |
Spaces
Space | Description |
---|---|
Attention Tracker | Prompt Injection Detector |
LLM Physical Safety | LLM benchmark for Physical Safety |
NeuralFuse | Protect Model from Suffering Low-voltage-induced Bit Errors |
NCTV: Neural Clamping Toolkit and Visualization | Model-agnostic Toolkit for Neural Network Calibration |
GREAT Score | - |
GradientCuff-Jailbreak-Defense | Demonstration of Gradient Cuff: A Jailbreak Defense |