RIMA HAZRA's picture

3 8 2

RIMA HAZRA

rimahazra

https://sites.google.com/view/rima-hazra

AI & ML interests

AI and Safety, AI Hallucinations, Natural Language Processing, Information Retrieval, Large Language Models.

Organizations

rimahazra's activity

New activity in llava-hf/llava-v1.6-mistral-7b-hf 2 months ago

PLZ!😭When I run the template, I get the error“Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained."

#26 opened 4 months ago by

commented 4 papers 5 months ago

Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

Paper • 2406.11801 • Published Jun 17 • 15 •

SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

Paper • 2406.12274 • Published Jun 18 • 14 •

SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

Paper • 2406.12274 • Published Jun 18 • 14 •

Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

Paper • 2406.11801 • Published Jun 17 • 15 •