1 10 5

LLLeo Li

LLLeo612

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

upvoted a paper 1 day ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

upvoted a paper 3 days ago

Thus Spake Long-Context Large Language Model

View all activity

Organizations

LLLeo612's activity

authored a paper 1 day ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published 5 days ago • 5

upvoted a paper 1 day ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published 5 days ago • 5

upvoted a paper 3 days ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 4 days ago • 63

liked a model about 2 months ago

Goodfire/Llama-3.1-8B-Instruct-SAE-l19

Updated Jan 11 • 21 • 35

New activity in SafeMTData/SafeMTData 3 months ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

authored a paper 3 months ago

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Paper • 2411.19939 • Published Nov 29, 2024 • 10

upvoted 2 papers 3 months ago

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Paper • 2411.19939 • Published Nov 29, 2024 • 10

Multimodal Situational Safety

Paper • 2410.06172 • Published Oct 8, 2024 • 10

upvoted a paper 6 months ago

Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 65

upvoted a collection 7 months ago

Gemma Scope Release

Collection

A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Dec 13, 2024 • 17

upvoted 2 papers 7 months ago

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 27

Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders

Paper • 2407.14435 • Published Jul 19, 2024 • 7

updated a collection 7 months ago

unlearning

Collection

2 items • Updated Jul 22, 2024

upvoted a paper 8 months ago

Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

Paper • 2407.10058 • Published Jul 14, 2024 • 31

liked a model 10 months ago

meta-llama/Meta-Llama-3-8B

Text Generation • Updated Sep 27, 2024 • 448k • 6.06k

liked a model 11 months ago

BAAI/bge-m3

upvoted a collection 12 months ago

RAG

Collection

122 items • Updated Sep 13, 2024 • 19

liked a dataset 12 months ago

nyu-mll/glue

Viewer • Updated Jan 30, 2024 • 1.49M • 187k • 391