Simone Van Taylor

svannie678

AI & ML interests

None yet

Organizations

None yet

svannie678's activity

liked a dataset 4 months ago

lmsys/lmsys-chat-1m

Viewer • Updated Jul 27, 2024 • 1M • 1.93k • 631

updated a model 5 months ago

svannie678/hi_algorithmic_bias_bounty_submission

Updated Sep 29, 2024

updated 2 datasets 5 months ago

svannie678/red_team_repo_social_bias_dataset_information

Viewer • Updated Sep 29, 2024 • 153 • 44

svannie678/red_team_repo_social_bias_prompts

Viewer • Updated Sep 29, 2024 • 40.3k • 66 • 1

upvoted a paper 5 months ago

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

Paper • 2308.03825 • Published Aug 7, 2023 • 2

liked a dataset 5 months ago

LibrAI/do-not-answer

Viewer • Updated Aug 28, 2023 • 939 • 1.21k • 31

liked 5 datasets 6 months ago

liked a Space 6 months ago

Redteaming Resistance Leaderboard

Display model benchmark results

upvoted an article 6 months ago

Article

Introducing the Red-Teaming Resistance Leaderboard

Feb 23, 2024

• 13

upvoted an article 7 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 158

liked a model about 1 year ago

thenlper/gte-base

upvoted a collection about 1 year ago

Awesome RLHF

Collection

A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF). • 11 items • Updated Oct 2, 2023 • 7