wyf

wyf23187

AI & ML interests

NLP

Recent Activity

authored a paper 19 days ago

Breaking Focus: Contextual Distraction Curse in Large Language Models

authored a paper 19 days ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

authored a paper 19 days ago

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

Organizations

None yet

wyf23187's activity

authored 4 papers 19 days ago

Breaking Focus: Contextual Distraction Curse in Large Language Models

Paper • 2502.01609 • Published Feb 3 • 1

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published 20 days ago • 46

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

Paper • 2410.02736 • Published Oct 3, 2024

AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?

Paper • 2410.21259 • Published Oct 28, 2024 • 1

upvoted a paper 19 days ago

Breaking Focus: Contextual Distraction Curse in Large Language Models

Paper • 2502.01609 • Published Feb 3 • 1

liked a model 21 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 16 days ago • 3.21M • 11.2k

upvoted a paper about 1 month ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published Feb 3 • 39

liked a dataset 5 months ago

MuskumPillerum/General-Knowledge

Viewer • Updated Oct 15, 2023 • 37.6k • 1.05k • 15