Henning Bartsch's picture

Henning Bartsch

HenningBlue

·

Curlykonda

AI & ML interests

AI Safety, NLP, vision-language models, safety evals

Organizations

None yet

HenningBlue's activity

upvoted a paper 6 months ago

On scalable oversight with weak LLMs judging strong LLMs

Paper • 2407.04622 • Published Jul 5, 2024 • 11

upvoted 5 papers 7 months ago

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 29

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 53

NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

Paper • 2406.04520 • Published Jun 6, 2024 • 11

Self-Improving Robust Preference Optimization

Paper • 2406.01660 • Published Jun 3, 2024 • 18

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 44

upvoted an article 7 months ago

Article

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

May 24, 2024

• 21

upvoted 3 papers 8 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 149

Imp: Highly Capable Large Multimodal Models for Mobile Devices

Paper • 2405.12107 • Published May 20, 2024 • 26

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 101