Patrick Haller PRO

PatrickHaller

HallerPatrick

AI & ML interests

NLP, Language Models, Autoregressive Models

Recent Activity

upvoted a collection 25 days ago

fuck quadratic attention

updated a model about 1 month ago

PatrickHaller/hgrn2_pile_10M_distill_babylm

updated a model about 1 month ago

PatrickHaller/hgrn2_pile_100m_distill_babylm

Organizations

Posts 1

How Robust Is Your Model in Complex Code Generation Tasks? 🤔

We've launched the PECC benchmark to challenge chat models in code generation, drawing from Advent of Code for programming tasks and Project Euler for math-heavy challenges. The benchmark presents problems in both detailed prose and concise "leet code" styles, evaluating how well models understand and solve complex coding and math problems in chat-based interactions.

It seems that the Claude 3 models outperform ChatGPT:
Model / Avg. (pass@3)
Claude 3 Haiku / 27.67
GPT-3.5-Turbo / 23.75
Mixtral-8x22B-Instruct-v0.1 / 8.35
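
For readers unfamiliar with the metric, here is a minimal sketch of how a pass@3 score could be computed. It assumes the simplest reading of pass@k, where a problem counts as solved if at least one of the first k attempts passes; the function name `pass_at_k` and the sample outcomes are hypothetical and are not taken from the PECC code or results.

```python
from typing import Sequence


def pass_at_k(attempts: Sequence[Sequence[bool]], k: int = 3) -> float:
    """Percentage of problems solved by at least one of the first k attempts.

    Assumption: each inner sequence holds the pass/fail outcome of every
    attempt on one problem, in the order the attempts were made.
    """
    if not attempts:
        raise ValueError("need at least one problem")
    solved = sum(1 for results in attempts if any(results[:k]))
    return 100.0 * solved / len(attempts)


# Hypothetical outcomes for four problems, three attempts each.
example = [
    [False, True, False],   # solved on the second attempt
    [False, False, False],  # never solved
    [True, True, True],     # solved on every attempt
    [False, False, False],  # never solved
]

print(pass_at_k(example, k=3))
```

Under this reading, a score such as 27.67 would mean that roughly 28% of problems were solved within three attempts.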

Read our Preprint📃: PECC: Problem Extraction and Coding Challenges (2404.18766)
Look at the dataset🔎: PatrickHaller/pecc

We also got accepted at LREC-COLING '24 🎉

Collections 4

Papers 5

arxiv:2404.18766

arxiv:2309.09582

arxiv:2309.03876

arxiv:2211.05100

Spaces 1

Pecc Leaderboard

Models 23

PatrickHaller/hgrn2_pile_10M_distill_babylm

Updated about 1 month ago • 3.38k

PatrickHaller/hgrn2_pile_100m_distill_babylm

Text Generation • Updated Dec 17, 2024 • 3.38k

PatrickHaller/babylm_transformer_strict_small_comparison

Text Generation • Updated Oct 9, 2024 • 166

PatrickHaller/hgrn2_de_wiki

Text Generation • Updated Sep 30, 2024 • 13

PatrickHaller/xlstm_pile_10m

Updated Sep 17, 2024 • 1 • 1

PatrickHaller/hgrn2_pile_10m

Updated Sep 17, 2024 • 3

PatrickHaller/llama_pile_10m_babylm

Text Generation • Updated Sep 17, 2024 • 6

PatrickHaller/hgrn2_pile_100m

Updated Sep 10, 2024 • 1

PatrickHaller/hgrn2_pile_100m_ckpt_6

Updated Sep 10, 2024

PatrickHaller/hgrn2_pile_10m_distill

Updated Sep 10, 2024 • 4

Datasets 13

PatrickHaller/cosmopedia-v2-1B

Viewer • Updated Dec 5, 2024 • 1.39M • 40

PatrickHaller/pile-10M-words

Viewer • Updated Oct 9, 2024 • 40.3k • 43 • 1

PatrickHaller/pile-100M-words

Viewer • Updated Oct 9, 2024 • 403k • 221 • 1

PatrickHaller/wiki-and-book-corpus-1B

Viewer • Updated Aug 23, 2024 • 31.3M • 36 • 1

PatrickHaller/wiki-and-book-corpus-500M

Viewer • Updated Aug 23, 2024 • 15.7M • 33 • 1

PatrickHaller/wiki-and-book-corpus-100M

Viewer • Updated Aug 23, 2024 • 3.13M • 30

PatrickHaller/wiki-and-book-corpus-10M

Viewer • Updated Aug 23, 2024 • 313k • 27

PatrickHaller/wiki-and-book-corpus-1M

Viewer • Updated Aug 23, 2024 • 31.1k • 27

PatrickHaller/the-stack-python-1M

Viewer • Updated May 13, 2024 • 1M • 61

PatrickHaller/the-stack-python-100k

Viewer • Updated May 13, 2024 • 100k • 40