Wenyue Hua's picture

2 7 2

Wenyue Hua

wenyueH

·

https://wenyueh.github.io/

AI & ML interests

LLM-based agent, LLM reasoning

Recent Activity

authored a paper 2 months ago

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

upvoted a paper 2 months ago

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

commented on a paper 2 months ago

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

View all activity

Organizations

None yet

Articles 1

Article

4

NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates

Papers 5

arxiv:2412.08972

arxiv:2401.04925

arxiv:2305.06569

arxiv:2304.04370

models

None public yet

datasets

None public yet