kaizuberbuehler
's Collections
Code Generation
updated
CodeEditorBench: Evaluating Code Editing Capability of Large Language
Models
Paper
•
2404.03543
•
Published
•
15
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Paper
•
2406.11931
•
Published
•
58
AppWorld: A Controllable World of Apps and People for Benchmarking
Interactive Coding Agents
Paper
•
2407.18901
•
Published
•
33
Diversity Empowers Intelligence: Integrating Expertise of Software
Engineering Agents
Paper
•
2408.07060
•
Published
•
41
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java
Paper
•
2408.14354
•
Published
•
41
FuzzCoder: Byte-level Fuzzing Test via Large Language Model
Paper
•
2409.01944
•
Published
•
45
Qwen2.5-Coder Technical Report
Paper
•
2409.12186
•
Published
•
139
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks
at Scale
Paper
•
2409.16299
•
Published
•
11
CodeElo: Benchmarking Competition-level Code Generation of LLMs with
Human-comparable Elo Ratings
Paper
•
2501.01257
•
Published
•
40
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on
Self-invoking Code Generation
Paper
•
2412.21199
•
Published
•
9
Outcome-Refining Process Supervision for Code Generation
Paper
•
2412.15118
•
Published
•
19
o1-Coder: an o1 Replication for Coding
Paper
•
2412.00154
•
Published
•
42
CodeDPO: Aligning Code Models with Self Generated and Verified Source
Code
Paper
•
2410.05605
•
Published
•
1
Enhancing LLM Agents for Code Generation with Possibility and Pass-rate
Prioritized Experience Replay
Paper
•
2410.12236
•
Published
•
1
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper
•
2411.04905
•
Published
•
113