4 3 3

Dayiheng Liu

Losin94

AI & ML interests

None yet

Recent Activity

authored a paper 14 days ago

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

authored a paper 14 days ago

ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training

authored a paper 14 days ago

OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models

View all activity

Organizations

Losin94's activity

authored 10 papers 14 days ago

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Paper • 2310.05492 • Published Oct 9, 2023 • 2

Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation

Paper • 2106.06125 • Published Jun 11, 2021

PolyLM: An Open Source Polyglot Large Language Model

Paper • 2307.06018 • Published Jul 12, 2023 • 25

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

Paper • 2406.14024 • Published Jun 20, 2024

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Paper • 2409.12122 • Published Sep 18, 2024 • 3

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published Oct 31, 2024 • 17

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 14 days ago • 334

upvoted a paper 14 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 14 days ago • 334

authored a paper 23 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 25 days ago • 72

upvoted a paper 24 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 25 days ago • 72

authored 2 papers 4 months ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 75

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 138

updated a collection 4 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 9 items • Updated Nov 28, 2024 • 58

authored a paper 4 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 71