Zhihui Xie

https://zhxie.site/

zhxieml

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 hours ago

nebius/SWE-bench-extra

liked a dataset about 5 hours ago

open-r1/codeforces-cots

liked a model about 24 hours ago

RekaAI/reka-flash-3

Zhihui's activity

upvoted a paper 29 days ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published about 1 month ago • 47

upvoted a collection 29 days ago

UI Agent

Collection

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robots • 312 items • Updated about 5 hours ago • 47

upvoted a paper 29 days ago

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

upvoted a paper 2 months ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 82

upvoted a paper 3 months ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 43

upvoted a paper 4 months ago

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Paper • 2411.17451 • Published Nov 26, 2024 • 11

upvoted a paper 6 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 62

upvoted 2 papers 8 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Paper • 2404.12387 • Published Apr 18, 2024 • 39

upvoted a paper 9 months ago

Jailbreaking as a Reward Misspecification Problem

Paper • 2406.14393 • Published Jun 20, 2024 • 13

upvoted an article 9 months ago

Putting RL back in RLHF

Article • Published Jun 12, 2024 • 84

upvoted 2 papers 9 months ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 10

Calibrating Reasoning in Language Models with Internal Consistency

Paper • 2405.18711 • Published May 29, 2024 • 6

upvoted a collection 9 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 104 items • Updated 7 days ago • 97

upvoted a paper about 1 year ago

Silkie: Preference Distillation for Large Visual Language Models

Paper • 2312.10665 • Published Dec 17, 2023 • 11