Melih Özcan's picture

29

Melih Özcan

staycoolish

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

upvoted a paper 5 days ago

Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

upvoted a paper 5 days ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

View all activity

Organizations

None yet

staycoolish's activity

upvoted 4 papers 5 days ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published 6 days ago • 32

Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

Paper • 2502.05415 • Published 9 days ago • 20

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published 6 days ago • 49

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 6 days ago • 123

upvoted 2 papers 12 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 13 days ago • 53

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 14 days ago • 175

upvoted a paper 16 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 17 days ago • 53

upvoted 2 papers 19 days ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 22 days ago • 56

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 22 days ago • 57

upvoted a paper 20 days ago

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published 28 days ago • 27

upvoted 3 papers 24 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 26 days ago • 93

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published 25 days ago • 67

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 25 days ago • 319

upvoted a paper 26 days ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 28 days ago • 91

upvoted 2 papers about 1 month ago

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Paper • 2501.08828 • Published Jan 15 • 30

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7 • 53

upvoted a paper about 2 months ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

upvoted 3 papers 3 months ago

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18, 2024 • 16

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 51

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 33