Guanzhou Ke

guanzhouk

Guanzhou-Ke

AI & ML interests

Multi-modal learning

Recent Activity

upvoted a paper 8 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

upvoted a paper 8 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

upvoted a paper 23 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Organizations

None yet

guanzhouk's activity

upvoted 2 papers 8 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 12 days ago • 283

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 13 days ago • 82

upvoted 3 papers 23 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published 26 days ago • 90

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 26 days ago • 251

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published about 1 month ago • 90

upvoted 4 papers about 1 month ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 37

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 45

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

upvoted 7 papers about 2 months ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 41

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 91

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

Analyzing The Language of Visual Tokens

Paper • 2411.05001 • Published Nov 7, 2024 • 23

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 139

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 94

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 45

liked a model about 2 months ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 610k • 1.83k

upvoted 2 papers about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 129

liked a model about 2 months ago

jacklishufan/OmniFlow-v0.5

Any-to-Any • Updated 24 days ago • 2