AK's picture

AK

akhaliq

·

_akhaliq

AI & ML interests

None yet

Recent Activity

upvoted a collection about 7 hours ago

commented on a paper about 7 hours ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

upvoted a collection about 20 hours ago

Gemma 3 Release

View all activity

Organizations

akhaliq's activity

commented a paper about 7 hours ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published about 17 hours ago • 15 •

commented a paper 1 day ago

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published 3 days ago • 22 •

commented 2 papers 2 days ago

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published 3 days ago • 15 •

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published 4 days ago • 20 •

commented 5 papers 3 days ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 6 days ago • 24 •

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published 6 days ago • 24 •

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published 6 days ago • 44 •

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published 7 days ago • 14 •

Learning from Failures in Multi-Attempt Reinforcement Learning

Paper • 2503.04808 • Published 9 days ago • 15 •

commented 5 papers 6 days ago

PokéChamp: an Expert-level Minimax Language Agent

Paper • 2503.04094 • Published 7 days ago • 9 •

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 7 days ago • 76 •

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

Paper • 2503.04606 • Published 7 days ago • 7 •

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 7 days ago • 83 •

Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks

Paper • 2503.04378 • Published 7 days ago • 6 •

commented a paper 7 days ago

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Paper • 2503.03751 • Published 8 days ago • 19 •

commented 3 papers 10 days ago

Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids

Paper • 2502.20396 • Published 14 days ago • 12 •

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Paper • 2502.20811 • Published 13 days ago • 2 •

SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers

Paper • 2502.20545 • Published 14 days ago • 20 •

commented 2 papers 13 days ago

Mobius: Text to Seamless Looping Video Generation via Latent Shift

Paper • 2502.20307 • Published 14 days ago • 17 •

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Paper • 2502.20126 • Published 14 days ago • 20 •