Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension Paper • 2412.03704 • Published Dec 4, 2024 • 6
Stabilizing RLHF through Advantage Model and Selective Rehearsal Paper • 2309.10202 • Published Sep 18, 2023 • 9
Collaborative decoding of critical tokens for boosting factuality of large language models Paper • 2402.17982 • Published Feb 28, 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Paper • 2404.12253 • Published Apr 18, 2024 • 54
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching Paper • 2406.06326 • Published Jun 10, 2024 • 2
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning Paper • 2407.00617 • Published Jun 30, 2024 • 7
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning Paper • 2410.06508 • Published Oct 9, 2024 • 10
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects Paper • 2410.02730 • Published Oct 3, 2024
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI Paper • 2410.11623 • Published Oct 15, 2024 • 47
HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows Paper • 2409.17433 • Published Sep 25, 2024 • 9
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 96
The Trickle-down Impact of Reward (In-)consistency on RLHF Paper • 2309.16155 • Published Sep 28, 2023