chfeng

university

https://github.com/cfeng16

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

chfeng authored a paper about 2 months ago

GPS as a Control Signal for Image Generation

russwang authored a paper 3 months ago

Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

russwang authored a paper 5 months ago

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

View all activity

chfeng123's activity

chfeng

authored a paper about 2 months ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 12

russwang

authored a paper 3 months ago

Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

Paper • 2412.03704 • Published Dec 4, 2024 • 7

russwang

authored 2 papers 5 months ago

Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning

Paper • 2410.06508 • Published Oct 9, 2024 • 10

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 35

chfeng

authored 5 papers about 1 year ago

AVA-AVD: Audio-Visual Speaker Diarization in the Wild

Paper • 2111.14448 • Published Nov 29, 2021

Self-Supervised Video Forensics by Audio-Visual Anomaly Detection

Paper • 2301.01767 • Published Jan 4, 2023

Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs

Paper • 2309.03118 • Published Sep 6, 2023 • 2

Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning

Paper • 2402.11690 • Published Feb 18, 2024 • 10

Binding Touch to Everything: Learning Unified Multimodal Tactile Representations

Paper • 2401.18084 • Published Jan 31, 2024

russwang

authored 7 papers about 1 year ago

Premier-TACO: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Paper • 2402.06187 • Published Feb 9, 2024 • 11

Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy

Paper • 2207.12141 • Published Jul 25, 2022

TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

Paper • 2306.13229 • Published Jun 22, 2023 • 3

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

Paper • 2310.19668 • Published Oct 30, 2023 • 3

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Paper • 2401.10529 • Published Jan 19, 2024 • 1

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Paper • 2310.07220 • Published Oct 11, 2023 • 1

Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function

Paper • 2302.01244 • Published Feb 2, 2023

AI & ML interests

Recent Activity

Team members 2

chfeng123's activity