Max Ku's picture

Max Ku

vinesmsuic

·

https://kuwingfung.github.io/

AI & ML interests

Computer Vision, Deep Image Synthesis

Recent Activity

updated a Space 13 days ago

TIGER-Lab/GenAI-Arena

published a dataset 28 days ago

vinesmsuic/ProteinDisAngleWeights

updated a Space about 1 month ago

TIGER-Lab/GenAI-Arena

View all activity

Organizations

vinesmsuic's activity

upvoted 2 papers 3 months ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 26

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11, 2024 • 47

upvoted a paper 5 months ago

CCEdit: Creative and Controllable Video Editing via Diffusion Models

Paper • 2309.16496 • Published Sep 28, 2023 • 9

upvoted a paper 6 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 118

upvoted 3 papers 8 months ago

MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Paper • 2406.15252 • Published Jun 21, 2024 • 16

VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation

Paper • 2312.14867 • Published Dec 22, 2023 • 1

GenAI Arena: An Open Evaluation Platform for Generative Models

Paper • 2406.04485 • Published Jun 6, 2024 • 21

upvoted 3 papers 9 months ago

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 45

I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models

Paper • 2405.16537 • Published May 26, 2024 • 16

MANTIS: Interleaved Multi-Image Instruction Tuning

Paper • 2405.01483 • Published May 2, 2024 • 6

upvoted 4 papers 11 months ago

Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2, 2024 • 36

DreamEdit: Subject-driven Image Editing

Paper • 2306.12624 • Published Jun 22, 2023 • 1

AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

Paper • 2403.14468 • Published Mar 21, 2024 • 24

ImagenHub: Standardizing the evaluation of conditional image generation models

Paper • 2310.01596 • Published Oct 2, 2023 • 19

upvoted a paper about 1 year ago

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Paper • 2401.01614 • Published Jan 3, 2024 • 22

upvoted a paper over 1 year ago

TheoremQA: A Theorem-driven Question Answering dataset

Paper • 2305.12524 • Published May 21, 2023 • 1