yao teng

tytyt

https://tyshiwo1.github.io/

tyshiwo1

Recent Activity

published a model about 1 month ago

tytyt/t2i_sft_offload-1data-drop-tunenlp-noaccum-ar

liked a Space about 1 month ago

Kaiyue/T2V-CompBench_Leaderboard

upvoted a paper about 2 months ago

GameFactory: Creating New Games with Generative Interactive Videos

tytyt's activity

upvoted a paper about 2 months ago

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14 • 64

upvoted a paper 2 months ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

upvoted 2 papers 3 months ago

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 23

GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Paper • 2412.04440 • Published Dec 5, 2024 • 20

upvoted 2 papers 4 months ago

SAMPart3D: Segment Any Part in 3D Objects

Paper • 2411.07184 • Published Nov 11, 2024 • 26

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 53

upvoted 4 papers 5 months ago

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Paper • 2410.10816 • Published Oct 14, 2024 • 21

SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Paper • 2308.09244 • Published Aug 18, 2023 • 2

Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

Paper • 2410.01699 • Published Oct 2, 2024 • 18

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 36

upvoted 2 papers 7 months ago

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Paper • 2405.14224 • Published May 23, 2024 • 16

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

Paper • 2407.14505 • Published Jul 19, 2024 • 26

upvoted a paper over 1 year ago

Merlin: Empowering Multimodal LLMs with Foresight Minds

Paper • 2312.00589 • Published Nov 30, 2023 • 27