VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection Paper • 2411.14794 • Published Nov 22, 2024 • 12
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection Paper • 2411.14794 • Published Nov 22, 2024 • 12 • 3
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection Paper • 2411.14794 • Published Nov 22, 2024 • 12
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free Paper • 2410.10814 • Published Oct 14, 2024 • 49
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More Paper • 2410.06270 • Published Oct 8, 2024 • 1
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More Paper • 2410.06270 • Published Oct 8, 2024 • 1
Can OOD Object Detectors Learn from Foundation Models? Paper • 2409.05162 • Published Sep 8, 2024 • 6
SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix Paper • 2407.00367 • Published Jun 29, 2024 • 9
What Matters in Detecting AI-Generated Videos like Sora? Paper • 2406.19568 • Published Jun 27, 2024 • 13
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models Paper • 2405.14917 • Published May 23, 2024 • 1
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models Paper • 2405.14917 • Published May 23, 2024 • 1
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published Apr 22, 2024 • 44