Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation Paper • 2406.16678 • Published 2 days ago • 5 • 2
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models Paper • 2406.15718 • Published 5 days ago • 11 • 2
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far? Paper • 2406.16772 • Published 2 days ago • 3 • 2
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models Paper • 2406.16714 • Published 2 days ago • 8 • 2
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published 2 days ago • 31 • 3
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published 4 days ago • 35 • 8
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework Paper • 2406.14783 • Published 6 days ago • 12 • 2
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution Paper • 2406.13457 • Published 7 days ago • 12 • 2
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models Paper • 2406.14599 • Published 6 days ago • 14 • 2
LiveMind: Low-latency Large Language Models with Simultaneous Inference Paper • 2406.14319 • Published 6 days ago • 13 • 4
$\nabla^2$DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials Paper • 2406.14347 • Published 6 days ago • 93 • 4
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published 9 days ago • 12 • 4
Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations Paper • 2406.11801 • Published 9 days ago • 15 • 4
Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation Paper • 2406.12849 • Published 8 days ago • 45 • 2
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization Paper • 2406.11431 • Published 9 days ago • 4 • 2
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Paper • 2406.12753 • Published 8 days ago • 14 • 2
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers Paper • 2406.10163 • Published 12 days ago • 23 • 2
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Paper • 2406.12793 • Published 8 days ago • 26 • 2
VoCo-LLaMA: Towards Vision Compression with Large Language Models Paper • 2406.12275 • Published 8 days ago • 28 • 10
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models Paper • 2406.11831 • Published 9 days ago • 17 • 4
GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors Paper • 2406.10111 • Published 12 days ago • 6 • 2
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published 14 days ago • 14 • 3
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Paper • 2406.08587 • Published 14 days ago • 14 • 4
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts Paper • 2406.09162 • Published 13 days ago • 13 • 3
Explore the Limits of Omni-modal Pretraining at Scale Paper • 2406.09412 • Published 13 days ago • 10 • 3
mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus Paper • 2406.08707 • Published 14 days ago • 14 • 4
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published 13 days ago • 17 • 2
PowerInfer-2: Fast Large Language Model Inference on a Smartphone Paper • 2406.06282 • Published 16 days ago • 34 • 5
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters Paper • 2406.05955 • Published 17 days ago • 21 • 2
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination Paper • 2406.05132 • Published 19 days ago • 27 • 2
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models Paper • 2406.06563 • Published 24 days ago • 17 • 10
4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Paper • 2406.07472 • Published 15 days ago • 10 • 3
Simple and Effective Masked Diffusion Language Models Paper • 2406.07524 • Published 15 days ago • 7 • 2
Learning Temporally Consistent Video Depth from Video Diffusion Priors Paper • 2406.01493 • Published 23 days ago • 17 • 2
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper • 2406.06525 • Published 16 days ago • 60 • 3
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Paper • 2406.06216 • Published 16 days ago • 16 • 5