VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Paper • 2403.08764 • Published Mar 13 • 34
Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations Paper • 2403.09704 • Published Mar 8 • 31
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling Paper • 2403.03234 • Published Mar 5 • 11
Model Weight Theft With Just Noise Inputs: The Curious Case of the Petulant Attacker Paper • 1912.08987 • Published Dec 19, 2019 • 1
Taming Mode Collapse in Score Distillation for Text-to-3D Generation Paper • 2401.00909 • Published Dec 31, 2023 • 9
LLaMA Beyond English: An Empirical Study on Language Capability Transfer Paper • 2401.01055 • Published Jan 2 • 53
Q-Refine: A Perceptual Quality Refiner for AI-Generated Image Paper • 2401.01117 • Published Jan 2 • 8
UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Paper • 2312.13285 • Published Dec 20, 2023 • 5
Splatter Image: Ultra-Fast Single-View 3D Reconstruction Paper • 2312.13150 • Published Dec 20, 2023 • 14
Cascade Speculative Drafting for Even Faster LLM Inference Paper • 2312.11462 • Published Dec 18, 2023 • 8
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis Paper • 2310.00426 • Published Sep 30, 2023 • 61
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models Paper • 2311.05997 • Published Nov 10, 2023 • 36