When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Paper • 2411.13476 • Published 2 days ago • 9
ORID: Organ-Regional Information Driven Framework for Radiology Report Generation Paper • 2411.13025 • Published 2 days ago • 2
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory Paper • 2411.11922 • Published 4 days ago • 12
Stylecodes: Encoding Stylistic Information For Image Generation Paper • 2411.12811 • Published 3 days ago • 6
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations Paper • 2411.10818 • Published 6 days ago • 18
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages Paper • 2411.12240 • Published 3 days ago • 5
Building Trust: Foundations of Security, Safety and Transparency in AI Paper • 2411.12275 • Published 3 days ago • 10
SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning Paper • 2411.10161 • Published 7 days ago • 6
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements Paper • 2411.12044 • Published 4 days ago • 12
Continuous Speculative Decoding for Autoregressive Image Generation Paper • 2411.11925 • Published 4 days ago • 13
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 3 days ago • 40
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing Paper • 2411.11045 • Published 5 days ago • 8
SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers Paper • 2411.10510 • Published 7 days ago • 8
AnimateAnything: Consistent and Controllable Animation for Video Generation Paper • 2411.10836 • Published 6 days ago • 18
MARS: Unleashing the Power of Variance Reduction for Training Large Models Paper • 2411.10438 • Published 7 days ago • 11
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Paper • 2411.06558 • Published 12 days ago • 29