Submitted by akhaliq 42 Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models · 8 authors 4
Submitted by akhaliq 22 ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion · 6 authors 3
Submitted by akhaliq 21 BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text · 11 authors 3
Submitted by akhaliq 17 Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction · 7 authors 2
Submitted by akhaliq 5 FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing · 4 authors 1
Submitted by akhaliq 4 Towards a World-English Language Model for On-Device Virtual Assistants · 6 authors 1