Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 110
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 178
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 79
📦 3D creation workflow Collection Going from a text prompt to a nice 3D model • 3 items • Updated 26 days ago • 29
VR-NeRF: High-Fidelity Virtualized Walkable Spaces Paper • 2311.02542 • Published Nov 5, 2023 • 14
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers Paper • 2309.08532 • Published Sep 15, 2023 • 52
Flamingo: a Visual Language Model for Few-Shot Learning Paper • 2204.14198 • Published Apr 29, 2022 • 14