UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation Paper • 2308.07732 • Published Aug 15, 2023 • 2
GiT: Towards Generalist Vision Transformer through Universal Language Interface Paper • 2403.09394 • Published Mar 14 • 25