Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper • 2408.15998 • Published Aug 28, 2024 • 86
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models Paper • 2406.10900 • Published Jun 16, 2024 • 11
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models Paper • 2406.10900 • Published Jun 16, 2024 • 11
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models Paper • 2310.14566 • Published Oct 23, 2023 • 26
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models Paper • 2310.14566 • Published Oct 23, 2023 • 26
CrossLoc3D: Aerial-Ground Cross-Source 3D Place Recognition Paper • 2303.17778 • Published Mar 31, 2023 • 2
Forecasting Trajectory and Behavior of Road-Agents Using Spectral Clustering in Graph-LSTMs Paper • 1912.01118 • Published Dec 2, 2019
M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers Paper • 2104.11896 • Published Apr 24, 2021
GANav: Efficient Terrain Segmentation for Robot Navigation in Unstructured Outdoor Environments Paper • 2103.04233 • Published Mar 7, 2021 • 1
iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning Paper • 2306.06236 • Published Jun 9, 2023
VINet: Visual and Inertial-based Terrain Classification and Adaptive Navigation over Unknown Terrain Paper • 2209.07725 • Published Sep 16, 2022
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal Reasoning Paper • 2303.01589 • Published Mar 2, 2023
TNS: Terrain Traversability Mapping and Navigation System for Autonomous Excavators Paper • 2109.06250 • Published Sep 13, 2021
CrossLoc3D: Aerial-Ground Cross-Source 3D Place Recognition Paper • 2303.17778 • Published Mar 31, 2023 • 2