---
license: apache-2.0
base_model:
- lmsys/vicuna-7b-v1.1
---

# RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

## Model Description

- **Repository:** [RoomTour3D](https://roomtour3d.github.io/)
- **Paper:** [2412.08591](https://arxiv.org/abs/2412.08591)
- **Point of Contact:** [Mingfei Han](mailto:hmf282@gmail.com)

This repository contains the pre-trained and fine-tuned checkpoints for RoomTour3D-NaviLLM. To use these models, follow the instructions and license terms in the [RoomTour3D-NaviLLM GitHub repository](https://github.com/roomtour3d/roomtour3d-NaviLLM).

---

## Citation <a name="citation"></a>

If you find our work useful for your research, please consider citing the paper:

```
@article{han2024roomtour3d,
  title={RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation},
  author={Mingfei Han and Liang Ma and Kamila Zhumakhanova and Ekaterina Radionova and Jingyi Zhang and Xiaojun Chang and Xiaodan Liang and Ivan Laptev},
  journal={arXiv preprint arXiv:2412.08591},
  year={2024}
}
```