zongzhuofan
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,61 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
tags:
|
4 |
+
- image-segmentation
|
5 |
+
- vision
|
6 |
+
datasets:
|
7 |
+
- coco
|
8 |
+
---
|
9 |
+
|
10 |
+
# DETRs with Collaborative Hybrid Assignments Training
|
11 |
+
|
12 |
+
## Introduction
|
13 |
+
|
14 |
+
In this paper, we present a novel collaborative hybrid assignments training scheme, namely Co-DETR, to learn more efficient and effective DETR-based detectors from versatile label assignment manners.
|
15 |
+
1. **Encoder optimization**: The proposed training scheme can easily enhance the encoder's learning ability in end-to-end detectors by training multiple parallel auxiliary heads supervised by one-to-many label assignments.
|
16 |
+
2. **Decoder optimization**: We conduct extra customized positive queries by extracting the positive coordinates from these auxiliary heads to improve attention learning of the decoder.
|
17 |
+
3. **State-of-the-art performance**: Co-DETR with ViT-Large (304M parameters) is **the first model to achieve 66.0 AP on COCO test-dev.**
|
18 |
+
|
19 |
+
## Model Zoo
|
20 |
+
|
21 |
+
| Model | Backbone | Aug | Dataset | box AP (val) | box AP (minival) |
|
22 |
+
| --- | --- | --- | --- | --- | --- |
|
23 |
+
| Co-DETR | ViT-L | LSJ | LVIS | 68.0 | 72.0 |
|
24 |
+
|
25 |
+
## How to use
|
26 |
+
|
27 |
+
We implement Co-DETR using [MMDetection V2.25.3](https://github.com/open-mmlab/mmdetection/releases/tag/v2.25.3) and [MMCV V1.5.0](https://github.com/open-mmlab/mmcv/releases/tag/v1.5.0). Please refer to our [github repo](https://github.com/Sense-X/Co-DETR/tree/main) for more details.
|
28 |
+
|
29 |
+
### Training
|
30 |
+
Train Co-Deformable-DETR + ResNet-50 with 8 GPUs:
|
31 |
+
```shell
|
32 |
+
sh tools/dist_train.sh projects/configs/co_deformable_detr/co_deformable_detr_r50_1x_coco.py 8 path_to_exp
|
33 |
+
```
|
34 |
+
Train using slurm:
|
35 |
+
```shell
|
36 |
+
sh tools/slurm_train.sh partition job_name projects/configs/co_deformable_detr/co_deformable_detr_r50_1x_coco.py path_to_exp
|
37 |
+
```
|
38 |
+
|
39 |
+
### Testing
|
40 |
+
Test Co-Deformable-DETR + ResNet-50 with 8 GPUs, and evaluate:
|
41 |
+
```shell
|
42 |
+
sh tools/dist_test.sh projects/configs/co_deformable_detr/co_deformable_detr_r50_1x_coco.py path_to_checkpoint 8 --eval bbox
|
43 |
+
```
|
44 |
+
Test using slurm:
|
45 |
+
```shell
|
46 |
+
sh tools/slurm_test.sh partition job_name projects/configs/co_deformable_detr/co_deformable_detr_r50_1x_coco.py path_to_checkpoint --eval bbox
|
47 |
+
```
|
48 |
+
|
49 |
+
## Cite Co-DETR
|
50 |
+
|
51 |
+
If you find this repository useful, please use the following BibTeX entry for citation.
|
52 |
+
|
53 |
+
```latex
|
54 |
+
@inproceedings{zong2023detrs,
|
55 |
+
title={Detrs with collaborative hybrid assignments training},
|
56 |
+
author={Zong, Zhuofan and Song, Guanglu and Liu, Yu},
|
57 |
+
booktitle={Proceedings of the IEEE/CVF international conference on computer vision},
|
58 |
+
pages={6748--6758},
|
59 |
+
year={2023}
|
60 |
+
}
|
61 |
+
```
|