MedOmni_DPO / README.md
Hongyu Zhou
update
149c7c3
## DPO dataset construction
```
# Make sure you have mimic files, which should be like MedOmni_DPO/data/xray/dpo/images/physionet.org
cd MedOmni_DPO/data/xray/dpo/utils
python process_mask.py # Mask DPO from https://arxiv.org/abs/2406.06496
cd MedOmni_DPO/utils
python build_dataset.py
```
You will have two JSON files `train.json` and `val.json` in `MedOmni_DPO/json_files/medinterp/`. If you prefer not to create the dataset yourself, we've already provided these two JSON files.
## Finetuning MedVersa checkpoint
```
sh run.sh
```
## Evaluation
```
sh eval.sh # Need to specify --epoch-id and --model-id (medomni/output/medomni_v0)
```