add support for batch multimodal understanding fd16886 verified ryanzhangfan commited on Oct 17, 2024