On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published 6 days ago • 39
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 9 days ago • 82
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models Paper • 2410.14059 • Published Oct 17, 2024 • 55
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications Paper • 2408.11878 • Published Aug 20, 2024 • 52
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal Paper • 2406.16864 • Published Jun 24, 2024 • 3
LASA: Instance Reconstruction from Real Scans using A Large-scale Aligned Shape Annotation Dataset Paper • 2312.12418 • Published Dec 19, 2023 • 2
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper • 2406.19280 • Published Jun 27, 2024 • 61
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper • 2406.19280 • Published Jun 27, 2024 • 61
VDC: Versatile Data Cleanser for Detecting Dirty Samples via Visual-Linguistic Inconsistency Paper • 2309.16211 • Published Sep 28, 2023
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit Paper • 2312.09911 • Published Dec 15, 2023 • 53