gen-audio ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25 • 56
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25 • 56
lmm mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper • 2403.12895 • Published Mar 19 • 30 MiniCPM-V: A GPT-4V Level MLLM on Your Phone Paper • 2408.01800 • Published Aug 3 • 77 Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23 • 27
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper • 2403.12895 • Published Mar 19 • 30