Multimodal agents (robotics)
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent • Paper • 2402.09844 • Published Feb 15 • 20
HuggingFaceM4/idefics2-8b • Image-Text-to-Text • Updated Jul 30 • 53.4k • 572
VIMA/VIMA • Updated Jun 20, 2023 • 13
rail-berkeley/octo-base • Robotics • Updated Dec 14, 2023 • 121 • 19

Robotics stack
openai/whisper-base • Automatic Speech Recognition • Updated Feb 29 • 430k • 181
HuggingFaceM4/idefics2-8b-AWQ • Image-Text-to-Text • Updated May 6 • 584 • 26
parler-tts/parler_tts_mini_v0.1 • Text-to-Speech • Updated Apr 30 • 26.8k • 344
dora-rs/dora-idefics2 • Updated May 5 • 2 • 5