SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 12 days ago • 101
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17, 2024 • 49