Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 16 days ago • 560
Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues Paper • 2410.10700 • Published Oct 14, 2024 • 2
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher Paper • 2308.06463 • Published Aug 12, 2023 • 1
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis Paper • 2407.13301 • Published Jul 18, 2024 • 56
Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning Paper • 2406.02265 • Published Jun 4, 2024 • 7
RRM: Relightable assets using Radiance guided Material extraction Paper • 2407.06397 • Published Jul 8, 2024 • 5
Characterizing Prompt Compression Methods for Long Context Inference Paper • 2407.08892 • Published Jul 11, 2024 • 11
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models Paper • 2407.09012 • Published Jul 12, 2024 • 10
GAVEL: Generating Games Via Evolution and Language Models Paper • 2407.09388 • Published Jul 12, 2024 • 17
StyleSplat: 3D Object Style Transfer with Gaussian Splatting Paper • 2407.09473 • Published Jul 12, 2024 • 12
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis Paper • 2407.09732 • Published Jul 13, 2024 • 9
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing Paper • 2407.08770 • Published Jul 11, 2024 • 21
MUSCLE: A Model Update Strategy for Compatible LLM Evolution Paper • 2407.09435 • Published Jul 12, 2024 • 23
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers Paper • 2407.09413 • Published Jul 12, 2024 • 11
Toto: Time Series Optimized Transformer for Observability Paper • 2407.07874 • Published Jul 10, 2024 • 32
Human-like Episodic Memory for Infinite Context LLMs Paper • 2407.09450 • Published Jul 12, 2024 • 62