FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance • arXiv:2408.08189 • Published Aug 15, 2024
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models • arXiv:2407.01906 • Published Jul 2, 2024
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output • arXiv:2407.03320 • Published Jul 3, 2024
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models • arXiv:2407.01920 • Published Jul 2, 2024
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation • arXiv:2407.02371 • Published Jul 2, 2024
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning • arXiv:2407.00782 • Published Jun 30, 2024
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning? • arXiv:2407.01284 • Published Jul 1, 2024
Scaling Synthetic Data Creation with 1,000,000,000 Personas • arXiv:2406.20094 • Published Jun 28, 2024
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers • arXiv:2406.16747 • Published Jun 24, 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models • arXiv:2406.16714 • Published Jun 24, 2024
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions • arXiv:2406.15877 • Published Jun 22, 2024
Efficient Continual Pre-training by Mitigating the Stability Gap • arXiv:2406.14833 • Published Jun 21, 2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence • arXiv:2406.11931 • Published Jun 17, 2024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models • arXiv:2406.11831 • Published Jun 17, 2024
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers • arXiv:2406.10163 • Published Jun 14, 2024
In-Context Editing: Learning Knowledge from Self-Induced Distributions • arXiv:2406.11194 • Published Jun 17, 2024
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices • arXiv:2406.08451 • Published Jun 12, 2024
DiTFastAttn: Attention Compression for Diffusion Transformer Models • arXiv:2406.08552 • Published Jun 12, 2024
PowerInfer-2: Fast Large Language Model Inference on a Smartphone • arXiv:2406.06282 • Published Jun 10, 2024
LLaMA Beyond English: An Empirical Study on Language Capability Transfer • arXiv:2401.01055 • Published Jan 2, 2024
YAYI 2: Multilingual Open-Source Large Language Models • arXiv:2312.14862 • Published Dec 22, 2023
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks • arXiv:2312.14238 • Published Dec 21, 2023
LLM in a flash: Efficient Large Language Model Inference with Limited Memory • arXiv:2312.11514 • Published Dec 12, 2023
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU • arXiv:2312.12456 • Published Dec 16, 2023
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models • arXiv:2312.09767 • Published Dec 15, 2023
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit • arXiv:2312.09911 • Published Dec 15, 2023
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism • arXiv:2312.04916 • Published Dec 8, 2023
DreaMoving: A Human Dance Video Generation Framework based on Diffusion Models • arXiv:2312.05107 • Published Dec 8, 2023
OneLLM: One Framework to Align All Modalities with Language • arXiv:2312.03700 • Published Dec 6, 2023
Fine-grained Controllable Video Generation via Object Appearance and Context • arXiv:2312.02919 • Published Dec 5, 2023
LivePhoto: Real Image Animation with Text-guided Motion Control • arXiv:2312.02928 • Published Dec 5, 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts • arXiv:2312.00777 • Published Dec 1, 2023
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models • arXiv:2312.00079 • Published Nov 30, 2023
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter • arXiv:2312.00330 • Published Dec 1, 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline • arXiv:2311.13073 • Published Nov 22, 2023
ChatAnything: Facetime Chat with LLM-Enhanced Personas • arXiv:2311.06772 • Published Nov 12, 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models • arXiv:2311.04145 • Published Nov 7, 2023
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models • arXiv:2310.16795 • Published Oct 25, 2023
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation • arXiv:2309.16653 • Published Sep 28, 2023
Aligning Large Multimodal Models with Factually Augmented RLHF • arXiv:2309.14525 • Published Sep 25, 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models • arXiv:2309.15103 • Published Sep 26, 2023
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models • arXiv:2309.09958 • Published Sep 18, 2023
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training • arXiv:2306.10209 • Published Jun 16, 2023
FLM-101B: An Open LLM and How to Train It with $100K Budget • arXiv:2309.03852 • Published Sep 7, 2023