150 15 33

Raushan Turganbay

RaushanTurganbay

zucchini-nlp

AI & ML interests

Generation and Multimodality

Recent Activity

liked a model 17 days ago

chenjoya/videollm-online-8b-v1plus

new activity 17 days ago

RaushanTurganbay/llava-onevision:Incomplete generation results on the pancake example?

liked a model 20 days ago

facebook/watermark-anything

View all activity

Articles

Introducing SynthID Text

Oct 23, 2024

• 39

Unlocking Longer Generation with Key-Value Cache Quantization

May 16, 2024

• 37

Organizations

RaushanTurganbay's activity

upvoted a paper 2 months ago

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 25

upvoted an article 3 months ago

Article

Saving Memory Using Padding-Free Transformer Layers during Finetuning

•

Jun 11, 2024

• 15

upvoted a collection 3 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated Nov 27, 2024 • 289

upvoted an article 4 months ago

Article

Key Insights into the Law of Vision Representations in MLLMs

•

Sep 2, 2024

• 18

upvoted a paper 4 months ago

Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published Sep 6, 2024 • 23

upvoted a collection 4 months ago

Vision Language Models Papers 🖼️💬📝

Collection

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 34

upvoted an article 5 months ago

Article

Introduction to ggml

Aug 13, 2024

• 124

upvoted 4 papers 5 months ago

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

Paper • 2408.04840 • Published Aug 9, 2024 • 32

upvoted 2 papers 6 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 160

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Paper • 2407.06189 • Published Jul 8, 2024 • 26

upvoted an article 7 months ago

Article

AI has a problem with objectifying women

•

May 24, 2024

• 55

upvoted a paper 11 months ago

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12, 2024 • 57