OmniVision-968M: a new local VLM for edge devices, fast & small but performant!
- a vision-language model with 9x fewer image tokens, super efficient
- aligned with DPO to reduce hallucinations
- Apache 2.0 license
Introducing TraVisionLM: Turkish Visual Language Model - The First of Its Kind!
I'm thrilled to share TraVisionLM on Hugging Face! With 875M parameters, this lightweight, efficient model handles Turkish instructions for image inputs. Fully compatible with the Transformers library, it's easy to load, fine-tune, and use; no external libraries needed!
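A minimal loading sketch with plain Transformers, to show what that looks like in practice; the repository id, the Turkish prompt, and the image path below are illustrative assumptions rather than values taken from the model card:

```python
# Minimal sketch: loading and querying TraVisionLM with plain Transformers.
# The repo id below is an assumed placeholder; check the Hugging Face Hub for
# the exact repository name.
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "ucsahin/TraVisionLM-base"  # assumption, for illustration only
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

image = Image.open("example.jpg")      # any local image
prompt = "Açıkla"                      # a Turkish instruction ("Describe")

# Typical VLM processor call pattern: text + image in, tensors out.
inputs = processor(text=prompt, images=image, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=100)
print(processor.batch_decode(output_ids, skip_special_tokens=True)[0])
```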
Developed solo, TraVisionLM is a strong foundation for low-resource language research. While still improving, it's a key step for Turkish-language AI. Your feedback is welcome as I refine the model.
Let's push Turkish visual language processing forward!
DoLa decoding, presented as a conference paper at ICLR '24, has just been merged into Transformers by @joaogante and Yung-Sung Chuang. This new decoding method is simple yet extremely impressive!
Reminder: decoder LLMs (the GPT kind, the most common) generate their outputs one token at a time: at each step, given the current text, they compute a logit for each token in their vocabulary, representing the probability of that token coming next.
Then they either pick the highest logit token (greedy decoding) or sample one with a probability defined by the logits (sampling).
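A minimal sketch of that single next-token step, using gpt2 purely as a stand-in model:

```python
# One next-token step: greedy pick vs. sampling from the logits.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The capital of France is", return_tensors="pt")
with torch.no_grad():
    next_logits = model(**inputs).logits[0, -1]   # one logit per vocabulary token

greedy_id = next_logits.argmax()                         # greedy decoding
probs = torch.softmax(next_logits, dim=-1)
sampled_id = torch.multinomial(probs, num_samples=1)[0]  # sampling

print(tokenizer.decode(greedy_id.item()), "|", tokenizer.decode(sampled_id.item()))
```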
The authors of DoLa wanted to improve that simple method.
They built on the established finding that transformer LMs encode low-level information (like basic syntax) in their early layers, and higher-level information, such as factual knowledge, in their later layers.
This gave them their key idea: during decoding, rather than picking the token with the highest logit, why not pick the token with the most impressive increase in logit across layers?
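A rough illustration of that contrast on a GPT-2 checkpoint, re-using the model's LM head as an early-exit projection. This is a simplification, not the paper's full algorithm: DoLa also selects the premature layer dynamically (by Jensen-Shannon divergence) and restricts the contrast to a set of plausible tokens, both omitted here.

```python
# Rough sketch of DoLa's layer contrast (simplified: fixed premature layer,
# no adaptive plausibility constraint).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The capital of France is", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_hidden_states=True)

# "Premature" distribution: early-exit an intermediate layer through the final
# layer norm and the LM head (layer index 4 is an arbitrary choice here).
early_hidden = model.transformer.ln_f(out.hidden_states[4][0, -1])
early_logits = model.lm_head(early_hidden)

# "Mature" distribution: the usual final-layer logits.
final_logits = out.logits[0, -1]

# Pick the token whose log-probability grew the most between the two layers.
contrast = torch.log_softmax(final_logits, -1) - torch.log_softmax(early_logits, -1)
print(tokenizer.decode(contrast.argmax().item()))
```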
This gives impressive results: a 5% to 20% base-point increase across the benchmarks! For instance, on TruthfulQA (open-ended), across all model sizes the increase in truthfulness is 14 base points, which is around a 40% improvement compared to standard decoding!
Wouldn't decoding take longer because of this added contrasting step? The runtime increase is negligible: only 1 to 8%.
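With the feature merged, trying it should be a small change to a generate() call. The sketch below reflects my reading of the merged feature (a dola_layers argument); double-check the generation docs of your Transformers version.

```python
# Sketch: enabling DoLa decoding through generate() (assumed `dola_layers` argument).
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("What is the tallest mountain on Earth?", return_tensors="pt")
output_ids = model.generate(
    **inputs,
    max_new_tokens=50,
    do_sample=False,
    dola_layers="high",      # contrast the final layer with the upper half of layers
    repetition_penalty=1.2,  # often recommended alongside DoLa to limit repetition
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```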