giboulot

danypropsy

AI & ML interests

None yet

Recent Activity

liked a model about 20 hours ago

AIDC-AI/Marco-o1

liked a Space about 22 hours ago

llamameta/llama3.1-405B

upvoted a collection about 22 hours ago

Organizations

None yet

danypropsy's activity

liked a model about 20 hours ago

AIDC-AI/Marco-o1

Text Generation • Updated Nov 23, 2024 • 7.85k • 713

liked a Space about 22 hours ago

Llama3.1 405B

Generate text based on your input

upvoted a collection about 22 hours ago

Gemma 3

All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 27 items • Updated about 6 hours ago • 30

reacted to prithivMLmods's post with 🤗 about 22 hours ago

Post

1786

Gemma-3-4B : Image and Video Inference 🖼️🎥

🧤Space: prithivMLmods/Gemma-3-Multimodal

@gemma3-4b : {Tag + Space_+ 'prompt'}
@video-infer : {Tag + Space_+ 'prompt'}

+ Gemma3-4B : google/gemma-3-4b-it
+ By default, it runs : prithivMLmods/Qwen2-VL-OCR-2B-Instruct

Gemma 3 Technical Report : https://storage.googleapis.com/deepmind-media/gemma/Gemma3Report.pdf

Additionally, I tested Aya-Vision 8B against the custom Qwen2-VL-OCR on test samples of messy handwriting, as an experiment in optimizing edge-device VLMs for optical character recognition (OCR).

📜Read the blog here: https://huggingface.co/blog/prithivMLmods/aya-vision-vs-qwen2vl-ocr-2b

liked 2 models 12 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 1 day ago • 441k • 1.13k

Wan-AI/Wan2.1-T2V-14B

Text-to-Video • Updated 2 days ago • 207k • 1.02k

liked a model 18 days ago

perplexity-ai/r1-1776

Text Generation • Updated 16 days ago • 55k • 2.13k

liked a Space about 1 month ago

TRELLIS

Scalable and Versatile 3D Generation from images

liked a model about 1 month ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 264k • 3.21k

liked a model about 2 months ago

Qwen/Qwen2.5-Math-PRM-72B

Text Classification • Updated Jan 17 • 1.03k • 71

updated a collection about 2 months ago

LLM Models

2 items • Updated Jan 20 • 1

upvoted a paper about 2 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 276

liked a model about 2 months ago

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated 18 days ago • 1.5k • 549

upvoted a collection about 2 months ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 2 days ago • 209

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 18 days ago • 1.59M • 1.26k

liked 2 Spaces about 2 months ago

Sentient Reasoner

Chat Long COT model that uses tags

MiniMaxText01

Communicate with a multimodal chatbot

liked a model 2 months ago

HuggingFaceTB/SmolVLM-Base

Image-Text-to-Text • Updated Nov 28, 2024 • 5.85k • 66

liked a Space 2 months ago

IChat

upvoted a collection 2 months ago

LLM Models

2 items • Updated Jan 20 • 1