AMIA THIERRY STEPHANE
r4gamia
Recent Activity
Replied to prithivMLmods's post 1 day ago
Qwen2VL Models: Vision and Language Processing 🍉
📍Fine-tunes: [ LaTeX OCR, Math Parsing, Text Analogy OCRTest ]
❄️Demo: https://huggingface.co/spaces/prithivMLmods/Qwen2-VL-2B (the demo includes the Qwen2VL 2B base model)
🎯The Space extracts the content of the input image and documents it as standardized plain text. It includes formatting tools: over 30 font styles, export to PDF and DOCX, text alignment, font size adjustment, and line spacing control.
📄PDFs are rendered with the ReportLab library.
🧵Models:
+ https://huggingface.co/prithivMLmods/Qwen2-VL-OCR-2B-Instruct
+ https://huggingface.co/prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
+ https://huggingface.co/prithivMLmods/Qwen2-VL-Math-Prase-2B-Instruct
🚀Sample Document:
+ https://drive.google.com/file/d/1Hfqqzq4Xc-3eTjbz-jcQY84V5E1YM71E/view?usp=sharing
📦Collection:
+ https://huggingface.co/collections/prithivMLmods/vision-language-models-67639f790e806e1f9799979f
@prithivMLmods 🤗