Vision, Language and Reading

non-profit

Activity Feed

AI & ML interests

Multimodal AI, Document Understanding, Reading Systems.

Recent Activity

emanuelevivoli authored a paper 3 months ago

ComiCap: A VLMs pipeline for dense captioning of Comic Panels

emanuelevivoli authored a paper 3 months ago

Towards Generative Class Prompt Learning for Fine-grained Visual Recognition

Llabres updated a Space 4 months ago

VLR-CVC/README

View all activity

VLR-CVC's activity

emanuelevivoli

authored 2 papers 3 months ago

ComiCap: A VLMs pipeline for dense captioning of Comic Panels

Paper • 2409.16159 • Published Sep 24, 2024 • 1

Towards Generative Class Prompt Learning for Fine-grained Visual Recognition

Paper • 2409.01835 • Published Sep 3, 2024

Llabres

updated a Space 4 months ago

Running

🐨

README

Llabres

authored a paper 4 months ago

Image-text matching for large-scale book collections

Paper • 2407.19812 • Published Jul 29, 2024

msouibgui

authored 3 papers 4 months ago

DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement

Paper • 2010.08764 • Published Oct 17, 2020

Few Shots Are All You Need: A Progressive Few Shot Learning Approach for Low Resource Handwritten Text Recognition

Paper • 2107.10064 • Published Jul 21, 2021

One missing piece in Vision and Language: A Survey on Comics Understanding

Paper • 2409.09502 • Published Sep 14, 2024 • 23

Llabres

authored a paper 4 months ago

One missing piece in Vision and Language: A Survey on Comics Understanding

Paper • 2409.09502 • Published Sep 14, 2024 • 23

emanuelevivoli

authored 5 papers 4 months ago

One missing piece in Vision and Language: A Survey on Comics Understanding

Paper • 2409.09502 • Published Sep 14, 2024 • 23

Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents

Paper • 2208.11203 • Published Aug 23, 2022

emanuelevivoli

authored a paper about 1 year ago

MUST-VQA: MUltilingual Scene-text VQA

Paper • 2209.06730 • Published Sep 14, 2022 • 2

AI & ML interests

Recent Activity

Team members 3

VLR-CVC's activity

README