ComiCap: A VLMs pipeline for dense captioning of Comic Panels Paper • 2409.16159 • Published Sep 24, 2024 • 1
Towards Generative Class Prompt Learning for Fine-grained Visual Recognition Paper • 2409.01835 • Published Sep 3, 2024
DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement Paper • 2010.08764 • Published Oct 17, 2020
Few Shots Are All You Need: A Progressive Few Shot Learning Approach for Low Resource Handwritten Text Recognition Paper • 2107.10064 • Published Jul 21, 2021
One missing piece in Vision and Language: A Survey on Comics Understanding Paper • 2409.09502 • Published Sep 14, 2024 • 23
One missing piece in Vision and Language: A Survey on Comics Understanding Paper • 2409.09502 • Published Sep 14, 2024 • 23
One missing piece in Vision and Language: A Survey on Comics Understanding Paper • 2409.09502 • Published Sep 14, 2024 • 23
Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents Paper • 2208.11203 • Published Aug 23, 2022
Comics Datasets Framework: Mix of Comics datasets for detection benchmarking Paper • 2407.03540 • Published Jul 3, 2024 • 3
CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding Paper • 2407.03550 • Published Jul 4, 2024 • 2