Team-PIXEL

university

https://github.com/xplip/pixel

Activity Feed Request to join this org

AI & ML interests

Language modelling with pixels

Recent Activity

e-bug authored a paper about 1 month ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

lyan62 authored a paper 6 months ago

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

lyan62 authored a paper 6 months ago

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

View all activity

Team-PIXEL's activity

e-bug

authored a paper about 1 month ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published about 1 month ago • 120

lyan62

authored 3 papers 6 months ago

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

Paper • 2406.11030 • Published Jun 16, 2024

Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

Paper • 2406.02265 • Published Jun 4, 2024 • 6

The Role of Data Curation in Image Captioning

Paper • 2305.03610 • Published May 5, 2023

e-bug

authored a paper 6 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

e-bug

authored a paper 8 months ago

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Paper • 2404.16820 • Published Apr 25, 2024 • 15

plip

updated a Space 9 months ago

Runtime error

🐱

PIXEL

ilkerkesen

authored a paper 12 months ago

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

Paper • 2311.07022 • Published Nov 13, 2023 • 1

jflotz

updated 4 datasets 12 months ago

elliottd

authored a paper about 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 10

plip

authored a paper about 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 10

esalesky

authored a paper about 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 10

jflotz

authored a paper about 1 year ago

Text Rendering Strategies for Pixel Language Models

Paper • 2311.00522 • Published Nov 1, 2023 • 10

jflotz

updated 2 datasets over 1 year ago

Team-PIXEL/bigrams_wiki-en_529

Viewer • Updated Oct 2, 2023 • 18.4M • 105

Team-PIXEL/bigrams_bookcorpus_529

Viewer • Updated Oct 2, 2023 • 9.81M • 68

e-bug

authored a paper over 1 year ago

Measuring Progress in Fine-grained Vision-and-Language Understanding

Paper • 2305.07558 • Published May 12, 2023 • 1

jflotz

updated a model over 1 year ago

Team-PIXEL/pixel-base-bigrams

Updated May 11, 2023 • 180

AI & ML interests

Recent Activity

Team members 13

Team-PIXEL's activity

PIXEL