abunchofrandomwords

AI & ML interests

None yet

Organizations

None yet

abunchofrandomwords's activity

upvoted a paper 26 days ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published 27 days ago • 21

upvoted a paper 4 months ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83

upvoted 2 articles 6 months ago

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Article • Jun 24, 2024 • 181

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

Article • Jul 19, 2024 • 18

upvoted 2 papers 7 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 87

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 73

upvoted a collection 9 months ago

📀 Dataset comparison models

Collection • 1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 35

upvoted a paper about 1 year ago

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Paper • 2309.04269 • Published Sep 8, 2023 • 32

upvoted 12 papers over 1 year ago

Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 75

FocalFormer3D : Focusing on Hard Instance for 3D Object Detection

Paper • 2308.04556 • Published Aug 8, 2023 • 8

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

Paper • 2308.04729 • Published Aug 9, 2023 • 31

Shepherd: A Critic for Language Model Generation

Paper • 2308.04592 • Published Aug 8, 2023 • 31

PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers

Paper • 2308.05732 • Published Aug 10, 2023 • 8

Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI

Paper • 2308.05221 • Published Aug 9, 2023 • 9

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

Paper • 2308.05374 • Published Aug 10, 2023 • 27

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

Paper • 2308.01390 • Published Aug 2, 2023 • 33