iMeanAI

company

iMeanAI's activity

PanYC

updated a dataset 3 months ago

iMeanAI/Mind2Web-Live

Viewer • Updated Oct 31, 2024 • 646 • 73 • 10

magicgh

authored 2 papers 3 months ago

On the Multi-turn Instruction Following for Conversational Web Agents

Paper • 2402.15057 • Published Feb 23, 2024

Ask-before-Plan: Proactive Language Agents for Real-World Planning

Paper • 2406.12639 • Published Jun 18, 2024

ziyjiang

authored a paper 3 months ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 38

han032206

authored a paper 7 months ago

WebCanvas: Benchmarking Web Agents in Online Environments

Paper • 2406.12373 • Published Jun 18, 2024

ziyjiang

authored 3 papers 7 months ago

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 45

MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Paper • 2406.15252 • Published Jun 21, 2024 • 16

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21, 2024 • 64

PanYC

updated a dataset 8 months ago

iMeanAI/Mind2Web-Live

Viewer • Updated Oct 31, 2024 • 646 • 73 • 10