mikelabs (Mike Young)

upvoted a paper 4 months ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 76

upvoted 11 articles 4 months ago

Article

AIGS: Generating Science from AI-Powered Automated Falsification

By

•

Nov 22, 2024

• 2

Article

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

By

•

Nov 21, 2024

• 2

Article

Robust ASR Error Correction with Conservative Data Filtering

By

•

Nov 20, 2024

• 2

Article

That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design

By

•

Nov 19, 2024

• 1

Article

Generative Agent Simulations of 1,000 People

By

•

Nov 19, 2024

• 9

Article

Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions

By

•

Nov 19, 2024

• 3

Article

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

By

•

Nov 19, 2024

• 2

Article

Modeling AdaGrad, RMSProp, and Adam with Integro-Differential Equations

By

•

Nov 19, 2024

• 1

Article

StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

By

•

Nov 19, 2024

• 2

Article

GPTree: Towards Explainable Decision-Making via LLM-powered Decision Trees

By

•

Nov 18, 2024

• 2

Article

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

By

•

Nov 18, 2024

• 1

upvoted 2 papers 10 months ago

Part123: Part-aware 3D Reconstruction from a Single-view Image

Paper • 2405.16888 • Published May 27, 2024 • 12

STT: Stateful Tracking with Transformers for Autonomous Driving

Paper • 2405.00236 • Published Apr 30, 2024 • 9

upvoted 2 papers 11 months ago

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Paper • 2404.16820 • Published Apr 25, 2024 • 17

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 78

upvoted a paper 12 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 126

upvoted 3 papers over 1 year ago

Mike Young PRO

AI & ML interests

Organizations

mikelabs's activity

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

AIGS: Generating Science from AI-Powered Automated Falsification

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Robust ASR Error Correction with Conservative Data Filtering

That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design

Generative Agent Simulations of 1,000 People

Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Modeling AdaGrad, RMSProp, and Adam with Integro-Differential Equations

StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

GPTree: Towards Explainable Decision-Making via LLM-powered Decision Trees

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Part123: Part-aware 3D Reconstruction from a Single-view Image

STT: Stateful Tracking with Transformers for Autonomous Driving

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

HyperFields: Towards Zero-Shot Generation of NeRFs from Text

3D-GPT: Procedural 3D Modeling with Large Language Models

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection