Last Week in Medical AI: Top Research Papers/Models š (September 21 - September 27, 2024) Sep 28 ā¢ 2
Performance Comparison: Llama-3.2 vs. Llama-3.1 LLMs and Smaller Models (3B, 1B) in Medical and Healthcare AI Domains š©ŗš§¬š Sep 26 ā¢ 6
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper ā¢ 2408.08872 ā¢ Published Aug 16 ā¢ 98
TraDiffusion: Trajectory-Based Training-Free Image Generation Paper ā¢ 2408.09739 ā¢ Published Aug 19 ā¢ 8
Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges Paper ā¢ 2408.08946 ā¢ Published Aug 16 ā¢ 11
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data Paper ā¢ 2408.10119 ā¢ Published Aug 19 ā¢ 16
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views Paper ā¢ 2408.10195 ā¢ Published Aug 19 ā¢ 12
MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model Paper ā¢ 2408.10198 ā¢ Published Aug 19 ā¢ 32
MambaEVT: Event Stream based Visual Object Tracking using State Space Model Paper ā¢ 2408.10487 ā¢ Published Aug 20 ā¢ 6
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model Paper ā¢ 2408.10764 ā¢ Published Aug 20 ā¢ 8
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Paper ā¢ 2408.11049 ā¢ Published Aug 20 ā¢ 12
NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency Paper ā¢ 2408.11054 ā¢ Published Aug 20 ā¢ 12
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning Paper ā¢ 2408.11001 ā¢ Published Aug 20 ā¢ 11
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper ā¢ 2408.11039 ā¢ Published Aug 20 ā¢ 58
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering Paper ā¢ 2408.09174 ā¢ Published Aug 17 ā¢ 51
Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification Paper ā¢ 2408.11237 ā¢ Published Aug 20 ā¢ 5
Backward-Compatible Aligned Representations via an Orthogonal Transformation Layer Paper ā¢ 2408.08793 ā¢ Published Aug 16 ā¢ 5
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting Paper ā¢ 2408.11706 ā¢ Published Aug 21 ā¢ 6
TrackGo: A Flexible and Efficient Method for Controllable Video Generation Paper ā¢ 2408.11475 ā¢ Published Aug 21 ā¢ 17
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models Paper ā¢ 2408.11817 ā¢ Published Aug 21 ā¢ 8