Open Materials 2024 (OMat24) Inorganic Materials Dataset and Models Paper • 2410.12771 • Published 4 days ago • 3
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment Paper • 2410.13785 • Published 3 days ago • 17
BenTo: Benchmark Task Reduction with In-Context Transferability Paper • 2410.13804 • Published 3 days ago • 18
MoH: Multi-Head Attention as Mixture-of-Head Attention Paper • 2410.11842 • Published 5 days ago • 17
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Paper • 2410.13863 • Published 3 days ago • 25
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper • 2410.12705 • Published 4 days ago • 20
JudgeBench: A Benchmark for Evaluating LLM-based Judges Paper • 2410.12784 • Published 4 days ago • 25
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models Paper • 2410.13085 • Published 4 days ago • 19
Harnessing Webpage UIs for Text-Rich Visual Understanding Paper • 2410.13824 • Published 3 days ago • 23
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published 3 days ago • 65
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities Paper • 2410.07722 • Published 10 days ago • 12
Tracking Universal Features Through Fine-Tuning and Model Merging Paper • 2410.12391 • Published 4 days ago • 5
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs Paper • 2410.12405 • Published 4 days ago • 13
ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression Paper • 2410.08584 • Published 9 days ago • 10
Exploring Model Kinship for Merging Large Language Models Paper • 2410.12613 • Published 4 days ago • 18
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Paper • 2410.12628 • Published 4 days ago • 16
HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks Paper • 2410.12381 • Published 4 days ago • 39
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published 6 days ago • 14
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory Paper • 2410.10813 • Published 6 days ago • 9
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling Paper • 2410.09223 • Published 9 days ago • 5
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks Paper • 2410.10563 • Published 6 days ago • 34
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation Paper • 2410.09584 • Published 8 days ago • 42
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models Paper • 2410.09732 • Published 7 days ago • 53
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization Paper • 2410.08815 • Published 9 days ago • 34
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness Paper • 2410.07035 • Published 11 days ago • 16
SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights Paper • 2410.09008 • Published 9 days ago • 16
Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper • 2410.05355 • Published 13 days ago • 26
CursorCore: Assist Programming through Aligning Anything Paper • 2410.07002 • Published 11 days ago • 12
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published 13 days ago • 8
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Paper • 2410.02707 • Published 17 days ago • 44
TLDR: Token-Level Detective Reward Model for Large Vision Language Models Paper • 2410.04734 • Published 14 days ago • 15
Presto! Distilling Steps and Layers for Accelerating Music Generation Paper • 2410.05167 • Published 13 days ago • 15
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide Paper • 2410.04364 • Published 14 days ago • 26
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction Paper • 2410.04932 • Published 13 days ago • 8
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published 19 days ago • 136
MIGA: Mixture-of-Experts with Group Aggregation for Stock Market Prediction Paper • 2410.02241 • Published 17 days ago • 6
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise Paper • 2410.03017 • Published 17 days ago • 25
Learning the Latent Rules of a Game from Data: A Chess Story Paper • 2410.02426 • Published 17 days ago • 4
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data Paper • 2410.02056 • Published 18 days ago • 4
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models Paper • 2410.01782 • Published 18 days ago • 9
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published Aug 28 • 34
Awesome SFT datasets Collection • A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 115
Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data? Paper • 2407.16607 • Published Jul 23 • 21
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries Paper • 2406.12824 • Published Jun 18 • 20
HARE: HumAn pRiors, a key to small language model Efficiency Paper • 2406.11410 • Published Jun 17 • 38