Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published 5 days ago • 70
LongIns: A Challenging Long-context Instruction-based Exam for LLMs Paper • 2406.17588 • Published 8 days ago • 18
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients Paper • 2406.17660 • Published 8 days ago • 5
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs Paper • 2406.15319 • Published 12 days ago • 55
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework Paper • 2406.14783 • Published 12 days ago • 15
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch Paper • 2406.14563 • Published 13 days ago • 30
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published 13 days ago • 75
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation Paper • 2406.13663 • Published 14 days ago • 7
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries Paper • 2406.12824 • Published 15 days ago • 20
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published 16 days ago • 54
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Paper • 2309.12307 • Published Sep 21, 2023 • 84
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 23
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study Paper • 2404.14047 • Published Apr 22 • 38
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated 21 days ago • 193
Korean Datasets I've released so far. Collection 지금까지 업로드한 한국어 데이터셋 콜렉션입니다. • 8 items • Updated May 24 • 15