lee dong ryeol's picture

2 20 7

lee dong ryeol

drlee1

·

AI & ML interests

None yet

Organizations

None yet

drlee1's activity

upvoted a paper 2 days ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published 5 days ago • 70

upvoted 3 papers 5 days ago

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published 8 days ago • 18

Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients

Paper • 2406.17660 • Published 8 days ago • 5

Scaling Laws for Linear Complexity Language Models

Paper • 2406.16690 • Published 9 days ago • 21

upvoted 2 papers 6 days ago

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published 12 days ago • 55

Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework

Paper • 2406.14783 • Published 12 days ago • 15

upvoted 3 papers 7 days ago

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published 13 days ago • 30

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published 13 days ago • 75

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

Paper • 2406.13663 • Published 14 days ago • 7

upvoted a paper 12 days ago

Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 47

upvoted a collection 13 days ago

RAG

RAG research • 9 items • Updated 14 days ago • 2

upvoted 2 papers 13 days ago

From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Paper • 2406.12824 • Published 15 days ago • 20

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published 16 days ago • 54

upvoted a paper 27 days ago

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 84

upvoted a paper 28 days ago

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 23

upvoted a paper about 2 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 76

upvoted 2 papers 2 months ago

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22 • 38

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 55

upvoted 2 collections 2 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated 21 days ago • 193

Korean Datasets I've released so far.

지금까지 업로드한 한국어 데이터셋 콜렉션입니다. • 8 items • Updated May 24 • 15