instruction-pretrain

https://huggingface.co/papers/2406.14491

DaixuanC45443

AI & ML interests

Synthetic Instructions for Pre-Training

Recent Activity

updated a model 13 days ago

instruction-pretrain/instruction-synthesizer

upvoted a paper 13 days ago

How to Synthesize Text Data without Model Collapse?

updated a dataset about 1 month ago

instruction-pretrain/general-instruction-augmented-corpora

instruction-pretrain's activity

upvoted a paper 13 days ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 15 days ago • 48

upvoted 2 papers about 1 month ago

On Domain-Specific Post-Training for Multimodal Large Language Models

Paper • 2411.19930 • Published Nov 29, 2024 • 25

MH-MoE: Multi-Head Mixture-of-Experts

Paper • 2411.16205 • Published Nov 25, 2024 • 23

upvoted a paper about 2 months ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 108

upvoted 3 papers 3 months ago

Data Selection via Optimal Control for Language Models

Paper • 2410.07064 • Published Oct 9, 2024 • 8

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 168

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 60

upvoted 3 collections 7 months ago

synthetic-data-generation-demos

A collection of demos for various approaches to synthetic data generation • 4 items • Updated Jun 25, 2024 • 14

Instruction Pre-Training

8 items • Updated Jun 21, 2024 • 26

Daily paper that is inspiring (abstract is enough)

42 items • Updated Jul 19, 2024 • 1

upvoted 2 papers 7 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 86

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77