devngho's picture

devngho PRO

devngho

·

https://ngho.dev

devngho

AI & ML interests

Efficient Korean NLP, Fine Korean datasets

Recent Activity

liked a dataset 5 days ago

HuggingFaceTB/finemath

liked a dataset 17 days ago

HuggingFaceFW/fineweb-2

liked a Space 18 days ago

huggingface/open-source-ai-year-in-review-2024

View all activity

Organizations

devngho's activity

upvoted a paper 2 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 47

upvoted a collection 3 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 28 days ago • 444

upvoted a paper 3 months ago

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 84

upvoted an article 4 months ago

Article

Mergoo: Efficiently Build Your Own MoE LLM

By

•

Jun 3

• 42

upvoted 3 papers 4 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 86

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Paper • 2304.01373 • Published Apr 3, 2023 • 9

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 53

upvoted a paper 5 months ago

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8 • 9

upvoted 2 articles 8 months ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

By

•

Apr 28

• 37

Article

Can We Train Chat Models with Raw Data?

By

•

Apr 25

• 17