AlbertS

LoserCheems

AI & ML interests

None yet

Recent Activity

upvoted a collection 21 days ago

Doge

liked a model 21 days ago

SmallDoge/Doge-20M-Instruct

liked a model 21 days ago

SmallDoge/Doge-160M-checkpoint

View all activity

Organizations

None yet

LoserCheems's activity

upvoted a collection 21 days ago

Doge

Collection

Doge family of small language models. • 7 items • Updated 19 days ago • 5

liked 2 models 21 days ago

SmallDoge/Doge-20M-Instruct

Question Answering • Updated 3 days ago • 52.7k • 3

SmallDoge/Doge-160M-checkpoint

Text Generation • Updated 3 days ago • 13.8k • 3

reacted to JingzeShi's post with 👍🤯👀 21 days ago

Post

2076

Only a single RTX 4090 running model pre-training is really slow, even for small language models!!! (https://huggingface.co/collections/JingzeShi/doge-slm-677fd879f8c4fd0f43e05458)

2 replies

reacted to JingzeShi's post with 🔥 21 days ago

Post

1700

🤩warmup -> stable -> decay leanring rate scheduler:
😎use the Stable Phase CheckPoints to Continue Training the model on Any New Dataset without spikes of the training!!!
SmallDoge/Doge-20M-checkpoint
SmallDoge/Doge-60M-checkpoint

4 replies

liked 5 models 21 days ago

upvoted a paper 2 months ago

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 7

upvoted a paper 4 months ago

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Paper • 2407.16958 • Published Jul 24, 2024 • 3

updated a dataset over 1 year ago

LoserCheems/Cartoon_pictures

Updated Jul 12, 2023 • 17

liked a Space over 1 year ago

798

StarChat Playground

⭐

Generate coding assistance and answers

liked a model over 1 year ago

openai-community/openai-gpt

Text Generation • Updated Feb 19, 2024 • 35.4k • 249