dumball's picture

dumball

archit11

·

https://archit-spec.github.io

AI & ML interests

small language models, looking for work please reachout archit1290@gmail.com

Recent Activity

upvoted an article about 5 hours ago

Train 400x faster Static Embedding Models with Sentence Transformers

updated a dataset about 1 month ago

archit11/uptso3

updated a dataset about 1 month ago

archit11/uptso2

View all activity

Organizations

archit11's activity

upvoted an article about 5 hours ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

3 days ago

• 90

upvoted 2 collections about 1 month ago

Reasoning

151 items • Updated Apr 6, 2024 • 29

🤖 Agents

21 items • Updated 17 days ago • 105

upvoted 2 collections about 2 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 11 days ago • 64

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 11 days ago • 53

upvoted a paper about 2 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 76

upvoted a collection about 2 months ago

VLM Datasets

29 items • Updated 5 days ago • 1

upvoted 2 articles about 2 months ago

Article

Low Code Large Language Model Alignment

By

•

Nov 19, 2024

• 13

Article

The Beginners Guide to Cleaning a Dataset

By

•

Nov 18, 2024

• 24

upvoted an article 2 months ago

Article

PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face

By

•

Nov 11, 2024

• 12

upvoted a collection 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 260

upvoted an article 2 months ago

Article

Recipe: Preparing Multilingual Speech Datasets for TTS Training

By

•

Nov 4, 2024

• 15

upvoted 2 collections 3 months ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 103

🌌 Synthetic textbooks

Synthetically generated textbooks • 5 items • Updated Jun 2, 2024 • 2

upvoted 3 articles 3 months ago

Article

How to optimize your data labelling project with custom interfaces

By

•

Oct 16, 2024

• 18

Article

Recoloring photos with diffusers

By

•

Oct 9, 2024

• 28

Article

Synthetic dataset generation techniques: Self-Instruct

By

•

May 15, 2024

• 14

upvoted 2 collections 4 months ago

Top Mini LLM

Collection of top mini llms • 5 items • Updated Oct 1, 2024 • 14

Paper-to-Read

5 items • Updated Sep 23, 2024 • 2

upvoted an article 4 months ago

Article

Understanding Vector Quantization in VQ-VAE

By

•

Aug 28, 2024

• 15