dumball
archit11
AI & ML interests
small language models, looking for work please reachout archit1290@gmail.com
Recent Activity
liked
a dataset
about 18 hours ago
simplescaling/s1K
upvoted
an
article
about 21 hours ago
The case for specialized pre-training: ultra-fast foundation models for dedicated tasks
upvoted
a
collection
2 days ago
Scotch & SOTA π₯ Pt. 7: Human Feedback Datasets π«£
Organizations
Collections
6
spaces
9
models
22
archit11/smollm350m-grpo
Text Generation
β’
Updated
β’
13
archit11/outputs
Updated
archit11/token_classification_model
Updated
β’
54
archit11/tinystories
Text Generation
β’
Updated
β’
172
archit11/Llama-1B-abliterated
Text Generation
β’
Updated
β’
154
archit11/Neuralqwen3-0.5B-slerp
Text Generation
β’
Updated
β’
11
archit11/Neuralqwen2-0.5B-slerp
Text Generation
β’
Updated
β’
6
archit11/qwen_worldmodel
Text Generation
β’
Updated
β’
12
archit11/small-function-calling
Text2Text Generation
β’
Updated
β’
28
archit11/worldmodel2
Text Generation
β’
Updated
β’
7
datasets
9
archit11/arxiv_links
Viewer
β’
Updated
β’
842
β’
27
archit11/uptso3
Preview
β’
Updated
β’
18
archit11/uptso2
Updated
β’
24
archit11/uspto
Preview
β’
Updated
β’
18
archit11/chiptune_music
Viewer
β’
Updated
β’
18
β’
460
archit11/distilabel-example4
Viewer
β’
Updated
β’
5
β’
55
archit11/distilabel-example
Viewer
β’
Updated
β’
10
β’
51
archit11/worldbuilding_dpo
Viewer
β’
Updated
β’
26.7k
β’
44
archit11/worldbuilding
Viewer
β’
Updated
β’
29k
β’
50