Smol Community

community

s3nh

AI & ML interests

The SmolTuners group is a community dedicated to the development of small-scale Large Language Models (LLMs) using consumer-grade GPUs.

Recent Activity

s3nh updated a Space 9 days ago

SmolTuners/README

s3nh new activity 9 days ago

SmolTuners/README:Gh organization

Delta-Vector new activity 9 days ago

SmolTuners/README:Gh organization

View all activity

SmolTuners's activity

s3nh

updated a Space 9 days ago

Running

😻

README

s3nh

in SmolTuners/README 9 days ago

Gh organization

#3 opened 10 days ago by

s3nh

Delta-Vector

in SmolTuners/README 9 days ago

Gh organization

#3 opened 10 days ago by

s3nh

in SmolTuners/README 10 days ago

Optimizers

#2 opened 10 days ago by

s3nh

in SmolTuners/README 12 days ago

Datasets

#1 opened 14 days ago by

s3nh

Delta-Vector

in SmolTuners/README 12 days ago

Datasets

#1 opened 14 days ago by

s3nh

KnutJaegersberg

posted an update 14 days ago

Post

1296

Intelligence Potentiation: An Evolutionary Perspective on AI Agent Designs

I found it useful to think of AI agent design as progressing up a ladder, through evolutionary selection.

https://huggingface.co/blog/KnutJaegersberg/intelligence-potentiation

s3nh

posted an update 15 days ago

Post

1731

Welcome back,

Small Language Models Enthusiasts and GPU Poor oss enjoyers lets connect.
Just created an organization which main target is to have fun with smaller models tuneable on consumer range GPUs, feel free to join and lets have some fun, much love ;3

https://huggingface.co/SmolTuners

3 replies

KnutJaegersberg

posted an update 29 days ago

Post

1223

Practical Consciousness Theory for AI System Design

Wrote a blog post about practical consciousness theory

https://huggingface.co/blog/KnutJaegersberg/practical-consciousness-theory

KnutJaegersberg

posted an update about 1 month ago

Post

1834

DrNicefellow/Qwen-QwQ-32B-Preview-4.25bpw-exl2

Rumor has it this is currently the best model for 24 GB VRAM local usage.

DrNicefellow/Qwen-QwQ-32B-Preview-4.25bpw-exl2

KnutJaegersberg

posted an update about 1 month ago

Post

1110

openGPT-X/Teuken-7B-instruct-research-v0.4

New European LLM

openGPT-X/Teuken-7B-instruct-research-v0.4

Felladrin

posted an update about 2 months ago

Post

1458

I'm curating an AI-powered web search software timeline at https://github.com/felladrin/awesome-ai-web-search

The list covers three main categories:

1. Web Search with LLM summarization and follow-up capabilities
2. LLM chat interfaces with Web Search integration
3. Agent-driven research tools using LLM + Web Search

The timeline helps track the evolution of this space and serves as a reference for anyone looking for alternatives. If you know of any tools that should be included, please contribute by:
- opening a PR to edit the readme: https://github.com/felladrin/awesome-ai-web-search/edit/main/readme.md
- creating an issue in the repository: https://github.com/felladrin/awesome-ai-web-search/issues/new/choose
- or sharing in the comments below.

1 reply

Felladrin

posted an update 3 months ago

Post

2898

MiniSearch is celebrating its 1st birthday! 🎉

Exactly one year ago, I shared the initial version of this side-project on Hugging Face. Since then, there have been numerous changes under the hood. Nowadays it uses [Web-LLM](https://github.com/mlc-ai/web-llm), [Wllama](https://github.com/ngxson/wllama) and [SearXNG](https://github.com/searxng/searxng). I use it daily as my default search engine and have done my best to make it useful. I hope it's interesting for you too!

HF Space: Felladrin/MiniSearch
Embeddable URL: https://felladrin-minisearch.hf.space

1 reply

KnutJaegersberg

posted an update 4 months ago

Post

1176

appvoid/arco

arco consistently outperforms every sota model below 600m parameters on average

appvoid/arco

KnutJaegersberg

posted an update 5 months ago

Post

2240

Wrote a blog post with some ideas about prompt engineering

https://huggingface.co/blog/KnutJaegersberg/first-principles-prompt-engineering

KnutJaegersberg

posted an update 5 months ago

Post

2314

mobiuslabsgmbh/Llama-3.1-70b-instruct_4bitgs64_hqq

99% of the performance across various benchmarks!

mobiuslabsgmbh/Llama-3.1-70b-instruct_4bitgs64_hqq

KnutJaegersberg

posted an update 5 months ago

Post

924

neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8

Requant of the big llama, using 20% less memory

neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8

KnutJaegersberg

posted an update 6 months ago

Post

1378

Decensored Gemma2-27b

TheDrummer/Big-Tiger-Gemma-27B-v1

KnutJaegersberg

posted an update 6 months ago

Post

641

Unsocial Intelligence: an Investigation of the Assumptions of AGI Discourse

I don't agree with some of the assertions made here, but it is an interesting paper and a good overview.

https://arxiv.org/abs/2401.13142

Fizzarolli

posted an update 7 months ago

Post

1954

hi everyone!

i wanted to share an experiment i did with upcycling phi-3 mini into an moe recently.
while benchmarks are definitely within a margin of error and they performed similarly, i think it's an interesting base to try and see if you can improve phi's performance! (maybe looking into HuggingFaceFW/fineweb-edu could be interesting, i also left some other notes if anyone with more compute access wants to try it themselves)

check it out! Fizzarolli/phi3-4x4b-v1

AI & ML interests

Recent Activity

Team members 21

SmolTuners's activity

README

Gh organization

Gh organization

Optimizers

Datasets

Datasets