Smol Community

community

AI & ML interests

The SmolTuners group is a community dedicated to the development of small-scale Large Language Models (LLMs) using consumer-grade GPUs.

Recent Activity

SmolTuners's activity

s3nh 
updated a Space 9 days ago
s3nh 
in SmolTuners/README 9 days ago

Gh organization

3
#3 opened 10 days ago by
s3nh

Gh organization

3
#3 opened 10 days ago by
s3nh
s3nh 
in SmolTuners/README 10 days ago

Optimizers

#2 opened 10 days ago by
s3nh
s3nh 
in SmolTuners/README 12 days ago

Datasets

3
#1 opened 14 days ago by
s3nh

Datasets

3
#1 opened 14 days ago by
s3nh
KnutJaegersberg 
posted an update 14 days ago
s3nh 
posted an update 15 days ago
view post
Post
1731
Welcome back,

Small Language Models Enthusiasts and GPU Poor oss enjoyers lets connect.
Just created an organization which main target is to have fun with smaller models tuneable on consumer range GPUs, feel free to join and lets have some fun, much love ;3

https://huggingface.co/SmolTuners
·
KnutJaegersberg 
posted an update 29 days ago
KnutJaegersberg 
posted an update about 1 month ago
KnutJaegersberg 
posted an update about 1 month ago
Felladrin 
posted an update about 2 months ago
view post
Post
1458
I'm curating an AI-powered web search software timeline at https://github.com/felladrin/awesome-ai-web-search

The list covers three main categories:

1. Web Search with LLM summarization and follow-up capabilities
2. LLM chat interfaces with Web Search integration
3. Agent-driven research tools using LLM + Web Search

The timeline helps track the evolution of this space and serves as a reference for anyone looking for alternatives. If you know of any tools that should be included, please contribute by:
- opening a PR to edit the readme: https://github.com/felladrin/awesome-ai-web-search/edit/main/readme.md
- creating an issue in the repository: https://github.com/felladrin/awesome-ai-web-search/issues/new/choose
- or sharing in the comments below.
  • 1 reply
·
Felladrin 
posted an update 3 months ago
view post
Post
2898
MiniSearch is celebrating its 1st birthday! 🎉

Exactly one year ago, I shared the initial version of this side-project on Hugging Face. Since then, there have been numerous changes under the hood. Nowadays it uses [Web-LLM](https://github.com/mlc-ai/web-llm), [Wllama](https://github.com/ngxson/wllama) and [SearXNG](https://github.com/searxng/searxng). I use it daily as my default search engine and have done my best to make it useful. I hope it's interesting for you too!

HF Space: Felladrin/MiniSearch
Embeddable URL: https://felladrin-minisearch.hf.space
  • 1 reply
·
KnutJaegersberg 
posted an update 4 months ago
view post
Post
1176
appvoid/arco

arco consistently outperforms every sota model below 600m parameters on average

appvoid/arco
KnutJaegersberg 
posted an update 5 months ago
KnutJaegersberg 
posted an update 5 months ago
KnutJaegersberg 
posted an update 5 months ago
KnutJaegersberg 
posted an update 6 months ago
KnutJaegersberg 
posted an update 6 months ago
view post
Post
641
Unsocial Intelligence: an Investigation of the Assumptions of AGI Discourse

I don't agree with some of the assertions made here, but it is an interesting paper and a good overview.

https://arxiv.org/abs/2401.13142
Fizzarolli 
posted an update 7 months ago
view post
Post
1954
hi everyone!

i wanted to share an experiment i did with upcycling phi-3 mini into an moe recently.
while benchmarks are definitely within a margin of error and they performed similarly, i think it's an interesting base to try and see if you can improve phi's performance! (maybe looking into HuggingFaceFW/fineweb-edu could be interesting, i also left some other notes if anyone with more compute access wants to try it themselves)

check it out! Fizzarolli/phi3-4x4b-v1