Robert Sinclair

ZeroWw

AI & ML interests

LLMs optimization (model quantization and back-end optimizations) so that LLMs can run on computers of people with both kidneys. Discord: https://discord.com/channels/@robert_46007

Recent Activity

Organizations

Blog-explorers's profile picture Robert Sinclair's profile picture

ZeroWw's activity

New activity in inflatebot/MN-12B-Mag-Mell-R1 17 days ago
New activity in ggerganov/whisper.cpp 18 days ago

Please add the medium-it model

#22 opened 18 days ago by
ZeroWw

Silly version

#2 opened about 2 months ago by
ZeroWw
replied to TuringsSolutions's post about 2 months ago
view reply

hence my idea of the SILLY versions... ;)

replied to TuringsSolutions's post about 2 months ago
view reply

I am pretty sure that the actual models "AS THEY ARE" could perform 10 times better using chain of thought and some algorithms like these. Without needing a different training. And I think that's probably what CLAUDE does,

reacted to TuringsSolutions's post with ❤️ about 2 months ago
view post
Post
2105
Transformers are not all we need, that is being proven repeatedly now as more alternative frameworks emerge. Another such framework is Kolmogorov Arnold Network based Transformers. I break down exactly how these differ from Perceptron based Transformers and give you the link to my Colab where I create a model based on the research paper that absolutely destroys a standard Transformers based model. Check out the video here: https://www.youtube.com/watch?v=Sw0euxNZCc4
reacted to TuringsSolutions's post with ❤️ about 2 months ago
view post
Post
1411
I think Reinforcement Learning is the future, for a lot of reasons. I spell them out for you in this video, and also provide you with the basic code to get up and running with Atari and OpenAI Gym. If you want to get into RL, this is your ticket. Link to a cool training montage of the model in the description of the video as well. Step 2 from here would be the full-on training and certification that HuggingFace offers for RL.

https://youtu.be/ueZl3A36ZQk
New activity in TuringsSolutions/Phi3Unlocked about 2 months ago

My quants and silly expriment.

2
#1 opened about 2 months ago by
ZeroWw
New activity in CohereForAI/aya-expanse-8b about 2 months ago

Any chance of a 1B/2B/3B/4B model?

2
#5 opened about 2 months ago by
ZeroWw