SEWYLM 2
A new architecture that blends the following:
- nGPT
- LOCONUT (Limited COCONUT, a variation of COCONUT)
- Gemma2
- Differential Transformer
- NeuTRENO
As of 16 December 2024, this model can only be used through my library, SewyLM:
https://github.com/AarushCodes/SewyLM
LICENSE
GNU GPL v3