Kaio Ken
kaiokendev
AI & ML interests
aah aah...
kaiokendev's activity
Interesting Papers
151
#1 opened about 1 year ago by PapersAnon
What exactly is SuperCOT-LoRA
1
#2 opened 12 months ago by FarziBuilder
Possibility that Claude/ChatGPT use similar techniques to adjust the RoPE sampling rate?
1
#4 opened 12 months ago by Yhyu13
Thanks for all the hard work! Chance to see superhot-65b?
9
#1 opened about 1 year ago by Panchovix
Work on a paper
3
#2 opened about 1 year ago by emozilla
Difference between this and 8k version?
10
#1 opened about 1 year ago by flashvenom
Is my understanding correct that the monkey patch needs to be added for inference only?
5
#1 opened about 1 year ago by flashvenom
7B, 33B and 65B versions?
3
#2 opened about 1 year ago by flashvenom
Training info
3
#1 opened about 1 year ago by ausboss
v230502 Testing and Discussion
89
#23 opened about 1 year ago by deleted
V4.3 Early Testing.
109
#15 opened about 1 year ago by deleted
The V4 is here
80
#11 opened about 1 year ago by TheYuriLover