Jue Wang
juewang
AI & ML interests
None yet
Organizations
juewang's activity
Context length?
10
#2 opened about 1 year ago
by
turboderp

Missing files?
#1 opened over 1 year ago
by
juewang

Correct the output dtype of rmsnorm_func
2
#13 opened over 1 year ago
by
ag0
how to fine tune peft qlora and SFTTrainer?
12
#2 opened over 1 year ago
by
NickyNicky

Poor performance?
4
#6 opened over 1 year ago
by
Fionn
Can you help me fine-tune this with LoRA? (Having an error)
1
#12 opened almost 2 years ago
by
AayushShah

What kind of machine would be suitable for this model (in amazon sagemaker)?
5
#7 opened almost 2 years ago
by
juusohugs

Will it be possible to run this on PC with 8 GeForce RTX 3060 with 8 Gb VRAM each?
2
#11 opened almost 2 years ago
by
ai2p
Any way to set the "stop, split by" when running the model locally?
4
#26 opened almost 2 years ago
by
johnnyracer
Issue with loading model to GPU when using pipeline
2
#5 opened almost 2 years ago
by
AlpYu-HubX
Is it a wrong prompt?
4
#8 opened almost 2 years ago
by
tatyanavidrevich
Feature requests and suggestions for V2
9
#4 opened over 2 years ago
by
zhangce
use accelerate to load model
1
#4 opened almost 2 years ago
by
adolf669
This model requires A LOT of resources... But how much? Trying to build a chatbot
9
#3 opened about 2 years ago
by
joanfmendo
Generated Text have issues
10
#22 opened about 2 years ago
by
asifahmed

Is UL2 used?
1
#2 opened about 2 years ago
by
JunnanLi
Question-Answering over documents
3
#19 opened about 2 years ago
by
tmishinev
Confused about bidirectional attention when implementing custom sampling loop
2
#25 opened about 2 years ago
by
ericanthonymitchell
Model behavior during adaptation phase
2
#24 opened about 2 years ago
by
jlli
Fine Tuning // Download Full Weights
2
#23 opened about 2 years ago
by
idop11