Rafa
rafa9
rafa9's activity
Delay time and execution time considerably larger compared to 3.0 model?
#8 opened 2 months ago by rafa9 (2 comments)

How to try this model with Ooba's text-generation-webui?
#13 opened 9 months ago by rafa9

What quantization is it? Can be used with vLLM?
#1 opened 9 months ago by rafa9 (1 comment)

Is there a way to avoid it identifying itself as AI?
#2 opened about 1 year ago by rafa9 (1 comment)

How do we make a request? Result from inference API and inference endpoint are different
#2 opened over 1 year ago by rafa9

Getting KeyError: 'model.layers.16.self_attn.q_proj.wf1' on trying to prompt on Runpod
#10 opened over 1 year ago by rafa9 (2 comments)

Doesn't return any value using API? Model loaded successfully on runpod using The_Bloke's template
#4 opened over 1 year ago by rafa9 (1 comment)

How to run this on runpod?
#2 opened over 1 year ago by rafa9 (1 comment)

Is it 30B? In the description it says 13B
#2 opened over 1 year ago by rafa9 (3 comments)

Error on start - raise EnvironmentError( OSError: Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack
#10 opened over 1 year ago by ilnurshams (12 comments)

Anyone has been successful in deploying this to Sagemaker or so?
#9 opened over 1 year ago by rafa9