Spaces: Running on A10G
Why do I get charged for errors!?
All I get for my requests are errors, and they still charge my OpenAI account!!
@kehsani You will be charged for the number of tokens used in the demo, irrespective of its success or failure. Occasionally, an error may occur in one of the stages; however, by that point, you would have already utilized some tokens (for interpreting your input, selecting the appropriate model, and so on). Hope this helps.
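To illustrate the point about billing: tokens are consumed stage by stage, so an error partway through still costs whatever the earlier stages used. This is a minimal sketch of that accrual; the stage structure and the per-1K-token price are made-up placeholders, not OpenAI's actual pipeline or rates.

```python
# Hypothetical illustration: each stage consumes tokens, and billing
# accrues per token even if a later stage fails. The price below is a
# placeholder, not a real OpenAI rate.
PRICE_PER_1K_TOKENS = 0.002  # placeholder rate (USD)

def run_pipeline(stages):
    """Run (tokens, ok) stages in order; return (tokens_billed, succeeded)."""
    tokens_billed = 0
    for stage_tokens, ok in stages:
        tokens_billed += stage_tokens  # tokens are spent before we know the outcome
        if not ok:
            return tokens_billed, False  # error: earlier tokens are already billed
    return tokens_billed, True

# Parsing (120 tokens) and model selection (80) succeed; execution fails.
tokens, ok = run_pipeline([(120, True), (80, True), (300, False)])
cost = tokens / 1000 * PRICE_PER_1K_TOKENS  # 500 tokens billed despite the failure
```

Even though the run fails, all 500 tokens used up to and including the failing stage are billed.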
@taesiri I am using an OpenAI key for GPT-3 as I am waiting for GPT-4 access; can that cause the error? Thanks
@microsoft Are you considering changing the default model to gpt-3.5-turbo? That would reduce costs tenfold without the hassle of duplicating the space.
@microsoft EOF
@taesiri When I try to duplicate HuggingGPT, it says my hardware will be downgraded to a free CPU and this might break the system. Is this what you were referring to, or is there a way to remain on a GPU and pay a bit for it? I went ahead and duplicated anyway, but got a runtime error: "limit 16Gi reached". There seem to be a few of these around.
@kehsani The role of the LLM (GPT-3.5/4) in this project is to parse natural language input and orchestrate the available models to answer the query and produce an output. Typical tasks, such as object detection or image captioning, can run on CPU-only Spaces (either free or paid); however, text-to-image models require powerful GPUs. You can enable or disable the available models here, depending on your use case. If you disable a few models, it might run in under 16 GB of RAM.
@taesiri Thanks for the feedback. I would be happy to run it on a faster machine and pay (perhaps not an A10G), but duplicating a space does not offer a choice of which GPU to run on. Also, now that the duplication has failed, if I try to duplicate again it says I have already duplicated this space. I don't see anywhere to delete the space, nor do I see it listed; there is nothing under Spaces! Yet I still have the option to create a new space!?
@taesiri A couple of questions. To disable a model, do I go to the link you provided and simply comment out some of the entries? And if a duplication fails due to the size limit, I guess restarting it won't help, since the imported libraries will always exceed the 16 GB CPU limit!? I am trying a factory reboot, and I get this error: "RuntimeError: Found no NVIDIA driver on your system. Please check that you have an NVIDIA GPU and installed a driver from http://www.nvidia.com/Download/index.aspx". I do have a GPU and I do use it, so I'm not sure this is really the problem. The space is supposed to run on CPU anyway!?
@kehsani Here you are https://huggingface.co/spaces/taesiri/HuggingGPT-Lite
Hi, I just used the link, but the model keeps returning "{'error': {'message': 'This is not a chat model and thus not supported in the v1/chat/completions endpoint. Did you mean to use v1/completions?', 'type': 'invalid_request_error', 'param': 'model', 'code': None}}". Not sure what's going on.
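For context, this error means the model name being sent to OpenAI's /v1/chat/completions endpoint is not a chat model. As a rough sketch of the distinction (the prefix list below is a simplification for illustration, not OpenAI's official routing logic), chat-family models and older completion-only models use different REST paths:

```python
# Simplified illustration: chat models (gpt-3.5-turbo, gpt-4) use the chat
# endpoint; legacy completion models (e.g. text-davinci-003) use the older
# completions endpoint. The prefix list is an assumption for this sketch.
CHAT_MODEL_PREFIXES = ("gpt-3.5-turbo", "gpt-4")

def endpoint_for(model: str) -> str:
    """Pick the OpenAI REST path that matches the model family."""
    if model.startswith(CHAT_MODEL_PREFIXES):
        return "/v1/chat/completions"  # chat models
    return "/v1/completions"           # legacy completion models
```

So if the duplicated Space is configured with a non-chat model name (or the key only has access to non-chat models), the chat endpoint will reject it with exactly this message.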
I have tested this, but I am unable to replicate the problem you are experiencing (both on Spaces and my local machine). Could you please provide more details so that we can assist you better?
Hi, all I did yesterday was duplicate HuggingGPT-Lite to my space and run it; I changed nothing. I tried on both Mac and Windows. I've made the Space I duplicated public, if you can check it.
Here's the result I got when running the example.
@Vito99 This is strange. Are you using the gpt-3.5-turbo model? Additionally, were you able to access this model on OpenAI's playground?
@Vito99 You are receiving the same error message here, which is interesting. To fix this, you should reach out to OpenAI, or alternatively ask about this error on https://community.openai.com/.
Sure! Thanks a LOT!!!!!