import gradio as gr gr.Interface.load("models/h2oai/h2ogpt-4096-llama2-70b-chat-4bit").launch()