Spaces:

TRI-ML
/

vlm-demo

Paused

File size: 894 Bytes

74b944d
 
4b2d5d2
4c492d6
74b944d
 
7c09547
 
 
d080507
 
 
83cb829
e71c8dc
 
83cb829
bb51ecc

---
title: VLM Demo
sdk: docker
license: 	mit
---

This demo illustrates the work published in the paper ["Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models"](https://arxiv.org/pdf/2402.07865.pdf)


# Source code

For more information, please refer to this repository:

> *VLM Demo*: Lightweight repo for chatting with VLMs supported by our 
[VLM Evaluation Suite](https://github.com/TRI-ML/vlm-evaluation/tree/main).

# Huffing Face Space architecture

Hugging Face Space build a container image based on the `Dockerfile`. In this file, we use the base Nvidia base image and install additional packages and external repositories.

The Hugging Face Space start the container and execute `startup.sh`. The script loads each model on a separate GPU of the 4xA10G. Then it launches several processes: one for each model, the Gradio API controller and frontend.