File size: 894 Bytes
74b944d
 
4b2d5d2
4c492d6
74b944d
 
7c09547
 
 
d080507
 
 
83cb829
e71c8dc
 
83cb829
bb51ecc
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
title: VLM Demo
sdk: docker
license: 	mit
---

This demo illustrates the work published in the paper ["Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models"](https://arxiv.org/pdf/2402.07865.pdf)


# Source code

For more information, please refer to this repository:

> *VLM Demo*: Lightweight repo for chatting with VLMs supported by our 
[VLM Evaluation Suite](https://github.com/TRI-ML/vlm-evaluation/tree/main).

# Huffing Face Space architecture

Hugging Face Space build a container image based on the `Dockerfile`. In this file, we use the base Nvidia base image and install additional packages and external repositories.

The Hugging Face Space start the container and execute `startup.sh`. The script loads each model on a separate GPU of the 4xA10G. Then it launches several processes: one for each model, the Gradio API controller and frontend.