Haoxiang-Wang
committed on
Commit
•
d28d923
1
Parent(s):
a9d1ed2
Update README.md
README.md
CHANGED
@@ -278,7 +278,7 @@ Model Type:
 Intended Users: | Generative AI developers for image and video generation models
 Output: | Images/Videos and Latent Tokens
 Describe how the model works: | Compresses and decompresses visual input (image/video).
-Technical Limitations: |
+Technical Limitations: | Due to tokenizer compression limitations, some visual information (such as small text and other structured fine details) may not be reconstructed accurately.
 Verified to have met prescribed NVIDIA quality standards: | Yes
 Performance Metrics: | Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), Reconstruction Fréchet Video Distance (rFVD), Reconstruction Fréchet Inception Distance (rFID), Latency
 Potential Known Risks: | Tokenizer's output can parse all forms of input, including what may be considered toxic, offensive, or indecent.