nvidia/Cosmos-Tokenizer-DV8x16x16


Haoxiang-Wang committed 12 days ago

Commit d28d923 • 1 parent: a9d1ed2

Update README.md

Files changed (1)

README.md +1 -1

@@ -278,7 +278,7 @@ Model Type:
 Intended Users:                                                                                        |  Generative AI developers for image and video generation models
 Output:                                                                                                |  Images/Videos and Latent Tokens
 Describe how the model works:                                                                          |  Compresses and decompresses visual input (image/video).
-Technical Limitations:                                                                                 |  Some visual information (such as small text) may not be reconstructed accurately by the model.
+Technical Limitations:                                                                                 |  Due to tokenizer compression limitations, some visual information (such as small text and other structured fine details) may not be reconstructed accurately.
 Verified to have met prescribed NVIDIA quality standards:  |  Yes
 Performance Metrics:                                                                                   |  Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), Reconstruction Fréchet Video Distance (rFVD), Reconstruction Fréchet Inception Distance (rFID), Latency
 Potential Known Risks:                                                                                 |  Tokenizer's output can parse all forms of input, including what may be considered toxic, offensive, or indecent.