multimodalart HF staff commited on
Commit
93160e7
1 Parent(s): 953fb1d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -147,6 +147,8 @@ Using the model to generate content that is cruel to individuals is a misuse of
147
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
148
  and is not fit for product use without additional safety mechanisms and
149
  considerations.
 
 
150
 
151
  ### Bias
152
 
 
147
  [LAION-5B](https://laion.ai/blog/laion-5b/) which contains adult material
148
  and is not fit for product use without additional safety mechanisms and
149
  considerations.
150
+ - No additional measures were used to deduplicate the dataset. As a result, we observe some degree of memorization for images that are duplicated in the training data.
151
+ The training data can be searched at [https://rom1504.github.io/clip-retrieval/](https://rom1504.github.io/clip-retrieval/) to possibly assist in the detection of memorized images.
152
 
153
  ### Bias
154