Add pipeline tag, library name, link to paper

#2
by nielsr HF staff - opened
README.md CHANGED
@@ -3,6 +3,7 @@ license: mit
3
  license_link: https://github.com/microsoft/VidTok/blob/main/LICENSE
4
 
5
  tags:
 
6
  - tokenization
7
  - video generation
8
  - world model
@@ -27,8 +28,8 @@ VidTok, trained on a large-scale video dataset, outperforms previous models acro
27
  Resources and technical documentation:
28
 
29
  + [GitHub](https://github.com/microsoft/VidTok)
30
- + [arXiv](https://arxiv.org/pdf/2412.13061)
31
-
32
 
33
  ## Model Performance
34
 
@@ -103,4 +104,27 @@ The model is released under the [MIT license](https://github.com/microsoft/VidTo
103
 
104
  ## Contact
105
 
106
- We welcome feedback and collaboration from our audience. If you have suggestions, questions, or observe unexpected/offensive behavior in our technology, please contact us at tianyuhe@microsoft.com.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  license_link: https://github.com/microsoft/VidTok/blob/main/LICENSE
4
 
5
  tags:
6
+ - video-to-video
7
  - tokenization
8
  - video generation
9
  - world model
 
28
  Resources and technical documentation:
29
 
30
  + [GitHub](https://github.com/microsoft/VidTok)
31
+ + [arXiv](https://arxiv.org/abs/2412.13061)
32
+ + [paper](https://huggingface.co/papers/2412.17726)
33
 
34
  ## Model Performance
35
 
 
104
 
105
  ## Contact
106
 
107
+ We welcome feedback and collaboration from our audience. If you have suggestions, questions, or observe unexpected/offensive behavior in our technology, please contact us at tianyuhe@microsoft.com.
108
+
109
+ ## Contributing
110
+
111
+ This project welcomes contributions and suggestions. Most contributions require you to agree to a
112
+ Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us
113
+ the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
114
+
115
+ When you submit a pull request, a CLA bot will automatically determine whether you need to provide
116
+ a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions
117
+ provided by the bot. You will only need to do this once across all repos using our CLA.
118
+
119
+ This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).
120
+ For more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/) or
121
+ contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.
122
+
123
+
124
+ ## Trademarks
125
+
126
+ This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft
127
+ trademarks or logos is subject to and must follow
128
+ [Microsoft's Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks/usage/general).
129
+ Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship.
130
+ Any use of third-party trademarks or logos are subject to those third-party's policies.
checkpoints/vidtok_fsq_causal_41616_262144.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:86035579f7037d9ec2ca1ef9e0c310c03882fcbad82b0ce51a40568db786be63
3
- size 866056490
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eea057fe31932586474906647feed67088603491703132d6b5cf7a48f6d6e6bf
3
+ size 2358846352
checkpoints/vidtok_fsq_causal_488_262144.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:56139b893176f11a6bf03f44a384c4a9c838fb7fc05cf97352b1e96a07a8c4bf
3
- size 699955790
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3f546bd9bdf20691e0f60b37de625483225467282d3f407b9dbac20fffa272bd
3
+ size 1937656428
checkpoints/vidtok_fsq_causal_488_32768.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cc7f0039c53ec1de83322698f1a8847feaba95d3060798c28cb0e1313604283d
3
- size 699844722
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65ef404fc44cc0ed95a61129d1fe73a34c16b4d25464ae3d91d3c3f781bec4ab
3
+ size 1937379948
checkpoints/vidtok_fsq_causal_488_4096.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:610348b0c8c25df1e92d31e6135089f8daed50fe30af40f4432994d9ce283fb1
3
- size 699733654
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:22c45558789b942858aa163da27f862453118c8c1131c91374cf1072ee0b009e
3
+ size 1937103404
checkpoints/vidtok_fsq_noncausal_41616_262144.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:22127b45eaac642693041be2f5551a488de04ad17bcfb20c7b392d61c99eda99
3
- size 866052922
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:31cbc6ec6a539dcee9382423ff40d125c946a3354e3390861ca897bcd7ec69a7
3
+ size 2358844048
checkpoints/vidtok_fsq_noncausal_488_262144.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1dcb479f276e8daef9aacd252912e1efc883669adb335e5a4b82aa17bd5387ce
3
- size 699952738
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:26479a9cfd56e9c95da7d322ae93bc82a5445b684749adfa97feedd7edf290c4
3
+ size 1937651628
checkpoints/vidtok_kl_causal_288_8chn.ckpt DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:097f9ce6ad8ccf36d83ee6953118d6f426398e89188ea9f2e07afc8872b904b0
3
- size 665222874
 
 
 
 
checkpoints/vidtok_kl_causal_41616_4chn.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d92d7b9d639cc0633f23b5447e0a9f7b460403ec1eec4d755ce56575037814c3
3
- size 866054682
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4d379b4f482115656de0b1cbd434bdaa169997531497ac6c83c503243eebadbb
3
+ size 2358514448
checkpoints/vidtok_kl_causal_444_4chn.ckpt DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:dcc2e0fce3c127effcd17c5ca47f9cd29b8dd2f67a800e054154c56fa5673d72
3
- size 689923130
 
 
 
 
checkpoints/vidtok_kl_causal_488_16chn.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5efdf675a98ed6867a454bc4f65130de79b1caddd89d9fcd3a43eb1a981f7eb6
3
- size 701945558
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5393b0a8452590d6d50703796a09ae7cf9115b2f9da485dc7a126c0bb2fceed5
3
+ size 1941305964
checkpoints/vidtok_kl_causal_488_4chn.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e10481b370af68b3712d91affd0d5a8a59e83a1d18dcbdcc3fa02376668a682c
3
- size 699954234
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c1fe29a67565fc41ead1b0d4ac6837cc443fe1565a427d42a42948412752496
3
+ size 1915188947
checkpoints/vidtok_kl_causal_488_8chn.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6674a27f4ae661eebf105a336b6ac10d1a09ef7b38edd71470081360a4607331
3
- size 700617850
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc8e08a8d07bcb39a44e185e81e0826a8adea0bb4bfa278f18a192b264b40c83
3
+ size 1938651692
checkpoints/vidtok_kl_noncausal_41616_4chn.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:64273b7030a3b3c2d194521e4778cfa8a684cda03d71b05b766e68e4112980c6
3
- size 866051114
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:acf9f24c0a3b73ab42ca0b53cfe49ba6b8503dfe3e28b67a4475247ec654a764
3
+ size 2358510865
checkpoints/vidtok_kl_noncausal_488_4chn.ckpt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a0ebf5e03f4bc1855f98a83c45097e305f2704a3d814e916e90b6b730d4b49e7
3
- size 699951182
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a653f5ce8c080ba60e3fdf88fca255f58c152bc5c69f2fc5a2c5aac16aaea76
3
+ size 1937319660