Add pipeline tag, library name, link to paper
#2
by
nielsr
HF staff
- opened
- README.md +27 -3
- checkpoints/vidtok_fsq_causal_41616_262144.ckpt +2 -2
- checkpoints/vidtok_fsq_causal_488_262144.ckpt +2 -2
- checkpoints/vidtok_fsq_causal_488_32768.ckpt +2 -2
- checkpoints/vidtok_fsq_causal_488_4096.ckpt +2 -2
- checkpoints/vidtok_fsq_noncausal_41616_262144.ckpt +2 -2
- checkpoints/vidtok_fsq_noncausal_488_262144.ckpt +2 -2
- checkpoints/vidtok_kl_causal_288_8chn.ckpt +0 -3
- checkpoints/vidtok_kl_causal_41616_4chn.ckpt +2 -2
- checkpoints/vidtok_kl_causal_444_4chn.ckpt +0 -3
- checkpoints/vidtok_kl_causal_488_16chn.ckpt +2 -2
- checkpoints/vidtok_kl_causal_488_4chn.ckpt +2 -2
- checkpoints/vidtok_kl_causal_488_8chn.ckpt +2 -2
- checkpoints/vidtok_kl_noncausal_41616_4chn.ckpt +2 -2
- checkpoints/vidtok_kl_noncausal_488_4chn.ckpt +2 -2
README.md
CHANGED
@@ -3,6 +3,7 @@ license: mit
|
|
3 |
license_link: https://github.com/microsoft/VidTok/blob/main/LICENSE
|
4 |
|
5 |
tags:
|
|
|
6 |
- tokenization
|
7 |
- video generation
|
8 |
- world model
|
@@ -27,8 +28,8 @@ VidTok, trained on a large-scale video dataset, outperforms previous models acro
|
|
27 |
Resources and technical documentation:
|
28 |
|
29 |
+ [GitHub](https://github.com/microsoft/VidTok)
|
30 |
-
+ [arXiv](https://arxiv.org/
|
31 |
-
|
32 |
|
33 |
## Model Performance
|
34 |
|
@@ -103,4 +104,27 @@ The model is released under the [MIT license](https://github.com/microsoft/VidTo
|
|
103 |
|
104 |
## Contact
|
105 |
|
106 |
-
We welcome feedback and collaboration from our audience. If you have suggestions, questions, or observe unexpected/offensive behavior in our technology, please contact us at tianyuhe@microsoft.com.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
license_link: https://github.com/microsoft/VidTok/blob/main/LICENSE
|
4 |
|
5 |
tags:
|
6 |
+
- video-to-video
|
7 |
- tokenization
|
8 |
- video generation
|
9 |
- world model
|
|
|
28 |
Resources and technical documentation:
|
29 |
|
30 |
+ [GitHub](https://github.com/microsoft/VidTok)
|
31 |
+
+ [arXiv](https://arxiv.org/abs/2412.13061)
|
32 |
+
+ [paper](https://huggingface.co/papers/2412.17726)
|
33 |
|
34 |
## Model Performance
|
35 |
|
|
|
104 |
|
105 |
## Contact
|
106 |
|
107 |
+
We welcome feedback and collaboration from our audience. If you have suggestions, questions, or observe unexpected/offensive behavior in our technology, please contact us at tianyuhe@microsoft.com.
|
108 |
+
|
109 |
+
## Contributing
|
110 |
+
|
111 |
+
This project welcomes contributions and suggestions. Most contributions require you to agree to a
|
112 |
+
Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us
|
113 |
+
the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
|
114 |
+
|
115 |
+
When you submit a pull request, a CLA bot will automatically determine whether you need to provide
|
116 |
+
a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions
|
117 |
+
provided by the bot. You will only need to do this once across all repos using our CLA.
|
118 |
+
|
119 |
+
This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).
|
120 |
+
For more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/) or
|
121 |
+
contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.
|
122 |
+
|
123 |
+
|
124 |
+
## Trademarks
|
125 |
+
|
126 |
+
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft
|
127 |
+
trademarks or logos is subject to and must follow
|
128 |
+
[Microsoft's Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks/usage/general).
|
129 |
+
Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship.
|
130 |
+
Any use of third-party trademarks or logos are subject to those third-party's policies.
|
checkpoints/vidtok_fsq_causal_41616_262144.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eea057fe31932586474906647feed67088603491703132d6b5cf7a48f6d6e6bf
|
3 |
+
size 2358846352
|
checkpoints/vidtok_fsq_causal_488_262144.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3f546bd9bdf20691e0f60b37de625483225467282d3f407b9dbac20fffa272bd
|
3 |
+
size 1937656428
|
checkpoints/vidtok_fsq_causal_488_32768.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:65ef404fc44cc0ed95a61129d1fe73a34c16b4d25464ae3d91d3c3f781bec4ab
|
3 |
+
size 1937379948
|
checkpoints/vidtok_fsq_causal_488_4096.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:22c45558789b942858aa163da27f862453118c8c1131c91374cf1072ee0b009e
|
3 |
+
size 1937103404
|
checkpoints/vidtok_fsq_noncausal_41616_262144.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:31cbc6ec6a539dcee9382423ff40d125c946a3354e3390861ca897bcd7ec69a7
|
3 |
+
size 2358844048
|
checkpoints/vidtok_fsq_noncausal_488_262144.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:26479a9cfd56e9c95da7d322ae93bc82a5445b684749adfa97feedd7edf290c4
|
3 |
+
size 1937651628
|
checkpoints/vidtok_kl_causal_288_8chn.ckpt
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:097f9ce6ad8ccf36d83ee6953118d6f426398e89188ea9f2e07afc8872b904b0
|
3 |
-
size 665222874
|
|
|
|
|
|
|
|
checkpoints/vidtok_kl_causal_41616_4chn.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4d379b4f482115656de0b1cbd434bdaa169997531497ac6c83c503243eebadbb
|
3 |
+
size 2358514448
|
checkpoints/vidtok_kl_causal_444_4chn.ckpt
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:dcc2e0fce3c127effcd17c5ca47f9cd29b8dd2f67a800e054154c56fa5673d72
|
3 |
-
size 689923130
|
|
|
|
|
|
|
|
checkpoints/vidtok_kl_causal_488_16chn.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5393b0a8452590d6d50703796a09ae7cf9115b2f9da485dc7a126c0bb2fceed5
|
3 |
+
size 1941305964
|
checkpoints/vidtok_kl_causal_488_4chn.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0c1fe29a67565fc41ead1b0d4ac6837cc443fe1565a427d42a42948412752496
|
3 |
+
size 1915188947
|
checkpoints/vidtok_kl_causal_488_8chn.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cc8e08a8d07bcb39a44e185e81e0826a8adea0bb4bfa278f18a192b264b40c83
|
3 |
+
size 1938651692
|
checkpoints/vidtok_kl_noncausal_41616_4chn.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:acf9f24c0a3b73ab42ca0b53cfe49ba6b8503dfe3e28b67a4475247ec654a764
|
3 |
+
size 2358510865
|
checkpoints/vidtok_kl_noncausal_488_4chn.ckpt
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9a653f5ce8c080ba60e3fdf88fca255f58c152bc5c69f2fc5a2c5aac16aaea76
|
3 |
+
size 1937319660
|