apple
/

coreml-FastViT-T8

Image Classification

Core ML

Model card Files Files and versions Community

pcuenq HF staff commited on 20 days ago

Commit

9d35df4

•

1 Parent(s): 2e66525

Model card updates and fixes

Browse files

Files changed (1) hide show

README.md +25 -4

README.md CHANGED Viewed

@@ -8,11 +8,13 @@ license_link: LICENSE
 datasets:
 - imagenet-1k
 ---
 # FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization
 Please observe [original license](https://github.com/apple/ml-fastvit/blob/8af5928238cab99c45f64fc3e4e7b1516b8224ba/LICENSE).
 ## Model Details
 - **Model Type:** Image classification / feature backbone
 - **Model Stats:**
   - Params (M): 4.0
@@ -28,10 +30,8 @@ Please observe [original license](https://github.com/apple/ml-fastvit/blob/8af59
 | Variant                                                 | Parameters | Size (MB) | Weight precision | Act. precision | Δ Pytorch acc |
 | ------------------------------------------------------- | ---------: | --------: | ---------------- | -------------- | ------------- |
-| T8                                                      |      3.6M  |       7.8 | Float16          | Float16        |  -0.9%        |
-| MA36                                                    |      42.7M |        84 | Float16          | Float16        | -0.06%        |
 ## Evaluation - Inference time
@@ -42,7 +42,28 @@ Please observe [original license](https://github.com/apple/ml-fastvit/blob/8af59
 |   MA36  | iPhone 12 Pro Max    | 18.0 |                4.50 | Neural Engine         |
 |   MA36  | M3 Max               | 15.0 |                2.99 | Neural Engine         |
 ## Citation
 ```bibtex
 @inproceedings{vasufastvit2023,
   author = {Pavan Kumar Anasosalu Vasu and James Gabriel and Jeff Zhu and Oncel Tuzel and Anurag Ranjan},

 datasets:
 - imagenet-1k
 ---
 # FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization
 Please observe [original license](https://github.com/apple/ml-fastvit/blob/8af5928238cab99c45f64fc3e4e7b1516b8224ba/LICENSE).
 ## Model Details
 - **Model Type:** Image classification / feature backbone
 - **Model Stats:**
   - Params (M): 4.0
 | Variant                                                 | Parameters | Size (MB) | Weight precision | Act. precision | Δ Pytorch acc |
 | ------------------------------------------------------- | ---------: | --------: | ---------------- | -------------- | ------------- |
+| [T8](https://huggingface.co/apple/FastViTT8F16.mlpackage)     |      3.6M  |       7.8 | Float16          | Float16        |  -0.9%        |
+| [MA36](https://huggingface.co/apple/FastViTMA36F16.mlpackage) |      42.7M |        84 | Float16          | Float16        | -0.06%        |
 ## Evaluation - Inference time
 |   MA36  | iPhone 12 Pro Max    | 18.0 |                4.50 | Neural Engine         |
 |   MA36  | M3 Max               | 15.0 |                2.99 | Neural Engine         |
+## Download
+Install `huggingface-cli`
+```bash
+brew install huggingface-cli
+```
+To download one of the `.mlpackage` folders to the `models` directory:
+```bash
+huggingface-cli download \
+  --local-dir models --local-dir-use-symlinks False \
+  apple/coreml-FastViT-T8
+```
+## Integrate in Swift apps
+The [`huggingface/coreml-examples`](https://github.com/huggingface/coreml-examples/blob/main/depth-anything-example/README.md) repository contains sample Swift code for `coreml-FastViT-T8` and other models. See [the instructions there](https://github.com/huggingface/coreml-examples/tree/main/FastViTSample) to build the demo app, which shows how to use the model in your own Swift apps.
 ## Citation
 ```bibtex
 @inproceedings{vasufastvit2023,
   author = {Pavan Kumar Anasosalu Vasu and James Gabriel and Jeff Zhu and Oncel Tuzel and Anurag Ranjan},