pcuenq HF staff commited on
Commit
e830c08
1 Parent(s): 4e1efab

Upload folder using huggingface_hub

Browse files
.DS_Store ADDED
Binary file (8.2 kB). View file
 
README.md ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ pipeline_tag: mask-generation
4
+ library_name: coreml
5
+ ---
6
+
7
+ # SAM 2.1 Large Core ML
8
+
9
+ SAM 2 (Segment Anything in Images and Videos), is a collection of foundation models from FAIR that aim to solve promptable visual segmentation in images and videos. See the [SAM 2 paper](https://arxiv.org/abs/2408.00714) for more information.
10
+
11
+ This is the Core ML version of [SAM 2.1 Large](https://huggingface.co/facebook/sam2.1-hiera-large), and is suitable for use with the [SAM2 Studio demo app](https://github.com/huggingface/sam2-studio). It was converted in `float16` precision using [this fork](https://github.com/huggingface/segment-anything-2/tree/coreml-conversion) of the original code repository.
12
+
13
+ ## Download
14
+
15
+ Install `huggingface-cli`
16
+
17
+ ```bash
18
+ brew install huggingface-cli
19
+ ```
20
+
21
+ ```bash
22
+ huggingface-cli download --local-dir models apple/coreml-sam2.1-large
23
+ ```
24
+
25
+ ## Citation
26
+
27
+ To cite the paper, model, or software, please use the below:
28
+ ```
29
+ @article{ravi2024sam2,
30
+ title={SAM 2: Segment Anything in Images and Videos},
31
+ author={Ravi, Nikhila and Gabeur, Valentin and Hu, Yuan-Ting and Hu, Ronghang and Ryali, Chaitanya and Ma, Tengyu and Khedr, Haitham and R{\"a}dle, Roman and Rolland, Chloe and Gustafson, Laura and Mintun, Eric and Pan, Junting and Alwala, Kalyan Vasudev and Carion, Nicolas and Wu, Chao-Yuan and Girshick, Ross and Doll{\'a}r, Piotr and Feichtenhofer, Christoph},
32
+ journal={arXiv preprint arXiv:2408.00714},
33
+ url={https://arxiv.org/abs/2408.00714},
34
+ year={2024}
35
+ }
36
+ ```
SAM2_1LargeImageEncoderFLOAT16.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b382195d0c3e2a93798f39e36085a71e66b1525ebd679d23a1bc8466361028b8
3
+ size 528540
SAM2_1LargeImageEncoderFLOAT16.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a2908ac87bd862d756a4966c0ea9e710297bbc38b37b8c0243fc33dac5ff51d0
3
+ size 444351360
SAM2_1LargeImageEncoderFLOAT16.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "0B729D2F-6020-4569-83DF-08B829F14B34": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "6F8459D6-7777-47FB-A310-4056C9DEB630": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "6F8459D6-7777-47FB-A310-4056C9DEB630"
18
+ }
SAM2_1LargeMaskDecoderFLOAT16.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:71d343eb9464951bb6e7de6d8099c268b0dd2bdeb78e694e792b2562a5b55bea
3
+ size 75167
SAM2_1LargeMaskDecoderFLOAT16.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d7ccf6fc95e4fe21dba407f0d97a5859c1abce0c31edf1befb14714cfac58a7f
3
+ size 10222400
SAM2_1LargeMaskDecoderFLOAT16.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "7C14C788-4FA7-4013-BF0B-10D6283B2DD5": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Specification",
7
+ "name": "model.mlmodel",
8
+ "path": "com.apple.CoreML/model.mlmodel"
9
+ },
10
+ "924A88C9-15B3-4193-8BBB-828E36705A28": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Weights",
13
+ "name": "weights",
14
+ "path": "com.apple.CoreML/weights"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "7C14C788-4FA7-4013-BF0B-10D6283B2DD5"
18
+ }
SAM2_1LargePromptEncoderFLOAT16.mlpackage/Data/com.apple.CoreML/model.mlmodel ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3a83c167d8bd63e80f86349a78c2ab0527ce97eca1f848a4ce57fe5351241fa3
3
+ size 20618
SAM2_1LargePromptEncoderFLOAT16.mlpackage/Data/com.apple.CoreML/weights/weight.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eadc79dd01eaf71001525a84bf835c8925d3a0463d1b34786bb6b734714fb0e7
3
+ size 2101056
SAM2_1LargePromptEncoderFLOAT16.mlpackage/Manifest.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "fileFormatVersion": "1.0.0",
3
+ "itemInfoEntries": {
4
+ "14032019-108D-4843-B3DB-EF5CBB615F7F": {
5
+ "author": "com.apple.CoreML",
6
+ "description": "CoreML Model Weights",
7
+ "name": "weights",
8
+ "path": "com.apple.CoreML/weights"
9
+ },
10
+ "215E4A09-5F4A-4DD9-AC2F-62D1313CAA88": {
11
+ "author": "com.apple.CoreML",
12
+ "description": "CoreML Model Specification",
13
+ "name": "model.mlmodel",
14
+ "path": "com.apple.CoreML/model.mlmodel"
15
+ }
16
+ },
17
+ "rootModelIdentifier": "215E4A09-5F4A-4DD9-AC2F-62D1313CAA88"
18
+ }