anonymous-birder commited on
Commit
c5d3f55
1 Parent(s): 4dd91c4

Upload 3 files

Browse files
Files changed (3) hide show
  1. README.md +39 -0
  2. config.json +0 -0
  3. model.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ tags: []
4
+ ---
5
+
6
+ # ConvNext (trained on XCL from BirdSet)
7
+
8
+ ConvNext trained on the XCL dataset from BirdSet, covering 9736 bird species from Xeno-Canto. Please refer to the [BirdSet Paper](https://arxiv.org/pdf/2403.10380) and the
9
+ [BirdSet Repository](https://github.com/DBD-research-group/BirdSet/tree/main) for further information.
10
+
11
+ ### Model Details
12
+ ConvNeXT is a pure convolutional model (ConvNet), inspired by the design of Vision Transformers, that claims to outperform them.
13
+
14
+ ## How to use
15
+ The BirdSet data needs a custom processor that is available in the BirdSet repository. The model does not have a processor available.
16
+ The model accepts a mono image (spectrogram) as input (e.g., `torch.Size([16, 1, 128, 1024])`)
17
+
18
+ - The model is trained on 5-second clips of bird vocalizations.
19
+ - num_channels: 1
20
+ - pretrained checkpoint: facebook/convnext-base-224-22k
21
+ - sampling_rate: 32_000
22
+ - normalize spectrogram: mean: -4.268, std: 4.569 (from esc-50)
23
+ - spectrogram: n_fft: 1024, hop_length: 320, power: 2.0
24
+ - melscale: n_mels: 128, n_stft: 513
25
+ - dbscale: top_db: 80
26
+
27
+ ```python
28
+ import torch
29
+ from transformers import AutoModelForImageClassification
30
+ from datasets import load_dataset
31
+
32
+ dataset = load_dataset("DBD-research-group/BirdSet", "HSN")
33
+ ```
34
+
35
+ ## Model Source
36
+ - **Repository:** [BirdSet Repository](https://github.com/DBD-research-group/BirdSet/tree/main)
37
+ - **Paper [optional]:** [BirdSet Paper](https://arxiv.org/pdf/2403.10380)
38
+
39
+ ## Citation
config.json ADDED
The diff for this file is too large to render. See raw diff
 
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:23aee16191e2df1000648a2466bc2213829f5b690d885252078927f9a624d46f
3
+ size 390208528