Upload pth model file
Browse files- 4xRealWebPhoto_v3.txt +30 -0
- 4xRealWebPhoto_v3_atd.pth +3 -0
4xRealWebPhoto_v3.txt
ADDED
@@ -0,0 +1,30 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
Name: 4xRealWebPhoto_v3_atd
|
2 |
+
License: CC BY 4.0
|
3 |
+
Author: Philip Hofmann
|
4 |
+
Network: [ATD](https://github.com/LabShuHangGU/Adaptive-Token-Dictionary)
|
5 |
+
Scale: 4
|
6 |
+
Release Date: 22.03.2024
|
7 |
+
Purpose: 4x upscaler for photos downloaded from the web
|
8 |
+
Iterations: 250'000
|
9 |
+
epoch: 10
|
10 |
+
batch_size: 6, 3
|
11 |
+
HR_size: 128, 192
|
12 |
+
Dataset: 4xRealWebPhoto_v3
|
13 |
+
Number of train images: 101'904
|
14 |
+
OTF Training: No
|
15 |
+
Pretrained_Model_G: 003_ATD_SRx4_finetune
|
16 |
+
|
17 |
+
Description:
|
18 |
+
4x real web photo upscaler, meant for upscaling photos downloaded from the web. Trained on my v3 of my 4xRealWebPhoto dataset, it should be able to handle noise, jpg and webp (re)compression, (re)scaling, and just a little bit of lens blur, while also be able to handle good quality input. Trained on the very recently released (~2 weeks ago) Adaptive-Token-Dictionary network.
|
19 |
+
|
20 |
+
My 4xRealWebPhoto dataset tried to simulate the use-case of a photo being uploaded to the web and being processed by the service provides (like on a social media platform) so compression/downscaling, then maybe being downloaded and re-uploaded by another used where it, again, were processed by the service provider. I included different variants in the dataset. The pdf with info to the v2 dataset can be found [here](https://github.com/Phhofm/models/releases/download/4xRealWebPhoto_v2_rgt_s/4xRealWebPhoto_v2.pdf), while i simply included whats different in the v3 png.
|
21 |
+
|
22 |
+
Training details:
|
23 |
+
AdamW optimizer with U-Net SN discriminator and BFloat16.
|
24 |
+
Degraded with otf jpg compression down to 40, re-compression down to 40, together with resizes and the blur kernels.
|
25 |
+
Losses: PixelLoss using CHC (Clipped Huber with Cosine Similarity Loss), PerceptualLoss using Huber, GANLoss, [LDL](https://github.com/csjliang/LDL) using Huber, [Focal Frequency](https://github.com/EndlessSora/focal-frequency-loss), [Gradient Variance](https://github.com/lusinlu/gradient-variance-loss) with Huber, YCbCr Color Loss (bt601) and Luma Loss (CIE XYZ) on [neosr](https://github.com/muslll/neosr) with norm: true.
|
26 |
+
|
27 |
+
11 Examples:
|
28 |
+
[Slowpics](https://slow.pics/s/plgWVh4j)
|
29 |
+
|
30 |
+
|
4xRealWebPhoto_v3_atd.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:34d19b7bc551db75e454c6fd636bf92927515b95fce82aa4dfaac813b7529763
|
3 |
+
size 81959074
|