---
license: other
license_name: cc-by-4.0
license_link: https://creativecommons.org/licenses/by/4.0/
---

Name: 4xRealWebPhoto_v3_atd  
License: CC BY 4.0  
Author: Philip Hofmann  
Network: [ATD](https://github.com/LabShuHangGU/Adaptive-Token-Dictionary)  
Scale: 4  
Release Date: 22.03.2024  
Purpose: 4x upscaler for photos downloaded from the web  
Iterations: 250'000  
epoch: 10  
batch_size: 6, 3  
HR_size: 128, 192  
Dataset: 4xRealWebPhoto_v3  
Number of train images: 101'904  
OTF Training: No  
Pretrained_Model_G: 003_ATD_SRx4_finetune  

Description:  
4x real web photo upscaler, meant for upscaling photos downloaded from the web. Trained on v3 of my 4xRealWebPhoto dataset, it should be able to handle noise, jpg and webp (re)compression, (re)scaling, and just a little bit of lens blur, while also being able to handle good quality input. Trained on the very recently released (~2 weeks ago) Adaptive-Token-Dictionary network.

My 4xRealWebPhoto dataset tries to simulate the use case of a photo being uploaded to the web and processed by the service provider (as on a social media platform), meaning compression and downscaling, then possibly being downloaded and re-uploaded by another user, where it is again processed by the service provider. I included different variants in the dataset. The pdf with info on the v2 dataset can be found [here](https://github.com/Phhofm/models/releases/download/4xRealWebPhoto_v2_rgt_s/4xRealWebPhoto_v2.pdf), while I simply included what's different in the v3 png:

![4xRealWebPhoto_v3](https://github.com/Phhofm/models/assets/14755670/2ec67e48-bf21-4b57-9f27-69bc49b84315)

Training details:  
AdamW optimizer with U-Net SN discriminator and BFloat16. Degraded with otf jpg compression down to 40, re-compression down to 40, together with resizes and the blur kernels.
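
To make the upload and re-upload idea above more concrete, here is a minimal sketch of a two-pass degradation using Pillow. It is not the script used to build the dataset: the function names, the intermediate size, the bicubic filter, and the upper quality bound of 95 are all assumptions chosen for illustration. Only the lower JPEG quality bound of 40 and the 4x scale come from this card. Blur, noise, and webp compression are left out to keep it short.

```python
import io
import random

from PIL import Image


def jpeg_roundtrip(img, quality):
    """Encode to JPEG in memory at the given quality, then decode again."""
    buf = io.BytesIO()
    img.save(buf, format="JPEG", quality=quality)
    buf.seek(0)
    return Image.open(buf).convert("RGB")


def simulate_web_reupload(hr, scale=4, min_quality=40, seed=None):
    """Hypothetical two-pass degradation: service processing, then a re-upload."""
    rng = random.Random(seed)
    w, h = hr.size
    lr_size = (w // scale, h // scale)

    # First upload: the service downscales part of the way and compresses.
    # The halfway size is an arbitrary choice for this sketch.
    mid_size = ((w + lr_size[0]) // 2, (h + lr_size[1]) // 2)
    img = hr.resize(mid_size, Image.BICUBIC)
    img = jpeg_roundtrip(img, rng.randint(min_quality, 95))

    # Re-upload: scaled again to the final LR size and re-compressed.
    img = img.resize(lr_size, Image.BICUBIC)
    return jpeg_roundtrip(img, rng.randint(min_quality, 95))


# Usage with a synthetic 192x192 image standing in for an HR training crop.
hr = Image.new("RGB", (192, 192), (120, 60, 200))
lr = simulate_web_reupload(hr, seed=0)
print(lr.size)  # (48, 48)
```

The resulting LR image would be paired with the untouched HR image for training at scale 4.
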
Losses:  
PixelLoss using CHC (Clipped Huber with Cosine Similarity Loss), PerceptualLoss using Huber, GANLoss, [LDL](https://github.com/csjliang/LDL) using Huber, [Focal Frequency](https://github.com/EndlessSora/focal-frequency-loss), [Gradient Variance](https://github.com/lusinlu/gradient-variance-loss) with Huber, YCbCr Color Loss (bt601) and Luma Loss (CIE XYZ) on [neosr](https://github.com/muslll/neosr) with norm: true.

11 Examples:  
[Slowpics](https://slow.pics/s/plgWVh4j)

![4xRealWebPhoto_v3_atd_example1](https://github.com/Phhofm/models/assets/14755670/0387b0a3-8078-4df3-b5fa-b784ea7fff47)
![4xRealWebPhoto_v3_atd_example2](https://github.com/Phhofm/models/assets/14755670/7c4a03cb-1303-489a-931c-13a5baa3c699)
![4xRealWebPhoto_v3_atd_example3](https://github.com/Phhofm/models/assets/14755670/c0978ebc-58a7-40dc-a372-635a733c0364)
![4xRealWebPhoto_v3_atd_example4](https://github.com/Phhofm/models/assets/14755670/54675762-18a6-41e2-b2db-d22f9e694b4f)
![4xRealWebPhoto_v3_atd_example5](https://github.com/Phhofm/models/assets/14755670/5e4bb337-1516-4232-8bf7-892f914ed792)
![4xRealWebPhoto_v3_atd_example6](https://github.com/Phhofm/models/assets/14755670/fc41100d-4c6a-4f19-97a0-a007d9323c2f)
![4xRealWebPhoto_v3_atd_example7](https://github.com/Phhofm/models/assets/14755670/0f838d98-218f-4971-927a-941b5774540f)