ProteusSigma / README.md

Update README.md

927f2b8 verified about 1 month ago

5.64 kB

	---
	license: apache-2.0
	language:
	- en
	base_model:
	- stabilityai/stable-diffusion-xl-base-1.0
	pipeline_tag: text-to-image
	tags:
	- art
	---
	# SDXL-ProteusSigma Training with ZTSNR and NovelAI V3 Improvements

	- [x] 10k dataset proof of concept (completed)

	- [x] 500k+ dataset finetune (completed) [Note: not aesthetically tuned whatsoever]

	- [ ] 12M million dataset finetune (planned)

	<style>
	.logo-container {
	position: relative;
	text-align: center;
	margin: 40px 0;
	}

	.text-layer {
	font-family: 'Arial Black', 'Helvetica', sans-serif;
	font-size: 72px;
	font-weight: bold;
	white-space: nowrap;
	}

	.text-base {
	position: relative;
	color: #ff71ce;
	text-shadow: 2px 2px 0 #ff00ff;
	}

	.text-overlay {
	position: absolute;
	left: 50%;
	top: 50%;
	transform: translate(-49%, -47%); /* Slightly offset */
	color: #01cdfe;
	text-shadow: -2px -2px 0 #00ffff;
	opacity: 0.8;
	mix-blend-mode: screen;
	}

	.sigma {
	color: #00ffff;
	text-shadow:
	2px 2px 0 #ff00ff,
	-2px -2px 0 #00ffff;
	}
	</style>

	<div class="logo-container">
	<div class="text-layer text-overlay">
	Proteus<span class="sigma">Σ</span>
	</div>
	<div class="text-layer text-base">
	Proteus<span class="sigma">Σ</span>
	</div>
	</div>

	## Example Outputs

	<style>
	.gallery {
	display: flex;
	flex-direction: row;
	flex-wrap: wrap;
	gap: 10px;
	justify-content: center;
	align-items: center;
	width: 100%;
	padding: 10px;
	}

	.gallery-item {
	flex: 0 0 300px;
	margin: 0;
	position: relative;
	}

	.gallery-item.large { /* New class for larger item */
	flex: 0 0 340px;
	}

	.gallery img {
	width: 300px;
	cursor: pointer;
	transition: transform 0.2s;
	border-radius: 8px;
	}

	.gallery-item.large img { /* Larger size for last image */
	width: 512px;
	}

	.gallery img:hover {
	transform: scale(1.05);
	}

	.caption {
	position: absolute;
	bottom: 0;
	left: 0;
	right: 0;
	background: rgba(0, 0, 0, 0.4);
	color: white;
	padding: 8px;
	font-size: 11px;
	border-bottom-left-radius: 8px;
	border-bottom-right-radius: 8px;
	opacity: 0.7;
	transition: opacity 0.3s ease;
	}

	.gallery-item:hover .caption {
	opacity: 0.2;
	}

	.modal {
	display: none;
	position: fixed;
	z-index: 1000;
	top: 0;
	left: 0;
	width: 100%;
	height: 100%;
	background-color: rgba(0,0,0,0.9);
	padding: 20px;
	box-sizing: border-box;
	}

	.modal img {
	max-width: 90%;
	max-height: 90vh;
	margin: auto;
	display: block;
	position: relative;
	top: 50%;
	transform: translateY(-50%);
	}

	.modal.active {
	display: block;
	}
	</style>

	<div class="gallery">
	<div class="gallery-item">
	<img src="https://huggingface.co/dataautogpt3/ProteusSigma/resolve/main/example.png" alt="Example Output 1" onclick="showImage(this.src)"/>
	</div>
	<div class="gallery-item">
	<img src="https://huggingface.co/dataautogpt3/ProteusSigma/resolve/main/example2.png" alt="Example Output 2" onclick="showImage(this.src)"/>
	</div>
	<div class="gallery-item">
	<img src="https://huggingface.co/dataautogpt3/ProteusSigma/resolve/main/example3.png" alt="Example Output 3" onclick="showImage(this.src)"/>
	</div>
	<div class="gallery-item">
	<img src="https://huggingface.co/dataautogpt3/ProteusSigma/resolve/main/example4.png" alt="Example Output 4" onclick="showImage(this.src)"/>
	</div>
	<div class="gallery-item large"> <!-- Added 'large' class -->
	<img src="https://huggingface.co/dataautogpt3/ProteusSigma/resolve/main/example5.png" alt="Example Output 5" onclick="showImage(this.src)"/>
	</div>
	</div>

	<div class="modal" onclick="this.classList.remove('active')">
	<img id="modal-img" src="" alt="Full size image"/>
	</div>

	<script>
	function showImage(src) {
	document.getElementById('modal-img').src = src;
	document.querySelector('.modal').classList.add('active');
	}
	</script>


	# Combined Proteus and Mobius datasets with ZTSNR and NovelAI V3 Improvements

	CUSTOM INFERENCE IS REQUIRED FOR BEST RESULTS!

	https://github.com/DataCTE/SDXL-Training-Improvements/tree/main/Comfyui-zsnrnode

	use this comfyui custom node from the training repo.

	and the workflow here: https://github.com/DataCTE/SDXL-Training-Improvements/blob/main/Comfyui-zsnrnode/ztsnr%2Bv-pred.json



	## Model Details

	- Model Type: SDXL Fine-tuned with ZTSNR and NovelAI V3 Improvements
	- Base Model: stabilityai/stable-diffusion-xl-base-1.0
	- Training Dataset: 500,000 high-quality images
	- License: Apache 2.0

	## Key Features

	- Zero Terminal SNR (ZTSNR) implementation
	- Increased σ_max ≈ 20000.0 (NovelAI research)
	- High-resolution coherence enhancements

	## Training Details

	### Training Configuration
	- Learning Rate: 4e-7
	- Batch Size: 8
	- Gradient Accumulation Steps: 8
	- Epochs: 80
	- Optimizer: AdamW
	- Precision: bfloat16

	## Repository and Resources

	- GitHub Repository: [SDXL-Training-Improvements](https://github.com/DataCTE/SDXL-Training-Improvements)
	- Training Code: Available in the repository
	- Documentation: [Implementation Details](https://github.com/DataCTE/SDXL-Training-Improvements/blob/main/README.md)
	- Issues and Support: [GitHub Issues](https://github.com/DataCTE/SDXL-Training-Improvements/issues)

	## Citation

	```bibtex
	@article{ossa2024improvements,
	title={Improvements to SDXL in NovelAI Diffusion V3},
	author={Ossa, Juan and Doğan, Eren and Birch, Alex and Johnson, F.},
	journal={arXiv preprint arXiv:2409.15997v2},
	year={2024}
	}
	```