Wi-zz/joy-caption-pre-alpha


Wi-zz committed on Aug 25

Commit b3632d5 • 1 Parent(s): 6b99536

Update README.md

Files changed (1)

README.md +51 -53

README.md CHANGED

@@ -1,54 +1,52 @@
-Here's a concise, structured, and aesthetically formatted markdown description for your GitHub repo README:
-# Image Captioning App
-## Overview
-This application generates descriptive captions for images using advanced ML models. It processes single images or entire directories, leveraging CLIP and LLM models for accurate and contextual captions. It has NSFW captioning support with natural language.
-## Features
-- Single image and batch processing
-- Multiple directory support
-- Custom output directory
-- Adjustable batch size
-- Progress tracking
-## Usage
-| Command | Description |
-|---------|-------------|
-| `python app.py image.jpg` | Process a single image |
-| `python app.py /path/to/directory` | Process all images in a directory |
-| `python app.py /path/to/dir1 /path/to/dir2` | Process multiple directories |
-| `python app.py /path/to/dir --output /path/to/output` | Specify output directory |
-| `python app.py /path/to/dir --bs 8` | Set batch size (default: 4) |
-## Technical Details
-- **Models**: CLIP (vision), LLM (language), custom ImageAdapter
-- **Optimization**: CUDA-enabled GPU support
-- **Error Handling**: Skips problematic images in batch processing
-## Requirements
-- Python 3.x
-- PyTorch
-- Transformers library
-- CUDA-capable GPU (recommended)
-## Installation
-```bash
-git clone https://huggingface.co/Wi-zz/joy-caption-pre-alpha
-cd joy-caption-pre-alpha
-pip install -r requirements.txt
-```
-## Contributing
-Contributions are welcome! Please feel free to submit a Pull Request.
-## License
 This project is licensed under the [MIT License](LICENSE).

+# Image Captioning App
+## Overview
+This application generates descriptive captions for images using advanced ML models. It processes single images or entire directories, leveraging CLIP and LLM models for accurate and contextual captions. It has NSFW captioning support with natural language.
+## Features
+- Single image and batch processing
+- Multiple directory support
+- Custom output directory
+- Adjustable batch size
+- Progress tracking
+## Usage
+| Command | Description |
+|---------|-------------|
+| `python app.py image.jpg` | Process a single image |
+| `python app.py /path/to/directory` | Process all images in a directory |
+| `python app.py /path/to/dir1 /path/to/dir2` | Process multiple directories |
+| `python app.py /path/to/dir --output /path/to/output` | Specify output directory |
+| `python app.py /path/to/dir --bs 8` | Set batch size (default: 4) |
+## Technical Details
+- **Models**: CLIP (vision), LLM (language), custom ImageAdapter
+- **Optimization**: CUDA-enabled GPU support
+- **Error Handling**: Skips problematic images in batch processing
+## Requirements
+- Python 3.x
+- PyTorch
+- Transformers library
+- CUDA-capable GPU (recommended)
+## Installation
+```bash
+git clone https://huggingface.co/Wi-zz/joy-caption-pre-alpha
+cd joy-caption-pre-alpha
+pip install -r requirements.txt
+```
+## Contributing
+Contributions are welcome! Please feel free to submit a Pull Request.
+## License
 This project is licensed under the [MIT License](LICENSE).
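
The Usage table in this README documents positional inputs (image files or directories) plus `--output` and `--bs` (default: 4). The `app.py` source is not part of this diff, so the following is only a minimal sketch of how such a command line could be parsed and how inputs could be grouped into batches, assuming `argparse`. The helper names (`build_parser`, `collect_images`, `batched`), the extension list, and the non-recursive directory scan are all hypothetical and may not match the real script.

```python
import argparse
from pathlib import Path

# Assumption: the real app may accept a different set of image extensions.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp", ".bmp"}


def build_parser():
    """Build a parser mirroring the flags listed in the README's Usage table."""
    parser = argparse.ArgumentParser(description="Image captioning CLI (illustrative sketch)")
    parser.add_argument("inputs", nargs="+", type=Path,
                        help="one or more image files or directories")
    parser.add_argument("--output", type=Path, default=None,
                        help="directory for caption files (where captions go by default is an assumption)")
    parser.add_argument("--bs", type=int, default=4,
                        help="batch size (README states the default is 4)")
    return parser


def collect_images(inputs):
    """Expand files and directories into a flat list of image paths."""
    images = []
    for item in inputs:
        if item.is_dir():
            # Assumption: only the top level of each directory is scanned.
            images.extend(sorted(p for p in item.iterdir()
                                 if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS))
        elif item.suffix.lower() in IMAGE_EXTENSIONS:
            images.append(item)
    return images


def batched(items, batch_size):
    """Yield consecutive slices of at most batch_size items."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


if __name__ == "__main__":
    args = build_parser().parse_args()
    for batch in batched(collect_images(args.inputs), args.bs):
        # Placeholder: the real app would run the captioning models here.
        print(f"would caption {len(batch)} image(s): {[p.name for p in batch]}")
```

Run as `python sketch.py /path/to/dir --bs 8`, this prints the batches it would process; the captioning step itself is deliberately left as a placeholder.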