|
--- |
|
language: en |
|
license: mit |
|
tags: |
|
- vision |
|
- image-to-text |
|
- image-captioning |
|
model_name: microsoft/git-base |
|
pipeline_tag: image-to-text |
|
library_name: transformers |
|
datasets: |
|
- ayoubkirouane/One-Piece-anime-captions |
|
--- |
|
|
|
|
|
# Model Details |
|
+ **Model Name**: Git-base-One-Piece |
|
+ **Base Model**: Microsoft's "git-base" model |
|
+ **Model Type**: Generative Image-to-Text (GIT) |
|
+ **Fine-Tuned** On: 'One-Piece-anime-captions' dataset |
|
+ **Fine-Tuning Purpose**: To generate text captions for images related to the anime series "One Piece." |
|
|
|
|
|
|
|
## Model Description |
|
**Git-base-One-Piece** is a fine-tuned variant of Microsoft's **git-base** model, specifically trained for the task of generating descriptive text captions for images from the **One-Piece-anime-captions** dataset. |
|
|
|
The dataset consists of **856 {image: caption}** pairs, providing a substantial and diverse training corpus for the model. |
|
|
|
The model is conditioned on both CLIP image tokens and text tokens and employs a **teacher forcing** training approach. It predicts the next text token while considering the context provided by the image and previous text tokens. |
|
|
|
|
|
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6338c06c107c4835a05699f9/N_yNK2tLabtwmSYAqpTEp.jpeg) |
|
|
|
## Limitations |
|
+ The quality of generated captions may vary depending on the complexity and diversity of images from the **One-Piece-anime-captions** dataset. |
|
+ The model's output is based on the data it was fine-tuned on, so it may not generalize well to images outside the dataset's domain. |
|
Generating highly detailed or contextually accurate captions may still be a challenge. |
|
|
|
|
|
## Usage |
|
|
|
```python |
|
# Use a pipeline as a high-level helper |
|
from transformers import pipeline |
|
|
|
pipe = pipeline("image-to-text", model="ayoubkirouane/git-base-One-Piece") |
|
``` |
|
|
|
**or** |
|
|
|
```python |
|
# Load model directly |
|
from transformers import AutoProcessor, AutoModelForCausalLM |
|
|
|
processor = AutoProcessor.from_pretrained("ayoubkirouane/git-base-One-Piece") |
|
model = AutoModelForCausalLM.from_pretrained("ayoubkirouane/git-base-One-Piece") |
|
``` |