ed-sheeran / README.md
AlekseyKorshuk's picture
huggingartists
af98d26
|
raw
history blame
3.78 kB
metadata
language: en
datasets:
  - huggingartists/ed-sheeran
tags:
  - huggingartists
  - lyrics
  - lm-head
  - causal-lm
widget:
  - text: I am
🤖 HuggingArtists Model 🤖
Ed Sheeran
@ed-sheeran

I was made with huggingartists.

Create your own bot based on your favorite artist with the demo!

How does it work?

To understand how the model was developed, check the W&B report.

Training data

The model was trained on lyrics from Ed Sheeran.

Dataset is available here. And can be used with:

from datasets import load_dataset

dataset = load_dataset("huggingartists/ed-sheeran")

Explore the data, which is tracked with W&B artifacts at every step of the pipeline.

Training procedure

The model is based on a pre-trained GPT-2 which is fine-tuned on Ed Sheeran's lyrics.

Hyperparameters and metrics are recorded in the W&B training run for full transparency and reproducibility.

At the end of training, the final model is logged and versioned.

How to use

You can use this model directly with a pipeline for text generation:

from transformers import pipeline
generator = pipeline('text-generation',
                     model='huggingartists/ed-sheeran')
generator("I am", num_return_sequences=5)

Or with Transformers library:

from transformers import AutoTokenizer, AutoModelWithLMHead
  
tokenizer = AutoTokenizer.from_pretrained("huggingartists/ed-sheeran")

model = AutoModelWithLMHead.from_pretrained("huggingartists/ed-sheeran")

Limitations and bias

The model suffers from the same limitations and bias as GPT-2.

In addition, the data present in the user's tweets further affects the text generated by the model.

About

Built by Aleksey Korshuk

Follow

Follow

Follow

For more details, visit the project repository.

GitHub stars