Dataset

Collection of William Shakespeare plays

  • tiktoken - gpt2 tokenizer is used for tokenization
  • Number of total tokens - 338025

The HuggingFace Spaces Gradio App

The app is available here

The App takes following as input

  1. Seed Text (Prompt) - This is provided as input text to the GPT model, based on which it generates further contents. If no data is provided, the only a space (" ") is provided as input
  2. Max tokens to generate - This controls the numbers of tokens it will generate. The default value is 100.
  3. Temperature - This accepts values between 0 to 1. Higher value introduces more randomness in the next token generation. Default value is set to 0.7.
  4. Select Top N in each step - This is an optional field. If no value is provided (or <= 0), all available tokens are considered for the next token prediction based on SoftMax probability. However, if a number is set then only that many top tokes will be considered for the next token prediction.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Space using sayanbanerjee32/nanogpt2_test 1