metadata

license:
  - apache-2.0
  - bsd-3-clause
tags:
  - summarization
  - summary
  - booksum
  - long-document
  - long-form
  - tglobal-xl
  - XL
  - 8bit
  - quantized
datasets:
  - kmfoda/booksum
metrics:
  - rouge
inference: false
pipeline_tag: summarization

long-t5-tglobal-xl-16384-book-summary: 8-bit quantized version

This is an 8-bit quantized version of the pszemraj/long-t5-tglobal-xl-16384-book-summary model, The model has been compressed using bitsandbytes and can be loaded with low memory usage.

Refer to the original model for all details about the model architecture and training process. For more information on loading 8-bit models, refer to the 4.28.0 release information and the example repository.

The total size of the model is only ~3.5 GB (vs original 12 GB)
Enables low-RAM loading, making it easier to use in memory-limited environments like Colab
Requires bitsandbytes - AFAIK at time of writing, only works on GPU

Basic Usage

To use the model, install or upgrade transformers, accelerate, and bitsandbytes. Make sure to have transformers>=4.28.0 and bitsandbytes>0.37.2.

pip install -U -q transformers bitsandbytes accelerate

Load the model with AutoTokenizer and AutoModelForSeq2SeqLM:

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_name = "pszemraj/long-t5-tglobal-xl-16384-book-summary-8bit"
tokenizer = AutoTokenizer.from_pretrained(model_name)

model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

More information about long-t5-tglobal-xl-16384-book-summary

This is an 8-bit quantized version of pszemraj/long-t5-tglobal-xl-16384-book-summary.
- It generalizes reasonably well to academic and narrative text.
- The XL checkpoint typically generates summaries that are considerably better from a human evaluation perspective.