pszemraj
/

long-t5-tglobal-xl-16384-book-summary-8bit

@@ -1,24 +1,27 @@
 ---
 license:
-  - apache-2.0
-  - bsd-3-clause
 tags:
-  - summarization
-  - summary
-  - booksum
-  - long-document
-  - long-form
-  - tglobal-xl
-  - XL
 datasets:
-  - kmfoda/booksum
 metrics:
-  - rouge
 inference: false
 ---
-# long-t5-tglobal-xl-16384-book-summary: the 8-bit quantized version
 <a href="https://colab.research.google.com/gist/pszemraj/c19e32baf876deb866c31cd46c86e893/long-t5-xl-accelerate-test.ipynb">
   <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
@@ -28,9 +31,9 @@ This is an 8-bit quantized version of the `pszemraj/long-t5-tglobal-xl-16384-boo
 Refer to the [original model](https://huggingface.co/pszemraj/long-t5-tglobal-xl-16384-book-summary) for all details about the model architecture and training process. For more information on loading 8-bit models, refer to the `4.28.0` [release information](https://github.com/huggingface/transformers/releases/tag/v4.28.0) and the [example repository](https://huggingface.co/ybelkada/bloom-1b7-8bit).
-- The total size of the model is only ~3.5 GB, much smaller than the original size.
-- This allows for low-RAM loading, making it easier to use in memory-limited environments.
-- `bitsandbytes` - AFAIK at time of writing - only works on GPU
 ## Basic Usage
@@ -56,4 +59,4 @@ model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
 - This is an 8-bit quantized version of `pszemraj/long-t5-tglobal-xl-16384-book-summary`.
   - It generalizes reasonably well to academic and narrative text.
-  - The XL checkpoint typically generates summaries that are considerably better from a human evaluation perspective.

 ---
 license:
+- apache-2.0
+- bsd-3-clause
 tags:
+- summarization
+- summary
+- booksum
+- long-document
+- long-form
+- tglobal-xl
+- XL
+- 8bit
+- quantized
 datasets:
+- kmfoda/booksum
 metrics:
+- rouge
 inference: false
+pipeline_tag: summarization
 ---
+# long-t5-tglobal-xl-16384-book-summary: 8-bit quantized version
 <a href="https://colab.research.google.com/gist/pszemraj/c19e32baf876deb866c31cd46c86e893/long-t5-xl-accelerate-test.ipynb">
   <img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/>
 Refer to the [original model](https://huggingface.co/pszemraj/long-t5-tglobal-xl-16384-book-summary) for all details about the model architecture and training process. For more information on loading 8-bit models, refer to the `4.28.0` [release information](https://github.com/huggingface/transformers/releases/tag/v4.28.0) and the [example repository](https://huggingface.co/ybelkada/bloom-1b7-8bit).
+- The total size of the model is only ~3.5 GB (vs original 12 GB)
+- Enables low-RAM loading, making it easier to use in memory-limited environments like Colab
+- Requires `bitsandbytes` - AFAIK at time of writing, only works on GPU
 ## Basic Usage
 - This is an 8-bit quantized version of `pszemraj/long-t5-tglobal-xl-16384-book-summary`.
   - It generalizes reasonably well to academic and narrative text.
+  - The XL checkpoint typically generates summaries that are considerably better from a human evaluation perspective.