andreiujica committed
Commit: 2684b92
Parent(s): de03f06
Update README.md

README.md CHANGED
@@ -60,15 +60,16 @@ It achieves the following results on the evaluation set:
 
 ## Model description
 
-More information needed
+This model is a long-form text generation model based on the Longformer Encoder-Decoder (LED) architecture. The LED architecture extends the Transformer model to handle long documents by incorporating sparse attention mechanisms. This makes it suitable for tasks such as summarization of lengthy patent documents, where traditional models might struggle with context-length limitations. The model has been fine-tuned on the BigPatent dataset, a large collection of patent documents, to enhance its performance in generating concise and informative summaries.
 
 ## Intended uses & limitations
 
-More information needed
+Intended uses
+- Patent summarization: Generate concise summaries of patent documents.
+- Long document summarization: Useful for summarizing other types of long-form documents beyond patents.
 
-## Training and evaluation data
-
-More information needed
+Limitations
+- Context length: Although LED handles long documents better than standard Transformers, extremely lengthy documents might still present challenges.
 
 ## Training procedure
 
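The description added in this commit explains that LED's sparse attention is what makes long patent inputs tractable. As a minimal usage sketch of that workflow with the transformers library, not part of the commit itself: the repository id below is a placeholder, and the 16384-token input limit assumes an led-base-16384-style checkpoint; the generation settings are illustrative, not values from the model card.

```python
# Sketch: summarizing a long patent document with a fine-tuned LED checkpoint.
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "your-namespace/led-base-big-patent"  # placeholder id (assumption)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

patent_text = open("patent.txt").read()  # a long input document

# LED's sparse attention lets the encoder take very long inputs;
# led-base-16384-style checkpoints accept up to 16384 tokens.
inputs = tokenizer(
    patent_text, max_length=16384, truncation=True, return_tensors="pt"
)

# Convention for LED summarization: give the first token (<s>) global attention.
global_attention_mask = torch.zeros_like(inputs["input_ids"])
global_attention_mask[:, 0] = 1

summary_ids = model.generate(
    inputs["input_ids"],
    attention_mask=inputs["attention_mask"],
    global_attention_mask=global_attention_mask,
    max_length=256,
    num_beams=4,
)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```

The truncation at 16384 tokens reflects the context-length limitation noted in the new README text: documents longer than the encoder's window still need to be truncated or chunked before summarization.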