nicholasKluge
/

TeenyTinyLlama-160m-IMDB

@@ -16,11 +16,11 @@ widget:
 - text: "Esqueceram de mim 2 é o pior filme da franquia inteira."
   example_title: Exemplo
 ---
-# TeenyTinyLlama-162m-IMDB
 TeenyTinyLlama is a series of small foundational models trained in Brazilian Portuguese.
-This repository contains a version of [TeenyTinyLlama-162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m) (`TeenyTinyLlama-162m-IMDB`) fine-tuned on the the [IMDB dataset](https://huggingface.co/datasets/christykoh/imdb_pt).
 ## Details
@@ -38,7 +38,7 @@ from transformers import pipeline
 text = "Esqueceram de mim 2 é um dos melhores filmes de natal de todos os tempos."
-classifier = pipeline("text-classification", model="nicholasKluge/TeenyTinyLlama-162m-IMDB")
 classifier(text)
 # >>> [{'label': 'POSITIVE', 'score': 0.9971244931221008}]
@@ -63,13 +63,13 @@ dataset = load_dataset("christykoh/imdb_pt")
 # Create a `ModelForSequenceClassification`
 model = AutoModelForSequenceClassification.from_pretrained(
-    "nicholasKluge/TeenyTinyLlama-162m",
     num_labels=2,
     id2label={0: "NEGATIVE", 1: "POSITIVE"},
     label2id={"NEGATIVE": 0, "POSITIVE": 1}
 )
-tokenizer = AutoTokenizer.from_pretrained("nicholasKluge/TeenyTinyLlama-162m")
 # Preprocess the dataset
 def preprocess_function(examples):
@@ -124,7 +124,7 @@ trainer.train()
 | Models                                                                                     | [IMDB](https://huggingface.co/datasets/christykoh/imdb_pt) |
 |--------------------------------------------------------------------------------------------|------------------------------------------------------------|
-| [Teeny Tiny Llama 162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m)          | 91.14                                                      |
 | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 92.22                                                      |
 | [Bert-large-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased)| 93.58                                                      |
 | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese)        | 91.60                                                      |
@@ -135,7 +135,7 @@ trainer.train()
 @misc{nicholas22llama,
   doi = {10.5281/zenodo.6989727},
-  url = {https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m},
   author = {Nicholas Kluge Corrêa},
   title = {TeenyTinyLlama},
   year = {2023},
@@ -151,4 +151,4 @@ This repository was built as part of the RAIES ([Rede de Inteligência Artificia
 ## License
-TeenyTinyLlama-162m-IMDB is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.

 - text: "Esqueceram de mim 2 é o pior filme da franquia inteira."
   example_title: Exemplo
 ---
+# TeenyTinyLlama-160m-IMDB
 TeenyTinyLlama is a series of small foundational models trained in Brazilian Portuguese.
+This repository contains a version of [TeenyTinyLlama-160m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-160m) (`TeenyTinyLlama-160m-IMDB`) fine-tuned on the the [IMDB dataset](https://huggingface.co/datasets/christykoh/imdb_pt).
 ## Details
 text = "Esqueceram de mim 2 é um dos melhores filmes de natal de todos os tempos."
+classifier = pipeline("text-classification", model="nicholasKluge/TeenyTinyLlama-160m-IMDB")
 classifier(text)
 # >>> [{'label': 'POSITIVE', 'score': 0.9971244931221008}]
 # Create a `ModelForSequenceClassification`
 model = AutoModelForSequenceClassification.from_pretrained(
+    "nicholasKluge/TeenyTinyLlama-160m",
     num_labels=2,
     id2label={0: "NEGATIVE", 1: "POSITIVE"},
     label2id={"NEGATIVE": 0, "POSITIVE": 1}
 )
+tokenizer = AutoTokenizer.from_pretrained("nicholasKluge/TeenyTinyLlama-160m")
 # Preprocess the dataset
 def preprocess_function(examples):
 | Models                                                                                     | [IMDB](https://huggingface.co/datasets/christykoh/imdb_pt) |
 |--------------------------------------------------------------------------------------------|------------------------------------------------------------|
+| [Teeny Tiny Llama 160m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-160m)          | 91.14                                                      |
 | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 92.22                                                      |
 | [Bert-large-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased)| 93.58                                                      |
 | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese)        | 91.60                                                      |
 @misc{nicholas22llama,
   doi = {10.5281/zenodo.6989727},
+  url = {https://huggingface.co/nicholasKluge/TeenyTinyLlama-160m},
   author = {Nicholas Kluge Corrêa},
   title = {TeenyTinyLlama},
   year = {2023},
 ## License
+TeenyTinyLlama-160m-IMDB is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.