pere commited on
Commit
74202fe
2 Parent(s): 1a2e9a5 fafeb2f

Merge branch 'main' of https://huggingface.co/pere/norwegian-gpt2-social into main

Browse files
Files changed (1) hide show
  1. README.md +15 -4
README.md CHANGED
@@ -5,12 +5,23 @@ tags:
5
  - norwegian
6
  - GPT2
7
  - casual language modeling
8
- datasets:
9
- - oscar
10
  ---
11
 
12
- # Norwegian GPT-2 - Oscar
13
 
14
  ## Description
 
15
 
16
- This is a sample reference model trained only on the Oscar Corpus for a day on a TPU v3-8. Pretrained model on Norwegian language using a causal language modeling (CLM) objective.
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  - norwegian
6
  - GPT2
7
  - casual language modeling
 
 
8
  ---
9
 
10
+ # Norwegian GPT-2 - Social
11
 
12
  ## Description
13
+ Experimental Norwegian GPT-2-model trained on a 37GB mainly social corpus.
14
 
15
+ The following sub-corpora are used:
16
+ ```bash
17
+ wikipedia_download_nb.jsonl
18
+ wikipedia_download_nn.jsonl
19
+ newspapers_online_nb.jsonl
20
+ newspapers_online_nn.jsonl
21
+ twitter_2016_2018_no.jsonl
22
+ twitter_news_2016_2018_no.jsonl
23
+ open_subtitles_no.jsonl
24
+ facebook_no.jsonl
25
+ reddit_no.jsonl
26
+ vgdebatt_no.jsonl
27
+ ```