gpt4chan_model imageboard_datasets https://www.apache.org/licenses/LICENSE-2.0 Internet Archive Python library 3.0.1 data valentino.giudice96@gmail.com GPT-4chan Model 2022-06-07 01:56:14 2022-06-07 01:56:14 [curator]validator@archive.org[/curator][date]20220607020703[/date][comment]checked for malware[/comment] Yannic Kilcher <div><div>GPT-4chan is a language model fine-tuned from <a href="https://huggingface.co/EleutherAI/gpt-j-6B" rel="nofollow">GPT-J 6B</a> on 3.5 years worth of data from 4chan's politically incorrect (/pol/) board, as included in the datasetĀ <span style="border-style:solid;border-color:rgb(229,231,235);"><a href="https://zenodo.org/record/3606810" rel="nofollow">Raiders of theĀ Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board</a></span>.</div></div> Yannic Kilcher English datasets