2018-06-14 14:29:53 +02:00
2018-06-13 16:07:58 +02:00
2018-06-14 12:10:05 +02:00
2018-06-13 16:07:58 +02:00
2018-06-13 16:07:58 +02:00
2018-06-14 13:19:07 +02:00
2018-06-14 03:50:41 +02:00

finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Currently this code implements the ROCStories Cloze Test result reported in the paper by running: python train.py --dataset rocstories --desc rocstories --submit --analysis --data_dir [path to data here]

Note: The code is currently non-deterministic due to various GPU ops. The median accuracy of 10 runs with this codebase (using default hyperparameters) is 85.8% - slightly lower than the reported single run of 86.5% from the paper.

The ROCStories dataset can be downloaded from the associated website.

S
Description
Generating text from Gutenberg books and OpenAI's finetuned transformer language model
Readme MIT 361 KiB
Languages
Jupyter Notebook 88.7%
Python 11.3%