# gpt-2

Code and samples from the paper ["Language Models are Unsupervised Multitask Learners"](https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf).

For now, we have only released a smaller (117M parameter) version of GPT-2.

See more details in our [blog post](https://blog.openai.com/better-language-models/).

## Installation

Download the model data:
```
sh download_model.sh 117M
```

The remaining steps can optionally be done in a virtual environment using tools such as `virtualenv` or `conda`.

Install TensorFlow 1.12 (with GPU support, if you have a GPU and want everything to run faster):
```
pip3 install tensorflow==1.12.0
```
or
```
pip3 install tensorflow-gpu==1.12.0
```

Install the other Python packages:
```
pip3 install -r requirements.txt
```

## Usage

### Unconditional sample generation

| WARNING: Samples are unfiltered and may contain offensive content. |
| --- |

To generate unconditional samples from the small model:
```
python3 src/generate_unconditional_samples.py | tee samples
```
Flags such as `--top_k` and `--temperature` control how samples are drawn:
```
python3 src/generate_unconditional_samples.py --top_k 40 --temperature 0.7 | tee samples
```
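
For intuition about what these two flags do, here is a minimal sketch of temperature scaling followed by top-k truncation, written with the Python standard library only. It is an illustration under stated assumptions, not the code in `src/`: the function names (`top_k_filter`, `softmax`, `sample_token`) and the toy logits are hypothetical, and treating `top_k=0` as "no truncation" is a convention chosen for this sketch.

```python
import math
import random


def top_k_filter(logits, k):
    """Keep the k largest logits and mask the rest with -inf.

    k <= 0 is treated as "no truncation" (an assumption of this sketch).
    Ties at the threshold are all kept, so slightly more than k may survive.
    """
    if k <= 0 or k >= len(logits):
        return list(logits)
    threshold = sorted(logits, reverse=True)[k - 1]
    return [x if x >= threshold else -math.inf for x in logits]


def softmax(logits):
    """Convert logits to probabilities; masked (-inf) entries get probability 0."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature=1.0, top_k=0, rng=random):
    """Sample one token index after temperature scaling and top-k truncation."""
    # Temperatures below 1 sharpen the distribution; above 1 flatten it.
    scaled = [x / temperature for x in logits]
    probs = softmax(top_k_filter(scaled, top_k))
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.5, -1.0]  # hypothetical scores for a 4-token vocabulary
    rng = random.Random(0)
    print([sample_token(toy_logits, temperature=0.7, top_k=2, rng=rng) for _ in range(10)])
```

With `top_k=2`, only the two highest-scoring tokens can ever be drawn; lowering the temperature shifts more probability toward the top-scoring token among those that remain.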

### Conditional sample generation

To give the model custom prompts, you can use:
```
python3 src/interactive_conditional_samples.py --top_k 40
```

## GPT-2 samples

While we have not yet released the full GPT-2 model, you can see some samples from it in the `gpt-2-samples` folder.
We show unconditional samples with default settings (temperature 1 and no truncation), with temperature 0.7, and with top-k truncation (`top_k` 40).
We show conditional samples, with contexts drawn from `WebText`'s test set, under the same three settings.

## Future work

We may release code for evaluating the models on various benchmarks.

We are still considering release of the larger models.