Category
page 12018 in artificial intelligence
generative pre-trained transformer
type of large language model
bidirectional encoder representations from transformers
deep learning artificial neural network language model
GPT-1
right|thumb|Original GPT architecture
Generative Pre-trained Transformer 1 (GPT-1) was the first of OpenAI's large language models following Google's invention of the transformer architecture in 2017. In June 2018, OpenAI released a paper entitled "Improving Language Understanding by Generative Pre-Training", in which they introduced that initial model along with the general concept of a generative pre-trained transformer.