EntityQ95726734· pop 34· linked from 184 articles

GPT-3

Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020.

Key facts

Software.name: Generative Pre-trained Transformer 3.5 (GPT-3.5)
Software.logo: GPT-3.5 icon.png
Software.logo size: 200px
Software.author: OpenAI
Software.released: May 29, 2020 (publication); June 11, 2020 (OA API beta)
Software.replaces: GPT-3
Software.replaced_by: GPT-4GPT-4o mini
Software.license: Proprietary
Software.latest preview version: gpt-3.5-turbo-0125
Software.repo: N/A
Software.website: N/A

via Wikipedia infobox

Article

13 sections

Contents

Background
Training and capabilities
GPT-3 models
{{anchor|GPT-3.5}} GPT-3.5
Models
{{Anchor|GPT-3.5 with browsing}}GPT-3.5 with browsing
InstructGPT
Reception
Applications
Reviews
Criticism
See also
References

Generative Pre-trained Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020.

Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network, which supersedes recurrence and convolution-based architectures with a technique known as "attention". This attention mechanism allows the model to focus selectively on segments of input text it predicts to be most relevant. GPT-3 has 175 billion parameters, each with 16-bit precision, requiring 350GB of storage since each parameter occupies 2 bytes. It has a context window size of 2,048 tokens, and has demonstrated strong "zero-shot" and "few-shot" learning abilities on many tasks.