1

Overview of Large Language Models

In 2017, a team at Google Brain introduced an advanced artificial intelligence (AI) deep learning model called the Transformer. Since then, the Transformer has become the standard for tackling various natural language processing (NLP) tasks in academia and industry. It is likely that you have interacted with the Transformer model in recent years without even realizing it, as Google uses BERT to enhance its search engine by better understanding users’ search queries. The GPT family of models from OpenAI have also received attention for their ability to generate human-like text and images.

These Transformers now power applications such as GitHub’s Copilot (developed by OpenAI in collaboration with Microsoft), ...

Get Quick Start Guide to Large Language Models: Strategies and Best Practices for Using ChatGPT and Other LLMs now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.