Crosspost: Large language models, explained with a minimum of math and jargon

Want to really understand how large language models work? Here’s a gentle primer.

Timothy B Lee and Sean Trott, Jul 27, 2023

Aug 20, 2023

“One reason for this is the unusual way these systems were developed. Conventional software is created by human programmers who give computers explicit, step-by-step instructions. In contrast, ChatGPT is built on a neural network that was trained using billions of words of ordinary language.

As a result, no one on Earth fully understands the inner workings of LLMs. Researchers are working to gain a better understanding, but this is a slow process that will take years—perhaps decades—to complete.”

Understanding AI


Hi, it’s Tim Lee. I’m a journalist with a master’s degree in computer science. This post is the result of two months of in-depth research. If you find it helpful, please subscribe to get future articles delivered straight to your inbox. Today’s post is co-authored with Sean Trott, a cognitive scientist at the University of California, San Diego. If you a…


Ephektikoi - Guerrilla Epistemologist
