Crosspost: Large language models, explained with a minimum of math and jargon
Want to really understand how large language models work? Here’s a gentle primer. Timothy B Lee and Sean Trott, Jul 27, 2023
“One reason for this is the unusual way these systems were developed. Conventional software is created by human programmers who give computers explicit, step-by-step instructions. In contrast, ChatGPT is built on a neural network that was trained using billions of words of ordinary language.
As a result, no one on Earth fully understands the inner workings of LLMs. Researchers are working to gain a better understanding, but this is a slow process that will take years—perhaps decades—to complete.”

