The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

Thread by Cameron R. Wolfe on Twitter about why GPT-3 is better than larger language models

Posted by jpluimers on 2025/09/04

For my link archive: the [Wayback/Archive] Thread by @cwolferesearch on Thread Reader App, starting with [Wayback/Archive] Cameron R. Wolfe on Twitter: “After GPT-3 was proposed, a lot of research was done to find an even better language model. Initial attempts focused on just training larger models. Contrary to popular belief, however, there is more to creating a good language model than size… 🧵[1/8]” / Twitter

Three years later, I’m anxious to know what the current state of the art in GPT models is, as only a bit more than a year passed between GPT-2 (February 2019) and GPT-3 (May 2020).

–jeroen
