The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

Jürgen Schmidhuber on X: “DeepSeek [1] uses elements of the 2015 reinforcement learning prompt engineer [2] and its 2018 refinement [3] which collapses the RL machine and world model of [2] into a single net through the neural net distillation procedure of 1991 [4]: a distilled chain of thought system. …”

Posted by jpluimers on 2025/02/05

[WaybackSave/Archive] Jürgen Schmidhuber on X: “DeepSeek [1] uses elements of the 2015 reinforcement learning prompt engineer [2] and its 2018 refinement [3] which collapses the RL machine and world model of [2] into a single net through the neural net distillation procedure of 1991 [4]: a distilled chain of thought system. …”

The tweet is followed by a list of references and this graph:

[Wayback/Archive] Gioh8G8X0AAOdx8.jpg:orig (2048×2048)

[Wayback/Archive] Tweet JSON

--jeroen
