The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

  • My badges

  • Twitter Updates

  • My Flickr Stream

  • Pages

  • All categories

  • Enter your email address to subscribe to this blog and receive notifications of new posts by email.

    Join 1,862 other subscribers

Archive for the ‘DeepSeek’ Category

Jürgen Schmidhuber on X: “DeepSeek [1] uses elements of the 2015 reinforcement learning prompt engineer [2] and its 2018 refinement [3] which collapses the RL machine and world model of [2] into a single net through the neural net distillation procedure of 1991 [4]: a distilled chain of thought system. …”

Posted by jpluimers on 2025/02/05

[WaybackSave/Archive] Jürgen Schmidhuber on X: “DeepSeek [1] uses elements of the 2015 reinforcement learning prompt engineer [2] and its 2018 refinement [3] which collapses the RL machine and world model of [2] into a single net through the neural net distillation procedure of 1991 [4]: a distilled chain of thought system. …”

followed by a list of references and this graph:

Read the rest of this entry »

Posted in AI and ML; Artificial Intelligence & Machine Learning, DeepSeek, Development, LLM, Software Development | Leave a Comment »