The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

  • My badges

  • Twitter Updates

  • My Flickr Stream

  • Pages

  • All categories

  • Enter your email address to subscribe to this blog and receive notifications of new posts by email.

    Join 1,860 other subscribers

Archive for March, 2024

Some links to investigate a Raspberry Pi system failing on an SD card that got bad

Posted by jpluimers on 2024/03/22

–jeroen

Read the rest of this entry »

Posted in *nix, *nix-tools, Development, Hardware Development, Power User, Raspberry Pi | Leave a Comment »

Microsoft OAuth Authentication and Thunderbird in 2024 | Thunderbird Help

Posted by jpluimers on 2024/03/22

https://support.mozilla.org/en-US/kb/microsoft-oauth-authentication-and-thunderbird-202

Via

Posted in Uncategorized | Leave a Comment »

On the ignore list: “e/acc” Effective accelerationism – Wikipedia

Posted by jpluimers on 2024/03/22

https://en.wikipedia.org/wiki/Effective_accelerationism

Via

Related:

https://twitter.com/search?q=from%3A%40NecroKuma3%20e%20acc&src=typed_query

Posted in Uncategorized | Leave a Comment »

Autumn 2023 research: How Is ChatGPT’s Behavior Changing over Time?

Posted by jpluimers on 2024/03/21

[Wayback/Archive] https://arxiv.org/pdf/2307.09009.pdf ([Google Docs PDF view: Wayback] Google Docs PDF view: 2307.09009.pdf) is interesting. The abstract confirms my thought: over time LLM drift over time and seem to become worse at knowledge tasks.

How Is ChatGPT’s Behavior Changing over Time?

Lingjiao Chen†, Matei Zaharia‡, James Zou†
†Stanford University ‡UC Berkeley

Abstract

GPT-3.5 and GPT-4 are the two most widely used large language model (LLM) services.
However, when and how these models are updated over time is opaque. Here, we evaluate the March 2023 and June 2023 versions of GPT-3.5 and GPT-4 on several diverse tasks: 1) math problems, 2) sensitive/dangerous questions, 3) opinion surveys, 4) multi-hop knowledge-intensive questions, 5) generating code, 6) US Medical License tests, and 7) visual reasoning. We find that the performance and behavior of both GPT-3.5 and GPT-4 can vary greatly over time. For example, GPT-4 (March 2023) was reasonable at identifying prime vs. composite numbers (84% accuracy) but GPT-4 (June 2023) was poor on these same questions (51% accuracy). This is partly explained by a drop in GPT-4’s amenity to follow chain-of-thought prompting. Interestingly, GPT-3.5 was much better in June than in March in this task. GPT-4 became less willing to answer sensitive questions and opinion survey questions in June than in March. GPT-4 performed better at multi-hop questions in June than in March, while GPT-3.5’s performance dropped on this task. Both GPT-4 and GPT-3.5 had more formatting mistakes in code generation in June than in March. We provide evidence that GPT-4’s ability to follow user instructions has decreased over time, which is one common factor behind the many behavior drifts. Overall, our findings show that the behavior of the “same” LLM service can change substantially in a relatively short amount of time, highlighting the need for continuous monitoring of LLMs.

Later on, Eric Topol had the very interesting conversation with James Zou below which covers many AI aspects including a lot of LLM ones. Basic takeaways for me are that they are good at repeating things from their training data, making them OK on generating text, sort of OK for grammar, but far form OK from reproducing knowledge, and that it will become harder over time to distinguish LLM generated content from human created content.

The video of the conversation is below the blog signature; here is the link: [Wayback/Archive] James Zou: one of the most prolific and creative A.I. researchers in both life science and medicine – YouTube

Almost all LLMs are being trained on a corpus without curation (curation is way too expensive), resulting in them at best averaging the corpus (as in the foundation, LLM is just a “monkey see, monkey do” on steroids but without the means of self-curating to result in above average generation. I think that given more and more on-line content is being and becoming generated by LLM, and newer LLM will be trained based on the corpus encompassing that content (without the means to filter out LLM generated content), over time LLM will perform worse instead of better.

Via he below series of interesting tweets of which were quoted by a slightly less pessimistic Erik Meijer [Wayback/Archive] Erik Meijer on X: “Regression to the mean.. Nnote some interesting replies as well. I found the one mentioning Eternal September especially fitting. It made me discover [Wayback/Archive] www.eternal-september.org

Today is September 11160, 1993, the september that never ends
No pr0n, no warez, just Usenet

Anyway, the tweets:

Read the rest of this entry »

Posted in AI and ML; Artificial Intelligence & Machine Learning, Awareness, ChatGPT, Development, GPT-3, GPT-4, LLM, Software Development | Leave a Comment »

My Ultimate PowerShell prompt with Oh My Posh and the Windows Terminal – Scott Hanselman’s Blog

Posted by jpluimers on 2024/03/21

Via [Archive.is] Kevin on Twitter: “Gotta say this looks amazing and I actually didn’t know you can customize the command line on Windows this far. Read this blogpost by @shanselman , highly recommended. 👇 “

For my link archive: [Wayback] My Ultimate PowerShell prompt with Oh My Posh and the Windows Terminal – Scott Hanselman’s Blog

Read the rest of this entry »

Posted in CommandLine, Development, Power User, PowerShell, PowerShell, Scripting, Software Development, Windows, Windows 10, Windows Development | Leave a Comment »

GPS jamming & interference map | Flightradar24

Posted by jpluimers on 2024/03/21

Not so relevant in our area yet, but all the more relevant in some areas: [Wayback/Archive] GPS jamming & interference map | Flightradar24

Via:

--jeroen

Posted in Awareness | Leave a Comment »

ApplesSoft BASIC code which includes assembly language: Twitter bot AppleIIBot could run it!

Posted by jpluimers on 2024/03/20

This was a trip down memory lane where I was totally unaware that you could embed 6502 assembly language inside AppleSoft BASIC code.

It turns you can, and even better: the Twitter bot named AppleIIBot could execute it too!

Though I bumped into AppleIIBot during winter 2021, I published the BBC equivalent last week (see BBC trip down memory lane – 8bitkick/BBCMicroBot: Runs your tweet on an 8-bit computer emulator) as that one got released earlier.

For the moment it is down because Elon blew up Twitter and shut down on 2022-11-05, but hopefully – like the BBC equivalent – it will resurface on a Mastodon instance somewhere in the future.

Luckily all old Tweets with code and rendering are still there, though you need a Twitter account to view them: Elon broke the feature of anonymous access seeing all messages in a thread.

Below the signature are the full Tweets that led me into it; the texts are these:

Read the rest of this entry »

Posted in //e, 6502, Apple, Apple ][, BASIC, Development, History, SocialMedia, Software Development, Twitter | Leave a Comment »

Publish your results through “wa11y.co: Wordle Accessibility” to allow visual impaired much easier and pleasant access to your results #a11y

Posted by jpluimers on 2024/03/19

I wish Wordle would be more accessible, but alas after all this time it still isn’t.

Luckily there is [Wayback/Archive] wa11y.co: Wordle Accessibility

At the end of your Wordle game, click “Share” to copy your result then paste it below to generate descriptive text.

It is open source on GitHub at [Wayback/Archive] cariad/wa11y.co: Makes Wordle results accessible. (most of it is written in JavaScript)

I first bumped into it via:

Read the rest of this entry »

Posted in accessibility (a11y), Development, JavaScript/ECMAScript, Scripting, Software Development, Web Development | Leave a Comment »

Stop met makelaarstaal welke nep-deftig lijkt maar het niet is: gebruik die/dat en pas op met andere ouderwetse of formele woorden

Posted by jpluimers on 2024/03/18

Tegenwoordig kom je nog steeds taalgebruik tegen waarvan de huiveringen over je rug lopen.

Nu is taal voor mij als woordblinde autist sowieso altijd ingewikkeld, maar ik heb wel een aardig gevoel over het soort taal dat ik niet wil gebruiken.

Twee jaar terug kwam ik via Twitter deze twee onderwerpen tegen:

Read the rest of this entry »

Posted in LifeHacker, Natural Languages, Power User | Leave a Comment »

Communication can help patients but also harm them when they are confronted with a serious illness (via Twitter: Liesbeth van Vliet)

Posted by jpluimers on 2024/03/15

I reacted to a Dutch [Wayback/Archive] Thread by @NL_Wetenschap on Thread Reader App (which has an excellent Google translation [Wayback/Archive]).

The responses mentioned these two interesting English papers and a nice YouTube video:

I have first hand experience from my cancer treatment period where despite having a companion and emphatic care takers, I did not comprehend large parts of the information.

Two tips:

  1. Ensure you and your caretakers are on the same “wavelength” communicating. If you don’t: politely ask for another caretaker in the team.
  2. Always bring a companion with you that knows your case very well.

The rest of this post is in Dutch containing the original tweets:

Read the rest of this entry »

Posted in Awareness, Health, LifeHacker, Power User | Leave a Comment »