Archive for the ‘Development’ Category

C# Effective way to find any file’s Encoding – Stack Overflow

Posted by jpluimers on 2022/02/09

Note: Notepad cannot always guess the encoding correctly; see “The Old New Thing”: [Wayback] Some files come up strange in Notepad | The Old New Thing (it discusses ANSI a.k.a. Windows-1252, UTF-16LE, UTF-16BE, UTF-8 and UTF-7, some with and some without a BOM, as Notepad does not understand all permutations).

David Cumps discovered that certain text files come up strange in Notepad. The reason is that Notepad has to edit files in a variety of encodings, and when its back against the wall, sometimes it’s forced to guess.

[Wayback] C# Effective way to find any file’s Encoding – Stack Overflow shows how to detect various byte order marks in C#.
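As an illustration of the idea (this is not the code from the Stack Overflow answer, and it is Python rather than C#), a minimal sketch of byte order mark detection could look like the block below. The function name `detect_bom` and the returned encoding labels are my own choices; a file without a BOM simply yields `None`, so real-world detection still needs a fallback guess.

```python
import codecs
from typing import Optional

# Check the 4-byte marks first: the UTF-32 LE BOM (FF FE 00 00)
# starts with the same two bytes as the UTF-16 LE BOM (FF FE).
# Caveat: UTF-16 LE text whose first character is U+0000 would be
# reported as UTF-32 LE; a BOM alone cannot tell those apart.
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def detect_bom(data: bytes) -> Optional[str]:
    """Return the encoding implied by a leading BOM, or None if there is none."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    return None
```

Reading the first four bytes of a file in binary mode and passing them to `detect_bom` is enough, as no BOM in the table is longer than that.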

–jeroen

Posted in ASCII, Development, Encoding, Software Development, Unicode, UTF-16, UTF-32, UTF-8, UTF16, UTF32, UTF8 | Leave a Comment »

UTF-8 web adoption is huge, closing in on 100%, but it only soared since around 2006.

Posted by jpluimers on 2022/02/08

As a precursor to tomorrow’s post, which shows that serving UTF-8 does not free organisations from Unicode problems, first some statistics.

The first Unicode ideas were drafted some 35 years ago, in 1987. In 1991, more than 30 years ago, the Unicode Consortium saw the light. Nowadays more than 95% of web pages (close to 100% when you include plain ASCII) are served using the UTF-8 encoding.

It means that nowadays there is only a very small chance you will see mangled characters (what the Japanese call mojibake) when you’re surfing the web.

Some nice graphs of Unicode growth are at these locations:

Popularity of text encodings – Wikipedia
[Wayback] W3C: Who uses Unicode?
[Archive.is] Web Technologies Statistics and Trends: W3Techs shows statistics and trends in the usage statistics of web technologies
2008: [Wayback] utf-8 Growth On The Web | W3C Blog
2012: [Wayback] Official Google Blog: Unicode over 60 percent of the web
2012: [Archive.is] Usage Statistics of Character Encodings for Websites, May 2012
2015: [Wayback] UTF-8 Unicode vs. other encodings over time | Pinyin News
2020: [Archive.is] Usage Statistics and Market Share of Character Encodings for Websites, August 2020
2010-2021: [Archive.is] Historical yearly trends in the usage statistics of character encodings for websites, June 2021: from 50% UTF-8 in 2010 to almost 97% mid 2021 (with second-place ISO-8859-1 at just 1.3%, leaving less than 1.5% for all other encodings; see [Archive.is] Usage Statistics and Market Share of Character Encodings for Websites, June 2021)

I think especially important are 2008 (when UTF-8 had outgrown all other individual encodings) and slightly after 2010, when UTF-8 alone covered more than 50% of the pages served. These exclude ASCII-only pages. Adding those would make the figures even larger.

Historical yearly trends in the usage statistics of character encodings for websites, June 2021

–jeroen

Posted in Development, Encoding, Software Development, UTF-8, UTF8, Web Development | Leave a Comment »

ThinkPad T440p Touchpad Swap: Installing Correct Drivers – YouTube

Posted by jpluimers on 2022/02/08

[Wayback/Archive.is] ThinkPad T440p Touchpad Swap: Installing Correct Drivers – YouTube

One of the most common upgrades for any Haswell (xx40) series ThinkPad is to replace the awful button-less touchpad (sometimes referred to as the ClunkPad) with a T450 touchpad that has the proper buttons for TrackPoint users. However, getting the buttons to work properly on xx40 hardware can be tricky – particularly if you are running Windows 10. In this video you will see how to get drivers installed that will allow you to use the TrackPoint as if this were a T450! As always thanks for watching! Some of the guides I used for this video:

I have edited the links so they show forum post titles, added Wayback links, and added some crucial information:

[Wayback] T440/p CLUNKPAD: Pics and installation of T450 Synaptics and Alps Touchpad+buttons Driver – Thinkpads Forum

PLEASE NOTE:
BEFORE YOU PUT IN THE NEW TOUCHPAD, INSTALL SOME EXTRA BUMPERS/RUBBERS ALONG THE TOP OF THE LID, LEFT AND RIGHT OF THE WEBCAM.
THESE SHOULD BE ABOUT TWICE AS THICK/HIGH AS THE SMALL BUMPERS THAT ARE THERE ALREADY.
Recommended size ~WxH: ~6 x 2 mm or ~1/4 x 5/64 inch.
PLACE THEM IMMEDIATELY LEFT AND RIGHT OF THE EXISTING BUMPERS.
FAILURE TO DO SO WILL RESULT IN SCRATCH MARKS ON THE SCREEN FROM THE NEW BUTTONS!
- Synaptics T450 driver
  - Windows 7/8/8.1 that also works in Windows 10: [Wayback] download.lenovo.com/pccbbs/mobiles/jbgc28ww.exe
- Lenovo Yoga 14 and T450 ALPS driver
  - Windows 7/8/8.1 that also works in Windows 10: [Wayback] download.lenovo.com/ibmdl/pub/pc/pccbbs/mobiles/jfgh09ww.exe
- [Wayback] Stop W10 from automatically updating SPECIFIC drivers. – Thinkpads Forum
  - [Wayback] How to Prevent Windows from Automatically Updating Specific Drivers (using the Group Policy Editor to block updates for specific device IDs)
[Wayback] X240/s: How-to replace the Clunkpad with an X250 Touchpad+buttons – Thinkpads Forum
- Windows 7/8/8.1: [Wayback] download.lenovo.com/pccbbs/mobiles/n10gw27w.exe with readme in [Wayback] download.lenovo.com/pccbbs/mobiles/n10gw27w.txt
- Windows 10: [Wayback] download.lenovo.com/pccbbs/mobiles/n1cgr22w.exe with readme in [Wayback] download.lenovo.com/pccbbs/mobiles/n1cgr22w.txt
[Wayback/Archive.is] Lenovo ThinkPad T440p: T450 trackpad + FHD IPS display – YouTube: Touchpad swap + screen replacement
[Wayback/Archive.is] Lenovo ThinkPad T440p: CPU and storage upgrade – YouTube
[Wayback/Archive.is] Diagnosing and repairing laptop display flickering (Featuring the Lenovo ThinkPad T440p) – YouTube: Replacing a damaged display cable

A few videos I’ve made about the T440p:

Touchpad swap + screen replacement – https://web.archive.org/web/20210605095911/https://www.youtube.com/watch?v=vOXz-…

CPU + storage upgrades – https://web.archive.org/web/20210605095911/https://www.youtube.com/watch?v=YetQc…

Replacing a damaged display cable – https://web.archive.org/web/20210605095911/https://www.youtube.com/watch?v=99ZoU…

Via [Archive.is] ThinkPads Old and New | Facebook

–jeroen

Posted in Development, Hardware Development, Power User, ThinkPad | Leave a Comment »

Chrome debugging tip: disabling framework/library code (from Minko Gechev on Twitter)

Posted by jpluimers on 2022/02/03

Cool tip: [Archive.is] Minko Gechev on Twitter: “Tooling tip: When debugging, you can prevent stepping into framework/library code by using blackboxing. In @ChromeDevTools: ‣ Open the script you don’t want to enter ‣ Right click → Blackbox ‣ Pain free debugging ✨… “

–jeroen

Read the rest of this entry »

Posted in Development, JavaScript/ECMAScript, Scripting, Software Development, TypeScript | Leave a Comment »

RegEx character classes in “Searching | Notepad++ User Manual”

Posted by jpluimers on 2022/02/03

I needed to search for IBAN numbers in documents and used this regular expression: [a-zA-Z]{2}[0-9]{2} ?[a-zA-Z0-9]{4} ?[0-9]{4} ?[0-9]{4} ?[0-9]{2} which supports the usual optional whitespace like in NL12 INGB 0345 6789 01.
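To try the same expression outside Notepad++, here is a small sketch using Python’s `re` module (an assumption on my side: Notepad++ uses a different regex engine, but this particular pattern only relies on syntax both share). The helper name `find_ibans` is made up for the example. Note that the pattern adds up to 18 characters, which fits Dutch IBANs; other countries use other lengths.

```python
import re

# The expression from the post, unchanged: country code, check digits,
# four-character bank code and a ten-digit account number, with optional
# single spaces between the groups.
IBAN_PATTERN = re.compile(
    r"[a-zA-Z]{2}[0-9]{2} ?[a-zA-Z0-9]{4} ?[0-9]{4} ?[0-9]{4} ?[0-9]{2}"
)


def find_ibans(text: str) -> list:
    """Return every substring of text that matches the IBAN pattern."""
    return IBAN_PATTERN.findall(text)


print(find_ibans("Please pay to NL12 INGB 0345 6789 01 before Friday."))
```

The pattern only checks the shape of the number; it does not validate the check digits.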

The expression is based on the nice table of RegEx character classes that Notepad++ supports, at [Wayback] Searching | Notepad++ User Manual:

Character Classes

[set] ⇒ This indicates a set of characters, for example, [abc] means any of the literal characters a, b or c. You can also use ranges by doing a hyphen between characters, for example [a-z] for any character from a to z. You can use a collating sequence in character ranges, like in [[.ch.]-[.ll.]] (these are collating sequence in Spanish).

[^set] ⇒ The complement of the characters in the set. For example, [^A-Za-z] means any character except an alphabetic character. Care should be taken with a complement list, as regular expressions are always multi-line, and hence [^ABC]* will match until the first A, B or C (or a, b or c if match case is off), including any newline characters. To confine the search to a single line, include the newline characters in the exception list, e.g. [^ABC\r\n].

Please note that the complement of a character set is often many more characters than you expect: (?-s)[^x]+ will match 1 or more instances of any non-x character, including newlines: the (?-s) search modifier turns off “dot matches newlines”, but the [^x] is not a dot ., so that class is still allowed to match newlines.
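The newline behaviour of complement sets is not specific to Notepad++. As a hedged illustration, the sketch below uses Python’s `re` module (a different engine than the one in Notepad++, where `.` does not match newlines unless asked to) and a made-up helper `first_run` to show the difference between `.`, `[^x]` and `[^x\r\n]`.

```python
import re


def first_run(pattern: str, text: str) -> str:
    """Return what the pattern matches at the very start of text."""
    match = re.match(pattern, text)
    return match.group() if match else ""


sample = "abc\ndef x ghi"

# The dot stops at the line end (Python's default, no DOTALL flag).
print(repr(first_run(r".+", sample)))
# The complement set is not a dot, so it happily crosses the newline.
print(repr(first_run(r"[^x]+", sample)))
# Adding \r and \n to the exception list confines it to one line again.
print(repr(first_run(r"[^x\r\n]+", sample)))
```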

[[:name:]] or [[:☒:]] ⇒ The whole character class named name. For many, there is also a single-letter “short” class name, ☒. Please note: the [:name:] and [:☒:] must be inside a character class [...] to have their special meaning.

short	full name	description	equivalent character class
	`alnum`	letters and digits
	`alpha`	letters
`h`	`blank`	spacing which is not a line terminator	`[\t\x20\xA0]`
	`cntrl`	control characters	`[\x00-\x1F\x7F\x81\x8D\x8F\x90\x9D]`
`d`	`digit`	digits
	`graph`	graphical character, so essentially any character except for control chars, `\0x7F`, `\x80`
`l`	`lower`	lowercase letters
	`print`	printable characters	`[\s[:graph:]]`
	`punct`	punctuation characters	`[!"#$%&'()*+,\-./:;<=>?@\[\\\]^_`{|}~]`
`s`	`space`	whitespace (word or line separator)	`[\t\n\x0B\f\r\x20\x85\xA0\x{2028}\x{2029}]`
`u`	`upper`	uppercase letters
	`unicode`	any character with code point above 255	`[\x{0100}-\x{FFFF}]`
`w`	`word`	word characters	`[_\d\l\u]`
	`xdigit`	hexadecimal digits	`[0-9A-Fa-f]`

Note that letters include any unicode letters (ASCII letters, accented letters, and letters from a variety of other writing systems); digits include ASCII numeric digits, and anything else in Unicode that’s classified as a digit (like superscript numbers ¹²³…).

Note that those character class names may be written in upper or lower case without changing the results. So [[:alnum:]] is the same as [[:ALNUM:]] or the mixed-case [[:AlNuM:]].

As stated earlier, the [:name:] and [:☒:] (note the single brackets) must be a part of a surrounding character class. However, you may combine them inside one character class, such as [_[:d:]x[:upper:]=], which is a character class that would match any digit, any uppercase, the lowercase x, and the literal _ and = characters. These named classes won’t always appear with the double brackets, but they will always be inside of a character class.

If the [:name:] or [:☒:] are accidentally not contained inside a surrounding character class, they will lose their special meaning. For example, [:upper:] is the character class matching :, u, p, e, and r; whereas [[:upper:]] is similar to [A-Z] (plus other unicode uppercase letters)

[^[:name:]] or [^[:☒:]] ⇒ The complement of character class named name or ☒ (matching anything not in that named class). This uses the same long names, short names, and rules as mentioned in the previous description.

–jeroen

Posted in Development, Notepad++, Power User, RegEx, Software Development, Text Editors | Leave a Comment »

Google Open Source Insights (hopefully by now more than just npm/golang/maven)

Posted by jpluimers on 2022/02/02

Interesting project at [Wayback] Open Source Insights

Open Source Insights is an experimental project by Google.

Hopefully by now it supports more than just npm/golang/maven, and by the time it sunsets, other projects will have taken over.

The introduction was some 9 months ago: [Wayback] Introducing the Open Source Insights Project | Google Open Source Blog

Via:

–jeroen

Posted in Development, Go (golang), JavaScript/ECMAScript, Node.js, Power User, Scripting, Security, Software Development | Leave a Comment »

pipe – Windows how to redirect file parameter to stdout? (Windows equivalent of `/dev/stdout`) – Super User

Posted by jpluimers on 2022/02/02

TL;DR:

Windows has CON: which is an equivalent for /dev/tty
Windows has no equivalent for /dev/stdout (the standard output stream)
There is a C# PipeServer.cs proof-of-concept that simulates /dev/stdout through a temporary named pipe
Windows pipe names start with \\.\pipe\ for names on the local machine
The above for /dev/stdout on Windows also holds for /dev/stdin (the standard input stream)

All via [Wayback] pipe – Windows how to redirect file parameter to stdout? (Windows equivalent of /dev/stdout) – Super User.
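To make the naming conventions from the list concrete, here is a tiny Python sketch. It is not the PipeServer.cs proof-of-concept and it does not create a pipe; the helpers `local_pipe_path` and `console_device` are hypothetical names that only build the strings mentioned above.

```python
import os

# Prefix for named pipes on the local machine, as described in the list.
LOCAL_PIPE_PREFIX = "\\\\.\\pipe\\"


def local_pipe_path(name: str) -> str:
    """Build the full path of a named pipe on the local machine."""
    return LOCAL_PIPE_PREFIX + name


def console_device(os_name: str = os.name) -> str:
    """Return the console device name: CON on Windows, /dev/tty elsewhere."""
    return "CON" if os_name == "nt" else "/dev/tty"


print(local_pipe_path("demo"))
print(console_device())
```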

Read the rest of this entry »

Posted in .NET, C#, Development, Software Development, Windows Development | Leave a Comment »

Hornbach has some very “special” limitations to “special characters” in passwords. I wonder why.

Posted by jpluimers on 2022/02/01

[Wayback] Jeroen Wiert Pluimers on Twitter: “”Too special” password character password woes at @HORNBACH_NL : [ The password must be at least eight characters long and contain at least one digit and one letter (a-zA-Z). The following special characters are allowed: !"#$%&'()*+,.:;?@_|} ] 1/”

I wonder what kind of parser they use, as these printable special ASCII characters are forbidden:

\-/[\]^`{~

space (0x20)

tab (0x9)

line feed (0xa)

carriage return (0xd)

vertical tab (0xb)

form feed (0xc)
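The forbidden punctuation can be derived mechanically from the allowed list. The sketch below assumes that the allowed list quoted in the tweet is complete, and uses Python’s `string.punctuation` as the definition of printable ASCII special characters; the function name is made up for the example.

```python
import string

# The special characters the site says it allows, as quoted in the tweet.
ALLOWED = "!\"#$%&'()*+,.:;?@_|}"


def forbidden_punctuation(allowed: str = ALLOWED) -> str:
    """Return the ASCII punctuation characters that are not in the allowed list."""
    return "".join(ch for ch in string.punctuation if ch not in allowed)


print(forbidden_punctuation())
```

Under that assumption the result has twelve characters: the hyphen, slash, square brackets, backslash, caret, backtick, opening brace and tilde, plus `<`, `=` and `>`.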

This does not look like a JSON or SQL limitation to me: there I would expect other limitations.

What would break if you use them in other fields or pass them in an HTTP POST request?

I mean: these passwords should be salted and hashed immediately when the HTTP POST request is received, so certainly they would not be stored somewhere or passed many layers into code, right?
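A minimal sketch of why the character set should not matter: once the password is encoded to bytes, a salted hash accepts anything. The example uses PBKDF2 from Python’s standard library; the function names and the iteration count are my own assumptions, not anything known about the site’s implementation.

```python
import hashlib
import hmac
import os


def hash_password(password: str, iterations: int = 600_000) -> tuple:
    """Return (salt, digest) for a password, using a fresh random salt."""
    salt = os.urandom(16)
    digest = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, iterations)
    return salt, digest


def verify_password(password: str, salt: bytes, digest: bytes,
                    iterations: int = 600_000) -> bool:
    """Recompute the digest with the stored salt and compare in constant time."""
    candidate = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, iterations)
    return hmac.compare_digest(candidate, digest)


# A password full of the "forbidden" characters hashes like any other.
salt, digest = hash_password("p-/[\\]^`{~ \t1a")
print(verify_password("p-/[\\]^`{~ \t1a", salt, digest))
```

Only the salt and the digest need to be stored, so the original characters never reach a database or a deeper layer of code.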

Oh, and in order to activate an account there, you need to accept some 40+ A4-sized pages of legal text. It would take a brave Dutch judge to rule all of these in favour of Hornbach.

[Wayback] Herroepingsrecht bij HORNBACH (no PDF)
- [Wayback] Modelformulier voor herroeping PDF (1 page)
[Wayback] Privacyverklaring HORNBACH Bouwmarkt (Nederland) B.V. [Wayback] PDF (23 pages)
- [Wayback] Reglement cameratoezicht (only as PDF): 3 pages

–jeroen

Read the rest of this entry »

Posted in Development, LifeHacker, Power User, Security, Software Development, Web Development | Leave a Comment »

Some links on using and updating Let’s Encrypt certificates for internal servers

Posted by jpluimers on 2022/02/01

Sometimes it is easier to have current, public-CA-signed TLS certificates for internal servers than to set up and maintain an internal CA and register it on all affected browsers (including mobile phones).

One of my reasons to investigate this is that Chrome refuses to save credentials on servers that have no verifiable TLS certificate, see my post Some links on Chrome not prompting to save passwords (when Firefox and Safari do) about a week ago.

Below are some links for my link archive that hopefully will allow me to do this with Let’s Encrypt (most via [Wayback/Archive] letsencrypt for internal servers – Google Search):

Read the rest of this entry »

Posted in Cloud, Cloudflare, Development, Encryption, ESXi6, ESXi6.5, ESXi6.7, ESXi7, Fritz!, Fritz!Box, Fritz!WLAN, Infrastructure, Internet, Let's Encrypt (letsencrypt/certbot), Power User, Security, Software Development, Virtualization, VMware, VMware ESXi, Web Development | Leave a Comment »

cd-to-file.bat for when you have a full filename that is too long to truncate by hand

Posted by jpluimers on 2022/01/31

Small cd-to-file.bat tip:

pushd %~dp1
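For readers unfamiliar with the `%~dp1` modifier: it expands the first batch argument to its drive and path only, with a trailing backslash. The Python sketch below mimics that for absolute Windows paths; the function name `drive_and_path` is made up, and unlike the batch modifier it does not resolve relative paths against the current directory.

```python
from pathlib import PureWindowsPath


def drive_and_path(filename: str) -> str:
    """Return drive plus directory of an absolute Windows path, ending in a backslash."""
    parent = str(PureWindowsPath(filename).parent)
    # A drive root such as C:\ already ends in a backslash; other parents do not.
    return parent if parent.endswith("\\") else parent + "\\"


print(drive_and_path(r"C:\Users\me\docs\report.txt"))
```

`PureWindowsPath` only manipulates the string, so the sketch runs on any operating system.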

–jeroen

Posted in Batch-Files, Power User, Scripting, Software Development, Windows | Leave a Comment »


The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests
