The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

  • My badges

  • Twitter Updates

  • My Flickr Stream

  • Pages

  • All categories

  • Enter your email address to subscribe to this blog and receive notifications of new posts by email.

    Join 1,860 other subscribers

Archive for the ‘PDF’ Category

mutool: all purpose tool for dealing with PDF files | mupdf Commands | Man Pages | ManKier

Posted by jpluimers on 2025/09/23

Sometimes I feel like having lived under a rock for ages despite trying to live the command-line life as much as possible.

Anyway: not that long ago I learned about [Wayback/Archive] mutool: all purpose tool for dealing with PDF files | mupdf Commands | Man Pages | ManKier

It is part of [Wayback/Archive] Package mupdf – man pages | ManKier which also has [Wayback/Archive] mupdf: MuPDF is a lightweight PDF viewer written in portable C | mupdf Commands | Man Pages | ManKier.

MuPDF – a kind of sibling to GhostScript – lives at [Wayback/Archive] MuPDF: The ultimate library for managing PDF documents.

The below query got me to [Wayback/Archive] mupdf/docs/man/mutool.1 at master · ccxvii/mupdf · GitHub in the “wrong” repository [Wayback/Archive] GitHub – ccxvii/mupdf: mirrored from git://git.ghostscript.com/mupdf.git

The actual repositories are at:

Read the rest of this entry »

Posted in *nix, *nix-tools, Console (command prompt window), Development, PDF, Power User, Scripting, Software Development | Leave a Comment »

Centurion Deuren: Gebruikers- & Installatiehandleiding KS-Serie (70 – 100)

Posted by jpluimers on 2025/06/23

Will need this: [Wayback/Archive] Centurion Deuren: Gebruikers- & Installatiehandleiding KS-Serie (70 – 100) [Wayback PDF View/PDF View]

The problem is that a combination of the buttons of the Centurion remote controls failing more frequently plus that the remote control are unable to teach their code to new universtal remote controls that we still had from our previous home which used a Marantec door.

Universal remote controls are supposed to be compatible with both Centurion and Marantec remote controls.

What I want to figure out is if I can initiate one new remote control from the door opening mechanism, then have that teach other new remote controls.

Hopefully this is the right manual for the model we have (the model number is not visible from below, so need to verify that from a location difficult to look from above).

Fingers crossed.

Related links (the quest):

Read the rest of this entry »

Posted in Hardware, LifeHacker, PDF, Power User | Leave a Comment »

Download your Kindle books soon, because Amazon will block them after February 25, 2025

Posted by jpluimers on 2025/02/17

Download & transfer via USBTransfer Tip: After downloading, use your USB cable to connect your computer and Kindle. Your Kindle will appear as a drive on your computer. Copy your downloaded file from your computer to your Kindle's documents folder. Starting 26 February 2025, the ‘Download & Transfer via USB’ option will no longer be available. You can still send Kindle books to your Wi-Fi enabled devices by selecting the ‘Deliver or Remove from Device’ option. Learn more about managing downloads

Download & transfer via USB Transfer Tip: After downloading, use your USB cable to connect your computer and Kindle. Your Kindle will appear as a drive on your computer. Copy your downloaded file from your computer to your Kindle’s documents folder. ❗️Starting 26 February 2025, the ‘Download & Transfer via USB’ option will no longer be available. You can still send Kindle books to your Wi-Fi enabled devices by selecting the ‘Deliver or Remove from Device’ option. Learn more about managing downloads

Amazon will disable downloading of Kindle books after 2025-02-25 (yup: slightly more than a week!):

(image on the right via Jan Wildeboer)

Edit 20250224: steps to convert from Kindle 1.17 on MacOS or Windows in 3 Ways to Convert Kindle to PDF for Free.

It allowed me to convert everything to PDF except one book which I found a free PDF of at [Wayback/Archive] Linear Algebra Done Right as [Wayback/Archive] linear.axler.net/LADR4e.pdf [Wayback PDF View/PDF View] via [WaybackSave/Archive] Sheldon Axler on X: “The free Kindle version of the fourth edition of my book Linear Algebra Done Right is now available at www.amazon.com/Linear-Algebra-Right-Undergraduate-Mathematics-ebook/dp/B0DDT4WVRD. The free pdf version of the book is available at linear.axler.net. The free translation into Chinese is also available as a pdf file at linear.axler.net ¹

Back to the original:

Table with URLs for your Kindle libraries where you can download manually based on https://www.amazon.com/hz/mycd/digital-console/contentlist/booksAll/dateDsc/ which I got form the below mentioned  download tools:

Read the rest of this entry »

Posted in Amazon.com/.de/.fr/.uk/..., Cloud, Infrastructure, LifeHacker, PDF, Power User | Tagged: , | Leave a Comment »

Markdown has been the Internet’s lingua franca for documentation. Microsoft finally the documentation format with markitdown: Python tool for converting files and office documents to Markdown.

Posted by jpluimers on 2024/12/17

Finally an easier way to convert Office documents (and other formats) to markdown: [Wayback/Archive] GitHub – microsoft/markitdown: Python tool for converting files and office documents to Markdown. (after Google added a Markdown export feature to Google Docs about half a year ago, and basic Markdown formatting about 2 years ago – see below):

There are quite a few dependencies in [Wayback/Archive] markitdown/pyproject.toml at main · microsoft/markitdown · GitHub, so be prepared for that.

Supported formats (added links for clarity):

The MarkItDown library is a utility tool for converting various files to Markdown (e.g., for indexing, text analysis, etc.)
It presently supports:
  • PDF (.pdf)
  • PowerPoint (.pptx)
  • Word (.docx)
  • Excel (.xlsx)
  • Images (EXIF metadata, and OCR)
  • Audio (EXIF metadata, and speech transcription)
  • HTML (special handling of Wikipedia, etc.)
  • Various other text-based formats (csv, json, xml, etc.)

Google was first though:

  1. [Wayback/Archive] Google Workspace Updates: Compose with Markdown in Google Docs on web
  2. [Wayback/Archive] Google Workspace Updates: Import and export Markdown in Google Docs

There is speculation on why Microsoft introduced it just now ranging from “they need it for AI training” to “just late to the game”. I’m with the latter. Apple is even later, so if you want to convert Apple Notes to markdown, then you can use [Wayback/Archive] Import from Apple Notes – Obsidian Help.

Via various sources, including:

Read the rest of this entry »

Posted in CSV, Development, Excel, HTML, HTML5, JSON, Lightweight markup language, MarkDown, Office, PDF, Power Point, Power User, Software Development, Word, XML/XSD | Tagged: , | Leave a Comment »

Jung Hoofdcatalogus 2024 PDF

Posted by jpluimers on 2024/05/06

[Wayback/Archive] www.hateha.nl/inhoud/uploads/9030.00-jung-hoofdcatalogus.pdf (Omdat de catalogus bijna 50 megabyte groot is werken de View Wayback PDF/View PDF helaas niet).

Query: [Wayback/Archive] jung catalogus filetype:pdf – Google Search

Gerelateerd (omdat ik naar blindplaten aan het zoeken was):

--jeroen

Posted in LifeHacker, PDF, Power User | Leave a Comment »

Print large PDF in Preview over several pages… – Apple Community

Posted by jpluimers on 2023/12/22

Summary of [Wayback/Archive] Print large PDF in Preview over several pages… – Apple Community:

  • Preview cannot
  • Acrobat Reader (formerly Adobe Reader) can; it is called “Poster” (also on Windows)

Via [Wayback/Archive] macos print pdf scaled over two pages – Google Search.

–jeroen

Posted in Adobe, Adobe Acrobat, Apple, Mac OS X / OS X / MacOS, PDF, Power User, Windows | Leave a Comment »

Removing identifiable metada from PDF files

Posted by jpluimers on 2023/06/19

I archived a long thread that started with [Archive] 𝚓𝚘𝚗𝚗𝚢﹏𝚜𝚊𝚞𝚗𝚍𝚎𝚛𝚜 on Twitter: “More fun publisher surveillance: Elsevier embeds a hash in the PDF metadata that is unique for each time a PDF is downloaded, this is a diff between metadata from two of the same paper. Combined with access timestamps, they can uniquely identify the source of any shared PDFs. ” / Twitter at [Wayback/Archive] Thread by @json_dirs on Thread Reader App – Thread Reader App.

TL;DR: publishers put hashes in PDF metadata to track back redistribution; they hardly use smarter watermarking as those are difficult to automatically parse; the hashes can be easily removed.

Read the rest of this entry »

Posted in Hashing, LifeHacker, PDF, Power User, Security | Leave a Comment »

PDF24 Creator – Wikipedia

Posted by jpluimers on 2022/05/30

Just in case I ever need more features than the built-in PDF creator in Windows: PDF24 Creator – Wikipedia

PDF24 is free for commercial use and offers these features that the Windows built-in PDF support lacks:

  • Merge multiple PDF into one file
  • Rotating, extracting, inserting pages
  • Integrated preview for PDF editing
  • PDF encryption, decryption and signing
  • Change PDF information (author, title, etc.)
  • Compress and shrink PDF files
  • Add a watermark or stamp a PDF file
  • Combine pages with a digital paper
  • Convert to and from PDF
  • Multiple PDF printers for different purposes since 7.7.0
  • Full featured and lightweight PDF reader since version 8.7.0
  • Tesseract OCR engine since version 8.8.0
  • Blackening of PDF files since version 10.0.0

Via [Wayback/Archive.is] Software-update: doPDF 10.8.127 – Computer – Downloads – Tweakers (which mentions it does not need GhostScript)

–jeroen

Posted in PDF, Power User, Windows, Windows 10 | Leave a Comment »

Print Friendly & PDF

Posted by jpluimers on 2021/04/02

I bumped into [WayBack] Print Friendly & PDF: Make a Printer Friendly & PDF version of any webpage.

Though Chrome has built-in PDF output support, often web sites render like a mess with it.

Hopefully the above site makes better PDFs in those cases.

I will try to use it for those, and get back if it works.

Via:

 

–jeroen

Posted in PDF, Power User | Leave a Comment »

binaryfiles – How to convert PDF binary parts into ASCII/ANSI so I can look at it in a text editor? – Stack Overflow

Posted by jpluimers on 2020/06/30

The first hit of pdf binary to text – Google Search was [WayBack] binaryfiles – How to convert PDF binary parts into ASCII/ANSI so I can look at it in a text editor? – Stack Overflow has many options including:

Since I have qpdf installed on most systems:

Another useful tool to transform a PDF into an internal format that enables text editor access is qpdf. It is a “command-line program that does structural, content-preserving transformations on PDF files”.

Example usage:

 qpdf                                  \
   --qdf                               \
   --object-streams=disable            \
     input-with-compressed-objects.pdf \
     output-with-expanded-objects.pdf
  1. The output of the QDF-mode enforced by the --qdf switch organizes and re-orders the objects neatly. It adds comments to track the original object IDs and page content streams. All object dictionaries are written into a “normalized” standard format for easier parsing.
  2. The --object-streams=disable causes the extraction of (otherwise not recognizable) individual objects that are compressed into another object’s stream data.

The recompress is easy as per [WayBackQPDF Manual:

qpdf /tmp/uncompressed.pdf /tmp/compressed.pdf

The answer is by [WayBack] User Kurt Pfeifle – Stack Overflow who has many other interesting PDF related answers at:

Stackoverflow.com:

Superuser.com:

Serverfault.com:

–jeroen

Posted in Development, EPS/PostScript, PDF, Power User | Leave a Comment »