All categories

August 2026
M	T	W	T	F	S	S
	1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30
31

Archive for the ‘OCR’ Category

Tesseract (software): amazing command-line OCR tool

Posted by jpluimers on 2022/05/13

A twitter post blasted me away by showing the results of Tesseract (software) – Wikipedia doing perfect OCR on an image from a twitter post:

Read the rest of this entry »

Posted in C++, Color (software development), Development, OCR, Power User, Software Development, Tesseract | Leave a Comment »

One of the coolest Twitter bots commands: @AltTextCrew OCR please

Posted by jpluimers on 2021/10/28

Twitter account [Archive.is] @AltTextCrew is cool: it can OCR text from images, which is great for visually impaired people.

Just answer a tweet containing such an image and it replies with a series of tweets with the texts of that image.

@AltTextCrew OCR please

You can also have it check and analyse the links from a tweet, just reply this to that tweet:

@AltTextCrew analyze links

[Archive.is] @hbeckpdx is the driving force behind both @AltTextCrew and [Archive.is] @AltTxtReminder:

[Archive.is] databass 🏳️‍⚧️⚢ (@hbeckpdx) | Twitter

[Archive.is] World, I’d like you to meet @AltTextCrew!

Are you a live-tweeter that needs help describing the media they post? DM me on this account.

Are you willing to help them to describe their content? Follow the bot. Thanks to @fraxstal for the idea!

[Archive.is] Alt Text Reminder on Twitter:

Just followed? It may take 15 minutes for reminders to start

Sometimes tweets non-bot things about alt text, those will be tagged #AltTxtReminderOOC

To opt-out, block the bot

DMs not checked, message @hbeckpdx instead

Written/Maintained by a sighted person

Edit 20220510: AltTxtReminder got open sourced!

[Wayback/Archive] Alt Text Reminder on Twitter: “Finally got around to throwing code up on Github. This was my first time writing Javascript in anger, so please be kind 😊 … #AltTxtReminderOOC”

[Wayback/Archive] alt-text-org/AltTxtReminder: A twitter bot for reminding folks to use alt text

Below are two examples of @AltTextCrew usage:

OCR

image: [Archive.is] databass 🏳️‍⚧️⚢ on Twitter: “@AltTextCrew OCR please… “
text: [Wayback] Thread by @AltTextCrew on Thread Reader App – Thread Reader App

Text 1/5:
CVE-2021-20022 Arbitrary file upload through post- authenticated “branding” feature Like many enterprise products with a web- based user interface, SonicWall Email Security includes a feature known as
Text 2/5:
“branding” which gives administrators the ability to customize and add certain assets to the interface, such as company logos. These branding assets are managed via packages, and new packages can be
Text 3/5:
created by uploading ZIP archives containing custom text, image files, and layout settings. A lack of file validation can enable an adversary to upload arbitrary files, including executable code, such
Text 4/5:
as web shells. Once uploaded, these branding package ZIP archives are normally expanded and saved to the <SonicWall ES install path>\data\branding directory. However, an adversary could place
Text 5/5:
malicious files in arbitrary locations, such as a web accessible Apache Tomcat directory, by crafting a ZIP

Link analysis

Explanation

I really want to know what programming languages, frameworks, libraries and APIs they use for this bot.

Edit 20211028:

It uses the Google Vision API, as Tesseract was too slow and inaccurate:

Edit 20211211:

Note that usually the text will be published in the alt tag of the images:

[Archive] Hannah Kolbeck 🏳️‍⚧️ on Twitter: “@jpluimers @AltTextCrew No, it always prefers to tweet images with alt text. Right now if the ocr result from the targeted tweet is too long to fit in 4 images worth it will fall back to posting a thread.” / Twitter

–jeroen

Read the rest of this entry »

Posted in OCR, Power User, SocialMedia, Twitter, TwitterBot | Leave a Comment »

RaiMan’s SikuliX: Automate what you see on a computer monitor

Posted by jpluimers on 2018/09/05

On my research list:

Automate what you see on a computer monitor

Source: [WayBack] RaiMan’s SikuliX

Repositories:

Stable: https://github.com/RaiMan/SikuliX-2014
Development: https://github.com/RaiMan/SikuliX2

It is an evolution of [WayBack] Sikuli Script – Home that has an other fork that can be automated with PowerPoint slides:

I should play with it: [WayBack] SikuliX – QUICKSTART

Via: [WayBack] Any recommendations of automation tools for GUI testing.We tried AutoIT but it had some problems and way too technical… – Tommi Prami – Google+

–jeroen

Read the rest of this entry »

Posted in Agile, Development, OCR, Power User, Software Development, Tesseract, Testing | Leave a Comment »

Project Naptha

Posted by jpluimers on 2014/04/23

Interesting, not only because it is available as Chrome Extension:

Project Naptha automatically applies state-of-the-art computer vision algorithms on every image you see while browsing the web. The result is a seamless and intuitive experience, where you can highlight as well as copy and paste and even edit and translate the text formerly trapped within an image.

–jeroen

via: Project Naptha.

Posted in OCR, Power User | Leave a Comment »

	xyzzy, Relay Confere… on Sad and Useless about Competit…
	jpluimers on Windows warned me of disk full…
	jpluimers on Started making people walk me…
	jpluimers on Stack Overflow’s forum is dead…
	jpluimers on Some links on getting the most…

The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

Subscribe

Archives

Recent Comments

Recent Posts

Blog Stats

Meta title

Tag Cloud Title

Top Clicks

Top Posts

My badges

Twitter Updates

My Flickr Stream

Pages

All categories

Email Subscription

Archive for the ‘OCR’ Category

Tesseract (software): amazing command-line OCR tool

One of the coolest Twitter bots commands: @AltTextCrew OCR please

OCR

Link analysis

Explanation

RaiMan’s SikuliX: Automate what you see on a computer monitor

Project Naptha

The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

Subscribe

Archives

Recent Comments

Recent Posts

Blog Stats

Meta title

Tag Cloud Title

Top Clicks

Top Posts

My badges

Twitter Updates

My Flickr Stream

Pages

All categories

Email Subscription

Archive for the ‘OCR’ Category

Tesseract (software): amazing command-line OCR tool

Rate this:

Share this:

One of the coolest Twitter bots commands: @AltTextCrew OCR please

OCR

Link analysis

Explanation

Rate this:

Share this:

RaiMan’s SikuliX: Automate what you see on a computer monitor

Rate this:

Share this:

Project Naptha

Rate this:

Share this: