The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

Archive for the ‘OCR’ Category

Tesseract (software): amazing command-line OCR tool

Posted by jpluimers on 2022/05/13

A Twitter post blew me away by showing Tesseract (software) – Wikipedia doing perfect OCR on an image from a Twitter post.
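For anyone who wants to try the command-line tool from a script: below is a minimal sketch in Python, not the command from the tweet. It assumes the `tesseract` binary is installed and on your PATH; the helper names `build_tesseract_command` and `ocr_image` and the file name `tweet.png` are made up for illustration.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, lang="eng"):
    """Build the argument list for a Tesseract run that prints to stdout.

    "stdout" as the output base tells Tesseract to print the recognised
    text instead of writing a .txt file; "-l" selects the language data.
    """
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path, lang="eng"):
    """Run Tesseract on one image and return the recognised text."""
    # Assumption: the tesseract binary is installed separately.
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract is not installed or not on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract fails
    )
    return result.stdout
```

With Tesseract installed, `print(ocr_image("tweet.png"))` would print whatever text it recognises in that (hypothetical) image.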

Read the rest of this entry »

Posted in C++, Color (software development), Development, OCR, Power User, Software Development, Tesseract

One of the coolest Twitter bot commands: @AltTextCrew OCR please

Posted by jpluimers on 2021/10/28

Twitter account [Archive.is] @AltTextCrew is cool: it can OCR text from images, which is great for visually impaired people.

Just reply to a tweet containing such an image with the command below, and it answers with a series of tweets containing the text of that image:

@AltTextCrew OCR please

You can also have it check and analyse the links in a tweet; just reply to that tweet with:

@AltTextCrew analyze links

[Archive.is] @hbeckpdx is the driving force behind both @AltTextCrew and [Archive.is] @AltTxtReminder:

Edit 20220510: AltTxtReminder got open sourced!

Below are two examples of @AltTextCrew usage:

OCR

  • image: [Archive.is] databass 🏳️‍⚧️⚢ on Twitter: “@AltTextCrew OCR please… “

  • text: [Wayback] Thread by @AltTextCrew on Thread Reader App – Thread Reader App

    Text 1/5:
    CVE-2021-20022 Arbitrary file upload through post- authenticated “branding” feature Like many enterprise products with a web- based user interface, SonicWall Email Security includes a feature known as
    Text 2/5:
    “branding” which gives administrators the ability to customize and add certain assets to the interface, such as company logos. These branding assets are managed via packages, and new packages can be
    Text 3/5:
    created by uploading ZIP archives containing custom text, image files, and layout settings. A lack of file validation can enable an adversary to upload arbitrary files, including executable code, such
    Text 4/5:
    as web shells. Once uploaded, these branding package ZIP archives are normally expanded and saved to the <SonicWall ES install path>\data\branding directory. However, an adversary could place
    Text 5/5:
    malicious files in arbitrary locations, such as a web accessible Apache Tomcat directory, by crafting a ZIP

Link analysis

Explanation

I really want to know what programming languages, frameworks, libraries and APIs they use for this bot.

Edit 20211028:

It uses the Google Vision API, as Tesseract was too slow and inaccurate:

Edit 20211211:

Note that the bot usually publishes the text as alt text on images; a thread is only the fallback:

[Archive] Hannah Kolbeck 🏳️‍⚧️ on Twitter: “@jpluimers @AltTextCrew No, it always prefers to tweet images with alt text. Right now if the ocr result from the targeted tweet is too long to fit in 4 images worth it will fall back to posting a thread.” / Twitter
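The fallback thread described in that reply, with the "Text 1/5:" style numbering from the OCR example, could be produced roughly like this. It is a guess at the idea, not the bot's actual code: the 280 character limit, the helper name `split_into_thread` and the choice to break at word boundaries are all my assumptions.

```python
import textwrap

TWEET_LIMIT = 280  # assumption: the usual tweet length limit


def split_into_thread(text, limit=TWEET_LIMIT):
    """Split OCR text into numbered tweets: "Text 1/3:", "Text 2/3:", ...

    Breaks at word boundaries. Room is reserved for a header up to
    "Text 99/99:", so this sketch assumes fewer than 100 tweets.
    """
    reserve = len("Text 99/99:\n")
    chunks = textwrap.wrap(text, width=limit - reserve)
    total = len(chunks)
    return [
        f"Text {number}/{total}:\n{chunk}"
        for number, chunk in enumerate(chunks, start=1)
    ]
```

Each returned string is one tweet of the thread; an empty OCR result gives an empty list.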

–jeroen

Posted in OCR, Power User, SocialMedia, Twitter, TwitterBot

RaiMan’s SikuliX: Automate what you see on a computer monitor

Posted by jpluimers on 2018/09/05

On my research list:

Automate what you see on a computer monitor

Source: [WayBack] RaiMan’s SikuliX

Repositories:

It is an evolution of [WayBack] Sikuli Script – Home, which has another fork that can be automated with PowerPoint slides:

I should play with it: [WayBack] SikuliX – QUICKSTART

Via: [WayBack] Any recommendations of automation tools for GUI testing. We tried AutoIT but it had some problems and way too technical… – Tommi Prami – Google+

–jeroen

Posted in Agile, Development, OCR, Power User, Software Development, Tesseract, Testing

Project Naptha

Posted by jpluimers on 2014/04/23

Interesting, and not only because it is available as a Chrome extension:

Project Naptha automatically applies state-of-the-art computer vision algorithms on every image you see while browsing the web. The result is a seamless and intuitive experience, where you can highlight as well as copy and paste and even edit and translate the text formerly trapped within an image.

–jeroen

via: Project Naptha.

Posted in OCR, Power User

 