GitHub – ggerganov/whisper.cpp: Port of OpenAI’s Whisper model in C/C++ « The Wiert Corner

All categories

November 2024
M	T	W	T	F	S	S
	1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30

GitHub – ggerganov/whisper.cpp: Port of OpenAI’s Whisper model in C/C++

Posted by jpluimers on 2024/11/20

For future experimentation transcribing voice conversations: [Wayback/Archive] GitHub – ggerganov/whisper.cpp: Port of OpenAI’s Whisper model in C/C++

Whisper (speech recognition system) usually runs in the cloud (someone else’s computers, often rentable for a substantial monthly sum).

Via

[Wayback/Archive] Jeroen Wiert Pluimers: “Wat is een goede tool voor transcriptie van Nederlandse tekst voor hobbymatig gebruik?…” – Mastodon
[Wayback/Archive] bert hubert 🇺🇦🇪🇺: “@wiert whisper.cpp als je handig bent…” – Fosstodon

Now hopefully Whisper works well with the Dutch language…

I later realised Jeff Geerling mentioned Whisper a while ago as well:

and [Wayback/Archive] Jeff Geerling on X: “Here’s how I use Whisper to easily create accurate subtitles for every YouTube video across all three of my channels: … More content creators should do this—can’t speak to non-English languages, but for English, it’s eerily accurate.” / X (edit 20250130: added this Twitter)

and even earlier: [Wayback/Archive] Lior⚡ on X: “You can now transcribe 2.5 hours of audio in 98 seconds, locally…”

You can now transcribe 2.5 hours of audio in 98 seconds, locally.

A new implementation called insanely-fast-whisper is blowing up on Github.

It works on works on Mac or Nvidia GPUs and uses the Whisper + Pyannote library speed up transcriptions and speaker segmentations.

Here’s how you can use it:

pip install insanely-fast-whisper

insanely-fast-whisper --file-name <FILE NAME or URL> --batch-size 2 --device-id mps --hf_token <HF TOKEN>

[Wayback/Archive] GitHub – Vaibhavs10/insanely-fast-whisper

[Wayback/Archive] Tweet JSON

[Wayback/Archive] video.twimg.com/ext_tw_video/1730306137642426368/pu/vid/avc1/406×270/nCGTfwa7_IV7YJM-.mp4

[Wayback/Archive] video.twimg.com/ext_tw_video/1730306137642426368/pu/vid/avc1/542×360/vSQ548Z_wVfJ2ZxU.mp4

[Wayback/Archive] video.twimg.com/ext_tw_video/1730306137642426368/pu/vid/avc1/966×640/hgsBIk36RbALXN0D.mp4

--jeroen

This entry was posted on 2024/11/20 at 18:00 and is filed under AI and ML; Artificial Intelligence & Machine Learning, C++, Development, LifeHacker, Power User, Software Development. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

	Jeroen Wiert Pluimer… on Pie Comic by John McNamee: Mov…
	Attila Kovacs on Crowbarring Windows 95 into Wi…
	Jeroen Wiert Pluimer… on Does Odido (the old T-Mobile N…
	Lars Fosdal on Security alarm provider Woonve…
	Thomas Mueller on Question got closed in May 202…

The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

Subscribe

Archives

Recent Comments

Recent Posts

Blog Stats

Meta title

Tag Cloud Title

Top Clicks

Top Posts

My badges

Twitter Updates

My Flickr Stream

Pages

All categories

Email Subscription

GitHub – ggerganov/whisper.cpp: Port of OpenAI’s Whisper model in C/C++

Leave a comment Cancel reply

The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

Subscribe

Archives

Recent Comments

Recent Posts

Blog Stats

Meta title

Tag Cloud Title

Top Clicks

Top Posts

My badges

Twitter Updates

My Flickr Stream

Pages

All categories

Email Subscription

GitHub – ggerganov/whisper.cpp: Port of OpenAI’s Whisper model in C/C++

Rate this:

Share this:

Related

Leave a comment Cancel reply