Hi everyone!

A few days ago I released Whishper, a new version of a project I’ve been working for about a year now.

It’s a self-hosted audio transcription suite, you can transcribe audio to text, generate subtitles, translate subtitles and edit them all from one UI and 100% locally (it even works offline).

I hope you like it, check out the website for self-hosting instructions: https://whishper.net

18 points

Does this need to connect to openai or does it function fully independently? Its for offline use.

permalink
report
reply
23 points

No, it’s completely independent, it does not rely on any third-party APIs or anything else. It can function entirely offline once the models have been downloaded.

permalink
report
parent
reply
11 points

The readme mentions “transcription time on CPU” so it’s probably running locally

permalink
report
parent
reply
10 points

How does it compare to https://github.com/guillaumekln/faster-whisper?

I’ve been using Faster Whisper for a while locally, and its worked out better than raw whisper and benchmarks really well. Just curious if there are any reasons to switch.

permalink
report
reply
20 points

Whishper uses faster-whisper in the backend.

Simply put, it is a complete UI for Faster-Whisper with extra features like transcription translation, edition, download options, etc…

permalink
report
parent
reply
2 points

Nice! Thanks.

permalink
report
parent
reply
8 points

how does whisper do transcribing technical documents. like for lawyers, doctors, engineers and what not? or speakers with heavy accents?

permalink
report
reply
9 points

Whisper models have a very good WER (word error ratio) for languages like Spanish, English, French… if you use the english-only models it also improves. Check out this page on the docs:

https://whishper.net/reference/models/#languages-and-accuracy

permalink
report
parent
reply
7 points

Congratulations on the launch and thanks for making this open-source! Not sure if this supports searching through all transcriptions yet, but that’s what I’d find really helpful. E.g. search for a keyword in all podcast episodes.

permalink
report
reply
6 points

That’s a great idea! I’ll attempt to implement that feature when I find some time to work on it.

permalink
report
parent
reply
6 points

Oh, awesome! Does it do speaker detection? That’s been one of my main gripes with Whisper.

permalink
report
reply
7 points
*

Unfortunately, not yet. Whisper per se is not able to do that. Currently, there are few viable solutions for integration, and I’m looking at this one, but all current solutions I know about need GPU for this.

permalink
report
parent
reply
2 points

VERY understandable, requiring a GPU would limit it’s application and spread, i hope a good GPU-less solution is found eventually

permalink
report
parent
reply

Selfhosted

!selfhosted@lemmy.world

Create post

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

  1. Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Community stats

  • 4.9K

    Monthly active users

  • 3.5K

    Posts

  • 75K

    Comments