this post was submitted on 28 Aug 2023
224 points (99.1% liked)

Hi everyone!

A few days ago I released Whishper, a new version of a project I've been working on for about a year now.

It's a self-hosted audio transcription suite: you can transcribe audio to text, generate subtitles, translate subtitles, and edit them, all from one UI and 100% locally (it even works offline).

I hope you like it, check out the website for self-hosting instructions: https://whishper.net

top 21 comments
[–] webghost0101@sopuli.xyz 10 points 1 year ago (2 children)

Does this need to connect to OpenAI or does it function fully independently? It's for offline use.

[–] pluja@lemmy.world 16 points 1 year ago

No, it's completely independent, it does not rely on any third-party APIs or anything else. It can function entirely offline once the models have been downloaded.

[–] pcouy@lemmy.pierre-couy.fr 7 points 1 year ago

The readme mentions "transcription time on CPU" so it's probably running locally

[–] fmstrat@lemmy.nowsci.com 6 points 1 year ago (1 children)

How does it compare to https://github.com/guillaumekln/faster-whisper?

I've been using Faster Whisper locally for a while, and it's worked out better than raw Whisper and benchmarks really well. Just curious if there are any reasons to switch.

[–] pluja@lemmy.world 1 points 1 year ago

Whishper uses faster-whisper in the backend.

Simply put, it is a complete UI for Faster-Whisper with extra features like transcription translation, editing, download options, etc.

[–] rikudou@lemmings.world 6 points 1 year ago* (last edited 1 year ago)

Nice, congrats!

a meme with a photo of Richmond Valentine from Kingsman, the bottom text says whishper

[–] ares35@kbin.social 5 points 1 year ago (1 children)

How does Whisper do at transcribing technical documents, like for lawyers, doctors, engineers and whatnot? Or speakers with heavy accents?

[–] pluja@lemmy.world 6 points 1 year ago

Whisper models have a very good WER (word error rate) for languages like Spanish, English, French... If you use the English-only models, the results improve further. Check out this page in the docs:

https://whishper.net/reference/models/#languages-and-accuracy
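For anyone who wants to measure WER on their own recordings (legal, medical, accented speech), here is a minimal sketch of the standard definition: word-level edit distance divided by the number of words in the reference transcript. The function name and the lowercase/whitespace normalization are assumptions made for illustration; published benchmarks usually apply more careful text normalization, so numbers from this sketch will not match them exactly.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / words in reference."""
    # Assumption: simple lowercase + whitespace tokenization.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

Compare a hand-corrected transcript (reference) against the raw model output (hypothesis) for a few representative files to get a rough idea of how a given model size handles your domain.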

[–] micha@lemmy.sdf.org 5 points 1 year ago (1 children)

Congratulations on the launch and thanks for making this open-source! Not sure if this supports searching through all transcriptions yet, but that’s what I’d find really helpful. E.g. search for a keyword in all podcast episodes.

[–] pluja@lemmy.world 4 points 1 year ago

That's a great idea! I'll attempt to implement that feature when I find some time to work on it.

[–] obinice@lemmy.world 3 points 1 year ago

I've been looking for a tool to do this for YEARS, my god! Years!!! ❤️❤️

[–] UberMentch@lemmy.world 3 points 1 year ago* (last edited 1 year ago) (1 children)

Would love to deploy this, but unfortunately I'm running older server equipment that apparently doesn't support MongoDB 5 (error message: "MongoDB 5.0+ requires a CPU with AVX support, and your current system does not appear to have that!"). I tried deploying with both 4.4.18 and 4.4.6 and can't get it to work. If anybody has some recommendations, I'd appreciate hearing them!
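To check up front whether a host will hit the AVX error quoted above, the CPU flags can be inspected before deploying. This is a minimal sketch that assumes an x86 Linux host exposing `/proc/cpuinfo`; the helper name `has_avx` is made up for illustration.

```python
def has_avx(cpuinfo_text: str) -> bool:
    """Return True if the 'flags' line of /proc/cpuinfo text lists 'avx'."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            # Flags are space-separated; match the exact token so that
            # 'avx2' or 'avx512f' alone does not count as 'avx'.
            return "avx" in value.split()
    return False


if __name__ == "__main__":
    try:
        with open("/proc/cpuinfo") as f:
            print("AVX supported:", has_avx(f.read()))
    except FileNotFoundError:
        print("/proc/cpuinfo not found (assumption: Linux host)")
```

If this reports no AVX support, MongoDB 5.0+ will refuse to start on that machine, which matches the error message above.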

[–] pluja@lemmy.world 1 points 1 year ago

I'm glad you were able to solve the problem. I'm adding the comment I made to another user with the same problem:

Didn't know about this problem. I'll try to add a MariaDB alternative database option soon.

[–] Railcar8095@lemm.ee 3 points 1 year ago

Massive kudos. I had the need for something like this in the past and it would have been a blessing. Surely it will be for somebody else.

[–] orizuru@lemmy.sdf.org 2 points 1 year ago

Congrats, and thank you for releasing this!

Maybe there's a couple of personal projects I could use it for...

[–] morethanevil@lemmy.fedifriends.social 2 points 1 year ago (1 children)

I saw your project on Codeberg before. Then it was whisper plus. Since whisper+ it did not work anymore for me. I uploaded a file and it did not start. The old whisper worked. Did not try it for months anymore with whisper plus.

Maybe I'll give it another try. Can I use bind mounts or are there special permissions? Anyway, thanks for your work.

[–] pluja@lemmy.world 6 points 1 year ago

Whisper+ had some problems, that's why I rewrote everything. This new version should fix almost everything (there may be some bugs I haven't found).

If you take a look at the docker-compose file, you'll see it is already using bind mounts. The only special permission needed is for the LibreTranslate models folder, which runs as non-root with user 1032.
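Since the LibreTranslate container runs as user 1032, the bind-mounted models folder on the host presumably needs to be owned (or at least writable) by that UID. Here is a minimal sketch for verifying ownership before starting the stack; the helper name and the folder path in the usage comment are hypothetical, so use the path from your own docker-compose file.

```python
import os


def owned_by(path: str, uid: int) -> bool:
    """Return True if the file or directory at `path` is owned by `uid`."""
    return os.stat(path).st_uid == uid


# Example usage (hypothetical path, adjust to your bind mount):
#   if not owned_by("./libretranslate/models", 1032):
#       print("models folder is not owned by UID 1032")
```

If the check fails, changing the folder's owner to UID 1032 on the host (as root) is the usual fix.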

[–] Konraddo@lemmy.world 1 points 1 year ago (1 children)

Just tried this out but couldn't get it to work until downgrading mongo to 4.4.6 because my NAS doesn't have AVX support. But then, mongo stays unhealthy. No idea why.

[–] pluja@lemmy.world 1 points 1 year ago

Didn't know about this problem. I'll try to add a MariaDB alternative database option soon to solve this.

[–] midas@ymmel.nl 1 points 1 year ago

Awesome will give this a try

[–] crazygoat@lemmy.world 1 points 10 months ago

This is also a good sound-to-text converter and a good AI transcription service