this post was submitted on 08 Nov 2023
28 points (100.0% liked)

Free and Open Source Software

17933 readers
144 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
 

Today somebody in a group I'm in which has some accessibility issues was yet again complaining that their Dragon Speaking software was not playing nice with Firefox, which led me to see if there was an alternative, and surprisingly i found none workable at the plain user level beyond Dragon, and upgrading for that person might actually be costly (From what they say it starts at nearly $200 but apparently can go as high as $700? Not clear yet).

So, obviously now I'm checking about the FOSS side of things, a search has been inconclusive as i see stuff for developers, multiple different projects (which is a marked improvement from a decade ago when i last tried and failed to do this), but so far haven't found anything at the user level.

Have i overlooked something? Or is it that we're many years later still at the "building libraries" stage without actual user-level stuff people can just apt-get or download?

Quick edit: I must insist, is there something for USERS, not DEVELOPERS, that i have overlooked? APIs or commandline programs or learning models are not a software i can hand to my non-programmer friend to install on their computer to replace Dragon to help them write on Firefox

top 11 comments
sorted by: hot top controversial new old
[–] redw0rm@kerala.party 4 points 1 year ago

For PC's the whisper can be now run on many common laptops and desktops ( with almost good accuracy ) , thanks to projects like whisper.cpp and faster-whisper.

There also happens a lot of model's getting released by meta like wav2vec2 and MMS ( not seen using anywhere though )

I guess there are many python packages for aforementioned projects too.

[–] RymdLord@beehaw.org 3 points 1 year ago (2 children)

Not saying this is Foss or isn't but Futo Voice exists for android

[–] beyond@linkage.ds8.zone 3 points 1 year ago

It is in fact non-free. (The article is about Grayjay, a product from the same company that uses the same license)

[–] inasaba@lemmy.ml 2 points 1 year ago

https://voiceinput.futo.org/

I use it and it works very well.

[–] Butterbee@beehaw.org 3 points 1 year ago (1 children)

You could look into whisper. It's a neural net one from openai but they've actually opened it up and it can run locally.

[–] jherazob@beehaw.org 5 points 1 year ago (1 children)

That was one of the various projects for developers i saw and mentioned and it's not something my normal user level friend can just install in their computer unassisted to start writing stuff for them on Firefox, i imagine that somebody could develop something using that as a backend but (again, unless i'm overlooking it) i don't see anything at the moment even in alpha state

[–] Butterbee@beehaw.org 3 points 1 year ago

It's definitely not ready to hand off to a regular user to install and use, you're right. I don't think there is an off the shelf alternative.

[–] beyond@linkage.ds8.zone 3 points 1 year ago

People are recommending the proprietary FUTO voice app. This app, and other FUTO apps like Greyjay, are non-free.

https://hiphish.github.io/blog/2023/10/18/grayjay-is-not-open-source/

[–] jbstep@lemm.ee 3 points 1 year ago

Talon Voice is aimed at developers, but I'm using it as an alternative to Dragon right now. Not as user-friendly as Dragon, but very functional.

[–] realslef@fedia.io 2 points 1 year ago

There's sayboard and some other Vosk-based tools but only on android, as far as I know.

[–] wintermute@feddit.de 1 points 1 year ago

Maybe check out your browsers web speech api