this post was submitted on 21 Sep 2023
358 points (98.9% liked)
Privacy
32041 readers
1120 users here now
A place to discuss privacy and freedom in the digital world.
Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.
In this community everyone is welcome to post links and discuss topics related to privacy.
Some Rules
- Posting a link to a website containing tracking isn't great, if contents of the website are behind a paywall maybe copy them into the post
- Don't promote proprietary software
- Try to keep things on topic
- If you have a question, please try searching for previous discussions, maybe it has already been answered
- Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
- Be nice :)
Related communities
Chat rooms
-
[Matrix/Element]Dead
much thanks to @gary_host_laptop for the logo design :)
founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Wow that’s awesome hopefully they open source it and make it easy for anyone to use
Google's recaptcha is used to help train image-recognition programs (like all those ones having you identify signs/bicycles/stoplights help Google's self-driving cars)
They used to have the 2 words, where one word was the control and one word was from a physical book that Google was trying to digitalize
same deal with the ones with matching the orientation of something with a given direction being pointed at; it's training 2d-to-3d AI image generators
How does that make it acceptable? I’ve had to do like 6 minutes of free labor in the past 10 years. Google owes me like $6-10 for that bullshit.
Since i started getting in to online privacy, I've been stuck in captcha-purgatory a lot...it just keeps loading a never ending stream of new captchas for me to solve.
Yeah, there was a chunk of time where there was some spat going on between them and VPN providers, and whenever I would wind up on an exposed VPN server every single captcha would take over a minute of clicking through different prompts. Happened so frequently for the fastest server that I just switched to Firefox and DuckDuckGo because I couldn't stand getting hit every single time I googled something. Not just every session, but literally every single search.
The worst was when it would test me for several minutes straight, and then have the gall to tell me to start over again. Google's really been racing to the bottom lately.
Yeah I noticed now that Google maps has little icons for traffic light positions. Years of our work and AI learning for....fuck all. I CAN SEE THE FUCKIN LIGHTS IRL
Does lemmy signup use reCAPTCHA? Because I’m starting to think I’m a robot when trying to sign up new accounts. lol. I’ve never had such a hard time getting it right on the first time. Or maybe my eyesight is just getting bad.
I always use the audio for Lemmy captcha because some of the letters are ambiguous
The cloudflare ones are just as bad
Wouldn't it be significantly easier to bypass if it were open source?
This isn't a problem of security, this is a problem of deciphering between human and non human users.
Encryption is generally "open source" and that's what makes is strong. Security does not come from people not knowing how things work, but by having properly designed things that work whether people know how they work or not.
It doesn't seem to me like encryption is comparable here. With encryption we have known algorithms that are harder to reverse than initially run. This is a completely different problem, where many inputs are taken and some algorithm has to decide if they are human or not. What digital task can a human do that a robot can't in the same way, especially if the robot knows exactly the measures it should aim for?
exactly what recaptcha does, for example. Knowing that you have to type a word because a computer failed to identify which word is it makes creating a program that does that no easier. Same with the image ones. While criptography is a different problem, the argument is the same: you want something that can be verified to be hard to break otherwise someone will eventually figure it out
If you have a known algorithm for generating those hard-to-read images, then it really wouldn't be that difficult to generate a large enough set yourself to train a custom ML model to solve them. The same would apply to audio challenges.
Only one person would need to do it then they could share the process, potentially automating others being able to bypass as well.
I like the idea of captcha being open, but unlike encryption as far as I know we don't have a starting point on something that is actually easier for humans when all information is available. Until something like that exists, open sourcing to implement and improve it doesn't make sense if you want an effective product.
The text is not generated. It's from photos of books that failed ocr. The photos are then distorted to make it even harder in order to become that captcha. 2 words are used 1 is a control (to know if the response is correct), the other is one they what to know what says (to add to the pool of words and finish digitizing the book).
Nope
Explain
mCaptcha is an open source proof of work tool to tackle bots.