Posted April 09, 2016
rtcvb32: In theory, yes. However, computers aren't very good at object identification unless it's a 1:1 match.
Back in the '80s and '90s there was a technology called OCR, which builds a library of characters from fonts and then does its best to match everything against it. Unfortunately, you'd get 80% right, and static or glitches in some areas would make it get the rest wrong. Mind you, it was probably good enough for people who are blind, since redundancy in language is high enough that you can usually figure out what's wrong.
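For anyone curious what "build a library of characters and match against it" looks like, here is a minimal sketch. It assumes a made-up 3x3 pixel font (`TEMPLATES`) and a simple count of differing pixels as the score; real OCR engines were far more involved, so treat every name here as hypothetical.

```python
# Hypothetical 3x3 glyph library; '1' is an inked pixel, '0' is blank.
TEMPLATES = {
    "I": ("111", "010", "111"),
    "L": ("100", "100", "111"),
    "T": ("111", "010", "010"),
}

def mismatches(a, b):
    """Count the pixels that differ between two equally sized bitmaps."""
    return sum(pa != pb for row_a, row_b in zip(a, b) for pa, pb in zip(row_a, row_b))

def recognize(glyph):
    """Return the library character whose bitmap differs in the fewest pixels."""
    return min(TEMPLATES, key=lambda ch: mismatches(TEMPLATES[ch], glyph))

clean_t = ("111", "010", "010")     # a perfect scan of T
noisy_t = ("111", "011", "010")     # one stray pixel, still closest to T
glitched_t = ("111", "010", "111")  # two stray pixels, now identical to I

print(recognize(clean_t), recognize(noisy_t), recognize(glitched_t))
```

A single stray pixel is tolerated, but two in the wrong place turn the T into an I, which is the kind of glitch described above.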
Anyways, a lot of Captchas today warp the text, add static, or do other things that make identification harder, if not impossible. Unfortunately, a large portion of the time the images are so warped that I can't figure them out either. The natural organ we have in our heads for visually figuring things out is pointless when things are swirled, warped, inverted, and struck through, and you can't tell whether the random letters are an i, an l, a 1, or any number of other odd combinations. You effectively just blitz through them until you find a good, easy captcha, because they are crap.
mk47at: Have you noticed that a large number of Captchas consist of two separate parts? One is clearly generated to cause problems, and the other looks like a bad scan or some (slightly) blurry image. Google does this (or at least used to do this). They use the first part to check whether the answers are valid and the second part to get humans to do their OCR work for them. You could answer some garbage for the second part and it would still pass. I assume the image categorization Captchas work the same way. They get free learning data.
It's a clever idea. You provide a service that benefits other people and in turn you get a large number of free workers.
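The two-part scheme described above can be sketched roughly like this. It is a toy model, not Google's actual implementation: the class name `TwoWordCaptcha`, the `votes_needed` threshold, and the simple majority rule are all assumptions made for illustration.

```python
from collections import Counter

class TwoWordCaptcha:
    """Toy model: a control word with a known answer plus an unknown scanned word."""

    def __init__(self, control_answer, votes_needed=3):
        self.control_answer = control_answer.strip().lower()
        self.votes_needed = votes_needed  # assumed threshold, not a documented value
        self.votes = Counter()            # guesses collected for the unknown word

    def submit(self, control_guess, unknown_guess):
        """Pass or fail on the control word only; record the other guess as a vote."""
        if control_guess.strip().lower() != self.control_answer:
            return False
        self.votes[unknown_guess.strip().lower()] += 1
        return True

    def transcription(self):
        """Return the agreed reading of the unknown word, or None while undecided."""
        if not self.votes:
            return None
        word, count = self.votes.most_common(1)[0]
        return word if count >= self.votes_needed else None

captcha = TwoWordCaptcha("overlooks")
print(captcha.submit("overlooks", "asdf"))  # garbage for the second word still passes
for _ in range(3):
    captcha.submit("overlooks", "inquiry")
print(captcha.transcription())
```

Only the control word decides whether you pass, so a garbage second answer gets through, while honest answers pile up as votes until enough people agree on the scanned word.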