CAPTCHA How Important It Is
CAPTCHA How Important It Is
Amal A. AlSuwaidan
King Saud University
Riyadh, K.S.A
P.O Box 291772, Riyadh 11362
[email protected]
ABSTRACT
Since browsing internet became a daily need for all people, which means it should satisfies the security and
accessibility issues. CAPTCHA became an important part of internet browsing and it is used to prevent the
bots illegal access to the web sites. On the other hand, CAPTCHA is inaccessible, especially for the blind
persons, visually impaired or dyslexic. This paper is about CAPTCHA and its history, advantages,
applications, disadvantages and future.
Keywords
Web accessibility, CAPTCHA accessibility, CAPTCHA and special needs people, and CAPTCHA alternatives.
1.INTRODUCTION:
In this paper I will talk about CAPTCHA and some its related issues. At first I will talk about CAPTCHA history
and related work. Then I will talk about how to create CAPTCHA and how does CAPTCHA work. After that I
will show CAPTCHA benefits. Then I will discuss the CAPTCHA accessibility survey. After that I will talk about
CAPTCHA accessibility and brief discussing about some CAPTCHA alternatives. Finally, I will talk about
CAPTCHA future.
2.CAPTCHA BACKGROUND AND RELATED WORK:
Bots are software applications that run automated tasks over the Internet. Typically, bots perform tasks that
are both simple and structurally repetitive at a much higher rate than would be possible for a human alone.
Some kinds of bots are harmful when attacked a web site. In 1996, Moni Naor discuses several ways to verify
that a request comes from a human and not bots. In his paper "Verification of a human in the loop or
Identification via the Turing Test". Then in 1997, Primitive CAPTCHAs produced at AltaVista by Andrei Broder
and his colleagues to prevent bots from adding URLs to their search engine. CAPTCHA is an acronym for
"Completely Automated Public Turing test to tell Computers and Humans Apart.". Then, in 2000, Luis von
Ahn and Manuel Blum coined the term 'CAPTCHA', improved and publicized the notion, which included any
program that can distinguish humans from computers. They invented multiple examples of CAPTCHAs,
including the first CAPTCHAs to be widely used, which were those adopted by Yahoo! but the trademark
application was abandoned on 21 April 2008 [1].
The CAPTCHA can be described as a picture contains distorted letters to ensure that the user is a human not
bots. These pictures can not be read by bots because CAPTCHA is resistant to OCR (Optical Character
Recognition). e.g. (see figure 1 and figure 2).
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies
are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy
otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission.
The First MiniConference in Web Technologies and Trends (WTT)
© 2009 Information Technology Department, CCIS, King Saud University, Riyadh, Saudi Arabia
Figure 1: Hotmail CAPTCHA Figure2: Yahoo! CAPTCHA
CAPTCHA spreads widely in the web sites. As a result, CAPTCHA must satisfy two main issues the security
and the accessibility. CAPTCHA already satisfies the security issues by providing a great protection from
bots. On the other hand, CAPTCHA faces a big challenge with accessibility issue. Since, CAPTCHA rely on
the picture vision which could be hard to the normal users and impossible to the special needs users.
A lot of internet users trying to avoid CAPTCHA and provide some alternative but until now CAPTCHA still the
most popular. Matt May on his paper "Inaccessibility of CAPTCHA" discuses this topic from several ways and
produce some alternatives.
3.HOW DOES CAPTCHA WORK?
CAPTCHA is fooling the bots by asking questions only human can answer it. CAPTCHA fooling bots by
generating pictures contain distorted letters different pictures with different letters in different shapes are
generated Then CAPTCHA asking the user to enter the letters shown in the picture. Bots can not recognize
each letter alone. After the user submits the answer CAPTCHA validate the answer.
4.HOW TO CREATE CAPTCHA?
CAPTCHA can be written using some programming languages like PHP and ASP. The code should provide
three main functions. First, the code should generate a random picture with different properties. Second,
validate the user answer. Third, make these pictures secure. Also, there are many things to make the code
more reliable like" Rotate the text randomly, add random spaces in between characters, use a TTF fonts and
change the font randomly every time, use a random text and image size every time, use more advanced text
distortion and colours, move the lines randomly, store the password in a random cookie"[2]. In addition, there
are CAPTCHA creator programs which allow users to choose their CAPTCHA shapes.
5.CAPTCHA BENEFITS: [3]
5.1CAPTCHA security services:
CAPTCHA provides several services in security, including:
• Preventing Comment Spam in Blogs.
CAPCHA prevents spam replays without asking the user to sign up.
• Protecting Website Registration.
Ensure that the users who sign up for free services such email are human not bots.
• Protecting Email Addresses from Scrapers.
CAPTCHA hide the email address and asked the users to enter the text in the picture to show the email.
This help to prevent scrapers from sending to the emails.
• Online Polls.
In online poll asking bots can vote thousand of times. CAPTCHA allows humans only to vote.
• Preventing Dictionary Attacks.
CAPTCHAs can also be used to prevent dictionary attacks in password. The idea is simple: prevent a
computer from being able to iterate through the entire space of passwords systems by requiring it to solve a
CAPTCHA after a certain number of unsuccessful logins.
• Search Engine Bots.
Since, search engine bots are usually belong to large companies, respect web pages that don't want to allow
them in. However, in order to truly guarantee that bots won't enter a web site, CAPTCHAs are needed.
• Worms and Spam
CAPTCHA protects email from worm and spam by ensuring that the sender is a human not bot.
5.2ReCAPTCHA project:
ReCAPTCHA is a free CAPTCHA service that helps to digitize books, newspapers and old time radio shows.
Since solving CAPTCHA consume more than 150,000 hours of work each day. ReCAPTCHA is trying to
make appositive use of this human effort. ReCAPTCHA does that by channelling the effort spent solving
CAPTCHAs online into "reading" books.
Digitizing physical books is by scanning the book pages photographically, and then transformed into text
using "Optical Character Recognition" (OCR). Since OCR is not perfect reCAPTCHA improves the process of
digitizing books by sending each word that cannot be read correctly by OCR is placed on an image and used
as a CAPTCHA.
But if a computer can't read such a CAPTCHA, system knows the correct answer by "giving the word that can
not be read correctly by OCR to a given user in conjunction with another word for which the answer is already
known"[3], if the user answered the word already known correctly the system assumes their answer is correct
for the new one. Then, the system gives the same world to a number of other users to determine the correct
answer.
6.CAPTCHA ACCESSIBILTY SURVEY:
In 7 December, 2008 CAPTCHA accessibility survey was created. The survey contains two parts focuses on
CAPTCHA accessibility and some of it is alternatives. The survey covered 118 internet users for part 1 and
111 internet users for part 2 the responders was from different abilities, ages, cultures and interested. The
coming sections will discuss the left of CAPTCHA issues and the survey results. In addition, the full survey
results are provided in the below graphs.
0 20 40 60 80 100 120 0 10 20 30 40 50 60 70 80
Is solving CPTHCHA hard and complicated? How much do you know about CAPTCHA
benefits ?
Always, 5
Sometime a lot,
, 64 33the first
This is a little,
Never, 49 time I hear 61
about
CAPTCHA
0 20 40 60 80 0 20 24 40 60 80
Do you think CAPTCHA is difficult because it Who many times you solve CAPTCHA
is written in English? pictures per week ?
<10 times,
Always, 5 104
Sometimes, 10 to 20, 12
12 Never, >20 times, 2
101
What is the average tim e you need to solve the If you have a w e b site or blog or in case you
CAPTCHA? w ill have , are you interes ting to us e the
CAPTCHA?
<10 sec, 52
10sec to
30sec, 57 Yes, 25
>30 sec, 8 Maybe, 65
I can not No, 28
pass it, 1
0 20 40 60 80
0 10 20 30 40 50 60
If you have a w eb site, do you use CAPTCHA Do you pre fer to have som e CAPTCHA
alternatives? alte rnative s ?
Yes, 2 Yes, 29
No, 11 No, 4 Depend on
I don't have
these
a w eb site.,
alternatives,
105
85
0 20 40 60 80 100 120 0 20 40 60 80 100
If have any re ading difficultie s, Do you think The alte rnative w ill be a program that doe s
the audio alternative are e nough? not re quire any thing from the us e r?
Excellent,
Yes, 11 48
No, 12 I don't have Good, 41
any reading
Bad, 22
dif f iculties,
95
0 20 40 60 80 100 0 10 20 30 40 50 60
The alternative w ill be a calculation like The alternative w ill be an arrangem ent of
(1+1=)? letters or num bers or spell letters in reverse
order
Excellent,
51 Excellent,
Good, 32 21
Good, 40
Bad, 28 Bad, 50
0 10 20 30 40 50 60 0 10 20 30 40 50 60
Excellent, 11
Good, 30
Bad, 70
0 20 40 60 80
Excellent, 19
Good, 41
Bad, 51
0 10 20 30 40 50 60
7.CAPTCHA ACCESSIBILITY:
The accessibility is an important issue in the web. CAPTCHA is an important part of the web so, it should
satisfy this issue. In the accessibility survey 22.88% of responders said that CAPTCHA is bothering them and
62.71% of them said that CAPTCHA some times is bothering them. Actually, the internet users have different
abilities and skills. As a result, some thing that bothers the normal users could be very bothering to the
special needs users. In addition, to prevent bots illegal access we need to define something humans have but
bots don't have. CAPTCHA asking the users to distinguish between a set of letters this require human mind
and eyes. A lot of internet users are eye blind or visually impaired but still they are human. Also,
distinguishing between a set of letters is impossible for the dyslexic users again they are still human. As a
result, CAPTCHA requires skill and sense that not all humans have.
In fact, CAPTCHA provides security which is important but users are more important. Special needs users
can not add a comment or create an email. Since, the screen reader can not read CAPTCHA which, prevent
special needs users from using the web services.
The users are facing CAPTCHA challenge many times and they are spending a lot of time to pass CAPTCHA.
In the CAPTCHA accessibility survey 48.31% of responders spend between 30 to 10 seconds in solving
CAPTCHA picture and 88.14% of them facing CAPTCHA challenge less than 10 times in a week. Also, 0.85%
of responders said they can not pass CAPTCHA. Moreover, solving CAPTCHA is not always easy in the
survey 54.24% of responders said solving CAPTCHA is sometimes hard and complicated and 4.24% of them
said it is always hard and complicated and 41.53% of them said is it easy.
As a result, CAPTCHA is inaccessible and makes web services inaccessible too.
8.CAPTCHA ALTERNATIVES:
CAPTCHA main function is to provide a good protection for the web sites from bots. A lot of CAPTCHA
alternatives are provide the same protection with out bothering the users. CAPTCHA alternatives can be
separated in two types the first is alternatives that prevent bots access without asking the users to input any
thing. The second type is alternatives that prevent bots access by examining humans mind only. In the
CPATCHA accessibility survey 24.58% of responders prefer any CAPTCHA alternatives and 72.03% for them
said it is depended on these alternatives. Also, 88.98% of the responders don't have websites but 1.96% of
the responders who have a web sites said they are using alternatives.
In fact, alternatives are facing some accessibility and security issues. Sometimes, alternatives are not
providing full protection also, they are not always accessible.
In the coming part I will show some CAPTCHA alternatives and some of their advantages and disadvantages.
8.1 Alternatives that require a user input:
8.1.1 Audio alternative:
A lot of web site using the audio alternative as solution to CAPTCHA inaccessibility. Audio alternative
contains a distorted voice that says a set of number or letters and asking the user to enter it. Although, the
audio alternative can help the special need users to pass CAPTCHA, a lot of websites don't use it like Yahoo!,
MySpace and Friendster.
Audio alternative it is not perfect. The voice some times could be difficult to solve. In the survey 14.41% of
responders are having reading difficulties. 10.17% of them responded that the audio alternative is not enough
to be considered as CAPTCHA alternative.
Audio alternative can not be used alone with out CAPTCHA picture. Since, CAPTCHA is inaccessible for the
blind users; audio alternative will be inaccessible for the deaf users.
To make audio alternative better it should be improved to give a clear voice to the human abilities also, to
make it available in all the web sites. Web sites administrators should understand the limited abilities for the
special need users and show some respect for them.
At last, Audio alternative is not the best solution but on the other hand it provides an appropriate solution to
the special need users.
8.1.2Simple calculation alternative:
This alternative could be very simple for the users. It will provide simple calculation like 1+3= or 6+2= and
asks the user to enter the result then, validate the user's result. The calculations provided are changed each
time. The most important advantage of this alternative is providing calculation that every user with different
abilities can pass it.
Simple calculation alternative is used by many web sites instead of CAPTCHA and it can prevent bots
completely, it provides the same protection CAPTCHA does.
In the CAPTCHA alternative survey 45.95% of responders think Simple calculation alternative is an excellent
alternative.
8.1.3Problem solution or general information question alternative:
This alternative will provide simple question that all users can answer it. Simply, it is asking the users to enter
the answer. These questions can be general information like "in morning, what is the sea colour?" or problem
like "my name is Amal and my only brother's name is Badr Abdulrahman what is my full name?", since these
questions are written in specific language, not all users can understand it. Also, the questions provided may
have several correct solutions or may the users can not answer it especially for the problems. Moreover, the
question list should update each period of time to have more secure. On the other hand, these questions can
be accessible for all users and can stand only without CAPTCHA.
In the CAPTCHA alternative survey 9.91% of responders think Problem solution or general information
question alternative is an excellent alternative.
8.1.4Complete sentence with a multiple choice word alternative:
In this alternative a question with multiple choices is provided then it asks the user to choose the correct or
appropriate word. The question could be like this "I am reading a _________" and the choices are "shoes,
Book, bread". This alternative could be accessible for all users with different abilities. This alternative faces
the same problems that Problem solution or general information question alternative face, because it written
in a specific language, but here the choices make answering questions easier. Moreover, this alternative can
work without CAPTCHA.
In the CAPTCHA alternative survey 17.12% of responders think complete sentence with a multiple choice
word alternative is an excellent alternative.
8.1.5Letters or numbers arrangement or spell letter in specific order:
In this alternative questions like "write the letters in reverse order" or "arrange the letters in alphabetical
order". This alternative will give a complete accessibility for the normal users and special needs users, but
the letters or numbers provided will belong to a specific language which not all users know it.
In the CAPTCHA alternative survey 18.92% of responders think Letters or numbers arrangement or spell
letter in specific order alternative is an excellent alternative.
8.1.6Manual alternative:
Some web sites provide manual alternatives for the users who can not pass CAPTCHA. Manual alternative
can be for example "for more support call the number" or "for more support visit us". This alternative solves
the CAPTCHA inaccessible problem but it makes special needs users spend more effort to get some services
that normal users can get via the web sites. In fact, special needs users need more facilities than normal
users to get benefit from the services.
8.2Alternatives that don't require any input from the user:
There are a lot of programs that prevent bots from accessing the websites. Some of these programs offer a
great protection from bots. In addition, there is no need to examine humans which offer a perfect accessibility
for all the users. I think if these programs improved to give a full protection CAPTCHA will disappear.
In the CAPTCHA alternative survey 43.24% of responders think, if the alternative will be a program doesn't
require an input from user this will be an excellent solution.
9.FUTURE OF CAPTCHA:
A lot of CAPTCHA alternatives are provided. Therefore, a lot of web sites replace the CAPTCHA with some
alternative. Most of CAPTCHA alternatives are accessible and easy to use. In the CAPTCHA accessibility
survey 23.73% of responders said they are not using CAPTCHA in their web site. CAPTCHA could disappear
in the coming few years because it is inaccessible and facing a hard challenge with accessible alternatives.
We hope CAPTCHA creator find some way to make CAPTCHA accessible. In fact, I think CAPTCHA will
disappear before they can make it accessible.
10.CONCLUSION:
In conclusion, browsing internet is not for the normal users only. The special needs user need the internet like
the normal user or may be more. The web sites are provided to help users, give them more information or
make their work easier, but CAPTCHA makes reaching the web sites services more complicated. At last, we
need some way to tell CAPTCHA that special needs users are human not bots.
11.REFERENCES
[1] Wikipedia, the free encyclopaedia Located on the Internet at http://en.wikipedia.org/wiki/Captcha. Last
visited: 9 December, 2008.
[2] Web cheat sheet Located on the Internet at
http://www.webcheatsheet.com/PHP/create_captcha_protection.php. Last visited: 10 December, 2008.
[3] The Official CAPTCHA site located on the internet at http://www.captcha.net/. Last visited: 10 December,
2008.