Abstract
CAPTCHA is an effective mechanism for protecting computers from malicious bots. With the development of deep learning techniques, current mainstream text-based CAPTCHAs have been proven to be insecure. Therefore, a major effort has been directed toward developing image-based CAPTCHAs, and image-based visual reasoning is emerging as a new direction of such development. Recently, Tencent deployed the Visual Turing Test (VTT) CAPTCHA. This appears to have been the first application of a visual reasoning scheme. Subsequently, other CAPTCHA service providers (Geetest, NetEase, Dingxiang, etc.) have proposed their own visual reasoning schemes to defend against bots. It is, therefore, natural to ask a fundamental question: are visual reasoning CAPTCHAs as secure as their designers expect? This paper presents the first attempt to solve visual reasoning CAPTCHAs. We implemented a holistic attack and a modular attack, which achieved overall success rates of 67.3% and 88.0% on VTT CAPTCHA, respectively. The results show that visual reasoning CAPTCHAs are not as secure as anticipated; this latest effort to use novel, hard AI problems for CAPTCHAs has not yet succeeded. Based on the lessons we learned from our attacks, we also offer some guidelines for designing visual CAPTCHAs with better security.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 30th USENIX Security Symposium |
| Place of Publication | [S.I.] |
| Pages | 3291-3308 |
| Number of pages | 18 |
| ISBN (Electronic) | 9781939133243 |
| Publication status | Published - 13 Aug 2021 |
| Event | 30th USENIX Security Symposium, USENIX Security 2021 - Virtual, Online Duration: 11 Aug 2021 → 13 Aug 2021 |
Publication series
| Name | Proceedings of the 30th USENIX Security Symposium |
|---|
Conference
| Conference | 30th USENIX Security Symposium, USENIX Security 2021 |
|---|---|
| City | Virtual, Online |
| Period | 11/08/21 → 13/08/21 |
Funding
We would like to thank our shepherd David Freeman and the anonymous reviewers for their valuable suggestions for improving this paper. This paper was supported by the Natural Science Foundation of China under Grant 61972306 and sponsored by Zhejiang Lab (No. 2021KD0AB03).
Keywords
- visual reasoning
- cybersecurity
- CAPTCHA