Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything

Zou, Xiaotian; Li, Ke; Chen, Yongkang

Computer Science > Cryptography and Security

arXiv:2407.02534 (cs)

[Submitted on 1 Jul 2024 (v1), last revised 26 Aug 2024 (this version, v2)]

Title:Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything

Authors:Xiaotian Zou, Ke Li, Yongkang Chen

View PDF HTML (experimental)

Abstract:Large Visual Language Model\textbfs (VLMs) such as GPT-4V have achieved remarkable success in generating comprehensive and nuanced responses. Researchers have proposed various benchmarks for evaluating the capabilities of VLMs. With the integration of visual and text inputs in VLMs, new security issues emerge, as malicious attackers can exploit multiple modalities to achieve their objectives. This has led to increasing attention on the vulnerabilities of VLMs to jailbreak. Most existing research focuses on generating adversarial images or nonsensical image to jailbreak these models. However, no researchers evaluate whether logic understanding capabilities of VLMs in flowchart can influence jailbreak. Therefore, to fill this gap, this paper first introduces a novel dataset Flow-JD specifically designed to evaluate the logic-based flowchart jailbreak capabilities of VLMs. We conduct an extensive evaluation on GPT-4o, GPT-4V, other 5 SOTA open source VLMs and the jailbreak rate is up to 92.8%. Our research reveals significant vulnerabilities in current VLMs concerning image-to-text jailbreak and these findings underscore the the urgency for the development of robust and effective future defenses.

Subjects:	Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.02534 [cs.CR]
	(or arXiv:2407.02534v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2407.02534

Submission history

From: Xiaotian Zou [view email]
[v1] Mon, 1 Jul 2024 16:58:55 UTC (1,078 KB)
[v2] Mon, 26 Aug 2024 22:56:28 UTC (1,169 KB)

Computer Science > Cryptography and Security

Title:Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators