Live Trojan Attacks on Deep Neural Networks

Costales, Robby; Mao, Chengzhi; Norwitz, Raphael; Kim, Bryan; Yang, Junfeng

Computer Science > Cryptography and Security

arXiv:2004.11370 (cs)

[Submitted on 22 Apr 2020 (v1), last revised 27 May 2020 (this version, v2)]

Title:Live Trojan Attacks on Deep Neural Networks

Authors:Robby Costales, Chengzhi Mao, Raphael Norwitz, Bryan Kim, Junfeng Yang

View PDF

Abstract:Like all software systems, the execution of deep learning models is dictated in part by logic represented as data in memory. For decades, attackers have exploited traditional software programs by manipulating this data. We propose a live attack on deep learning systems that patches model parameters in memory to achieve predefined malicious behavior on a certain set of inputs. By minimizing the size and number of these patches, the attacker can reduce the amount of network communication and memory overwrites, with minimal risk of system malfunctions or other detectable side effects. We demonstrate the feasibility of this attack by computing efficient patches on multiple deep learning models. We show that the desired trojan behavior can be induced with a few small patches and with limited access to training data. We describe the details of how this attack is carried out on real systems and provide sample code for patching TensorFlow model parameters in Windows and in Linux. Lastly, we present a technique for effectively manipulating entropy on perturbed inputs to bypass STRIP, a state-of-the-art run-time trojan detection technique.

Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2004.11370 [cs.CR]
	(or arXiv:2004.11370v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2004.11370

Submission history

From: Robby Costales [view email]
[v1] Wed, 22 Apr 2020 22:08:29 UTC (384 KB)
[v2] Wed, 27 May 2020 21:21:46 UTC (559 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CR

< prev | next >

new | recent | 2020-04

Change to browse by:

cs
cs.LG
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Chengzhi Mao
Junfeng Yang

export BibTeX citation

Computer Science > Cryptography and Security

Title:Live Trojan Attacks on Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Live Trojan Attacks on Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators