Attribute Obfuscation with Gradient Reversal

Chris Emmery, Enrique Manjavacas, Grzegorz Chrupala

Research output: Contribution to conferenceAbstractOther research output

Abstract

Recent advances in computational stylometry have demonstrated that automatically inferring quite an extensive set of personal attributes from text alone (e.g. gender, age, education, socio-economic status, mental health issues) is not only feasible, but can often rely on little supervision. This application opens up potential for both industry and academia to uncover 'hidden' demographics for a large volume of social media accounts. It can be safely assumed that the majority of users of these media are not aware the latent information they are sharing, creating a false sense of privacy that can be easily abused by third parties. Even if they we aware, they would have no countermeasures at their disposal other than self-censorship. One of the proposed computational methods for assisting users in guarding particular attributes is that of author and/or attribute obfuscation, where the goal is to rewrite a particular text in such a way that a classifier trained on detecting an author (or its attributes) is fooled. Most of the work on this topic has focused on rule-based perturbations on text input, demonstrating only minor gains. Our proposal is to use a text encoder-decoder model which learns intermediate representations which are invariant to the protected attributes, and which -- thanks to this property -- is able to rewrite user text in a way which largely preserves its meaning, but which conceals user identity and/or attributes.
Original languageEnglish
Publication statusPublished - Jan 2018
EventComputational Linguistics in the Netherlands - Nijmegen, Netherlands
Duration: 26 Jan 2018 → …

Conference

ConferenceComputational Linguistics in the Netherlands
CountryNetherlands
CityNijmegen
Period26/01/18 → …

Fingerprint

Computational methods
Classifiers
Education
Health
Economics
Industry

Cite this

Emmery, C., Manjavacas, E., & Chrupala, G. (2018). Attribute Obfuscation with Gradient Reversal. Abstract from Computational Linguistics in the Netherlands, Nijmegen, Netherlands.
Emmery, Chris ; Manjavacas, Enrique ; Chrupala, Grzegorz. / Attribute Obfuscation with Gradient Reversal. Abstract from Computational Linguistics in the Netherlands, Nijmegen, Netherlands.
@conference{709a7b1dfe98477ebbeab264b0836c7f,
title = "Attribute Obfuscation with Gradient Reversal",
abstract = "Recent advances in computational stylometry have demonstrated that automatically inferring quite an extensive set of personal attributes from text alone (e.g. gender, age, education, socio-economic status, mental health issues) is not only feasible, but can often rely on little supervision. This application opens up potential for both industry and academia to uncover 'hidden' demographics for a large volume of social media accounts. It can be safely assumed that the majority of users of these media are not aware the latent information they are sharing, creating a false sense of privacy that can be easily abused by third parties. Even if they we aware, they would have no countermeasures at their disposal other than self-censorship. One of the proposed computational methods for assisting users in guarding particular attributes is that of author and/or attribute obfuscation, where the goal is to rewrite a particular text in such a way that a classifier trained on detecting an author (or its attributes) is fooled. Most of the work on this topic has focused on rule-based perturbations on text input, demonstrating only minor gains. Our proposal is to use a text encoder-decoder model which learns intermediate representations which are invariant to the protected attributes, and which -- thanks to this property -- is able to rewrite user text in a way which largely preserves its meaning, but which conceals user identity and/or attributes.",
author = "Chris Emmery and Enrique Manjavacas and Grzegorz Chrupala",
year = "2018",
month = "1",
language = "English",
note = "Computational Linguistics in the Netherlands ; Conference date: 26-01-2018",

}

Emmery, C, Manjavacas, E & Chrupala, G 2018, 'Attribute Obfuscation with Gradient Reversal' Computational Linguistics in the Netherlands, Nijmegen, Netherlands, 26/01/18, .

Attribute Obfuscation with Gradient Reversal. / Emmery, Chris; Manjavacas, Enrique; Chrupala, Grzegorz.

2018. Abstract from Computational Linguistics in the Netherlands, Nijmegen, Netherlands.

Research output: Contribution to conferenceAbstractOther research output

TY - CONF

T1 - Attribute Obfuscation with Gradient Reversal

AU - Emmery, Chris

AU - Manjavacas, Enrique

AU - Chrupala, Grzegorz

PY - 2018/1

Y1 - 2018/1

N2 - Recent advances in computational stylometry have demonstrated that automatically inferring quite an extensive set of personal attributes from text alone (e.g. gender, age, education, socio-economic status, mental health issues) is not only feasible, but can often rely on little supervision. This application opens up potential for both industry and academia to uncover 'hidden' demographics for a large volume of social media accounts. It can be safely assumed that the majority of users of these media are not aware the latent information they are sharing, creating a false sense of privacy that can be easily abused by third parties. Even if they we aware, they would have no countermeasures at their disposal other than self-censorship. One of the proposed computational methods for assisting users in guarding particular attributes is that of author and/or attribute obfuscation, where the goal is to rewrite a particular text in such a way that a classifier trained on detecting an author (or its attributes) is fooled. Most of the work on this topic has focused on rule-based perturbations on text input, demonstrating only minor gains. Our proposal is to use a text encoder-decoder model which learns intermediate representations which are invariant to the protected attributes, and which -- thanks to this property -- is able to rewrite user text in a way which largely preserves its meaning, but which conceals user identity and/or attributes.

AB - Recent advances in computational stylometry have demonstrated that automatically inferring quite an extensive set of personal attributes from text alone (e.g. gender, age, education, socio-economic status, mental health issues) is not only feasible, but can often rely on little supervision. This application opens up potential for both industry and academia to uncover 'hidden' demographics for a large volume of social media accounts. It can be safely assumed that the majority of users of these media are not aware the latent information they are sharing, creating a false sense of privacy that can be easily abused by third parties. Even if they we aware, they would have no countermeasures at their disposal other than self-censorship. One of the proposed computational methods for assisting users in guarding particular attributes is that of author and/or attribute obfuscation, where the goal is to rewrite a particular text in such a way that a classifier trained on detecting an author (or its attributes) is fooled. Most of the work on this topic has focused on rule-based perturbations on text input, demonstrating only minor gains. Our proposal is to use a text encoder-decoder model which learns intermediate representations which are invariant to the protected attributes, and which -- thanks to this property -- is able to rewrite user text in a way which largely preserves its meaning, but which conceals user identity and/or attributes.

M3 - Abstract

ER -

Emmery C, Manjavacas E, Chrupala G. Attribute Obfuscation with Gradient Reversal. 2018. Abstract from Computational Linguistics in the Netherlands, Nijmegen, Netherlands.