Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased

Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased
Proximities in Word Embeddings

Vaibhav Kumar1∗ Tenzin Singhay Bhotia1∗ Vaibhav Kumar1∗ Tanmoy Chakraborty2

1Delhi Technological University, New Delhi, India
2IIIT-Delhi, India
1{kumar.vaibhav1o1, tenzinbhotia0, vaibhavk992}@gmail.com
2tanmoy@iiitd.ac.in

Abstracto

Word embeddings are the standard model for
semantic and syntactic representations of words.
Desafortunadamente, these models have been shown
to exhibit undesirable word associations result-
ing from gender, racial, and religious biases.
Existing post-processing methods for debias-
ing word embeddings are unable to mitigate
gender bias hidden in the spatial arrangement
of word vectors. en este documento, we propose
RAN-Debias, a novel gender debiasing meth-
odology that not only eliminates the bias pre-
sent
in a word vector but also alters the
spatial distribution of its neighboring vectors,
achieving a bias-free setting while maintaining
minimal semantic offset. We also propose a
new bias evaluation metric, Gender-based
Illicit Proximity Estimate (GIPE), cual
measures the extent of undue proximity in
word vectors resulting from the presence of
gender-based predilections. Experiments based
on a suite of evaluation metrics show that
RAN-Debias significantly outperforms the
state-of-the-art
in reducing proximity bias
(GIPE) by at least 42.02%. It also reduces
direct bias, adding minimal semantic distur-
bance, and achieves the best performance in
a downstream application task (correferencia
resolution).

1 Introducción

Word embedding methods (Devlin et al., 2019;
Mikolov et al., 2013a; Pennington et al., 2014)
have been staggeringly successful in mapping the
semantic space of words to a space of real-valued
vectores, capturing both semantic and syntactic

∗Authors have contributed equally.

486

relaciones. Sin embargo, as recent research has
mostrado, word embeddings also possess a spectrum
of biases related to gender (Bolukbasi et al., 2016;
Hoyle et al., 2019), carrera, and religion (Manzini
et al., 2019; Otterbacher et al., 2017). Bolukbasi
et al. (2016) showed that there is a disparity in
the association of professions with gender. Para
instancia, while women are associated more
closely with ‘‘receptionist’’ and ‘‘nurse’’, hombres
are associated more closely with ‘‘doctor’’ and
‘‘engineer’’. Similarmente, a word embedding model
trained on data from a popular social media plat-
form generates analogies such as ‘‘Muslim is
to terrorist as Christian is to civilian’’ (Manzini
et al., 2019). Por lo tanto, given the large scale use of
word embeddings, it becomes cardinal to remove
the manifestation of biases. En este trabajo, we focus
on mitigating gender bias from pre-trained word
embeddings.

As shown in Table 1,

the high degree of
similarity between gender-biased words largely
results from their individual proclivity towards
a particular notion (gender in this case) bastante
than from empirical utility; we refer to such
proximities as ‘‘illicit proximities’’. Existing
debiasing methods
(Bolukbasi et al., 2016;
Kaneko and Bollegala, 2019) are primarily
concerned with debiasing a word vector by mini-
mising its projection on the gender direction.
Although they successfully mitigate direct bias for
a word, they tend to ignore the relationship bet-
ween a gender-neutral word vector and its
neighbors, thus failing to remove the gender bias
encoded as illicit proximities between words
(Gonen and Goldberg, 2019; Williams et al.,
2019). For the sake of brevity, we refer to ‘‘gender-
based illicit proximities’’ as ‘‘illicit proximities’’
in the rest of the paper.

Transacciones de la Asociación de Lingüística Computacional, volumen. 8, páginas. 486–503, 2020. https://doi.org/10.1162/tacl a 00327
Editor de acciones: Xiaodong He. Lote de envío: 1/2020; Lote de revisión: 5/2020; Publicado 8/2020.
C(cid:13) 2020 Asociación de Lingüística Computacional. Distribuido bajo CC-BY 4.0 licencia.

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Word
nurse
receptionist
prostitute
schoolteacher

Neighbors

mother12, woman24, filipina31
housekeeper9, hairdresser15, prostitute69
housekeeper19, hairdresser41, babysitter44
homemaker2, housewife4, waitress8

Mesa 1: Words and their neighbors extracted using GloVe (Pennington et al.,
2014). Subscript indicates the rank of the neighbor.

To account for these problems, we propose a
post-processing based debiasing scheme for non-
contextual word embeddings, called RAN-Debias
(Repulsion, Attraction, and Neutralization based
Debiasing). RAN-Debias not only minimizes
the projection of gender-biased word vectors
on the gender direction but also reduces the
semantic similarity with neighboring word vectors
having illicit proximities. We also propose
KBC (Knowledge Based Classifier), a word
classification algorithm for selecting the set
to be debiased. KBC utilizes a
of words
set of existing lexical knowledge bases
a
maximize classification accuracy. Además,
we propose a metric, Gender-based Illicit
Proximity Estimate (GIPE), which quantifies
gender bias in the embedding space resulting from
the presence of illicit proximities between word
vectores.

the gender

We evaluate debiasing efficacy on various
evaluation metrics. Para
relational
analogy test on the SemBias dataset (Zhao et al.,
2018b), RAN-GloVe (RAN-Debias applied to
GloVe word embedding) outperforms the next best
baseline GN-GloVe (debiasing method proposed
by Zhao et al. [2018b]) por 21.4% in gender-
stereotype type. RAN-Debias also outperforms the
best baseline by at least 42.02% in terms of GIPE.
Además, the performance of RAN-GloVe on
word similarity and analogy tasks on a number
of benchmark datasets indicates the addition of
minimal semantic disturbance. En breve, our major
contributions1 can be summarized as follows:

• We provide a knowledge-based method
(KBC) for classifying words to be debiased.

• We introduce RAN-Debias, a novel approach
to reduce both direct and gender-based
proximity biases in word embeddings.

1The code and data are released at https://github.

com/TimeTraveller-San/RAN-Debias.

• We propose GIPE, a novel metric to measure
the extent of undue proximities in word
embeddings.

2 Trabajo relacionado

2.1 Gender Bias in Word Embedding Models

Caliskan et al. (2017) highlighted that human-
like semantic biases are reflected through word
embeddings (such as GloVe [Pennington et al.,
2014]) of ordinary language. They also introduced
the Word Embedding Association Test (WEAT)
for measuring bias in word embeddings. El
authors showed a strong presence of biases in
pre-trained word vectors. In addition to gender,
they also identified bias related to race. Para
instancia, European-American names are more
associated with pleasant terms as compared to
African-American names.

In the following subsections, discutimos
existing gender debiasing methods based on
their mode of operation. Methods that operate
on pre-trained word embeddings are known as
post-processing methods, and those which aim
to retrain word embeddings by either introducing
corpus-level changes or modifying the training
objective are known as learning-based methods.

2.2 Debiasing Methods (Post-processing)

Bolukbasi et al. (2016) extensively studied gender
bias in word embeddings and proposed two
debiasing strategies—‘‘hard debias’’ and ‘‘soft
debias’’. Hard debias algorithm first determines
the direction that captures the gender information
in the word embedding space using the difference
vectores (p.ej., ~he − ~she). It then transforms each
él
word vector ~w to be debiased such that
becomes perpendicular to the gender direction
(neutralization). Más, for a given set of word
pares (equalization set), it modifies each pair such
that ~w becomes equidistant to each word in the pair
(equalization). Por otro lado, the soft debias

487

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

algorithm applies a linear transformation to word
vectores, which preserves pairwise inner products
among all the word vectors while limiting the
projection of gender-neutral words on the gender
direction. The authors showed that the former
performs better for debiasing than the latter.
Sin embargo,
to determine the set of words for
debiasing, a support vector machine (SVM)
classifier is used, which is trained on a small
set of seed words. This makes the accuracy of the
approach highly dependent on the generalization
of the classifier to all remaining words in the
vocabulary.

Kaneko and Bollegala (2019) proposed a post-
processing step in which the given vocabulary
is split
into four classes—non-discriminative
female-biased words (p.ej., ‘‘bikini’’, ‘‘lipstick’’),
non-discriminative male-biased words
(p.ej.,
‘‘beard’’, ‘‘moustache’’), gender-neutral words
(p.ej., ‘‘meal’’, ‘‘memory’’), and stereotypical
palabras (p.ej., ‘‘librarian’’, ‘‘doctor’’). A set of
seed words is then used for each of the categories
to train an embedding using an encoder in a
denoising autoencoder, such that gender-related
biases from stereotypical words are removed,
while preserving feminine information for non-
discriminative female-biased words, masculine
information for non-discriminative male-biased
palabras, and neutrality of the gender-neutral words.
The use of the correct set of seed words is
critical for the approach. Además, inappropriate
associations between words (such as ‘‘nurse’’ and
‘‘receptionist’’) may persist.

Gonen and Goldberg (2019) showed that current
approaches (Bolukbasi et al., 2016; Zhao et al.,
2018b), which depend on gender direction for
the definition of gender bias and directly target
it for the mitigation process, end up hiding the
bias rather than reduce it. The relative spatial
distribution of word vectors before and after
debiasing is similar, and bias-related information
can still be recovered.

Ethayarajh et al. (2019) provided theoretical
proof for hard debias (Bolukbasi et al., 2016)
and discussed the theoretical flaws in WEAT
by showing that it systematically overestimates
gender bias in word embeddings. Los autores
presented an alternate gender bias measure, called
RIPA (Relational Inner Product Association), eso
quantifies gender bias using gender direction.
Más, they illustrated that vocabulary selection

for gender debiasing is as crucial as the debiasing
procedimiento.

Zhou et al. (2019) investigated the presence
of gender bias in bilingual word embeddings and
languages which have grammatical gender (semejante
as Spanish and French). Más, they defined
semantic gender direction and grammatical gender
direction used for quantifying and mitigating
gender bias. en este documento, we only focus on
languages that have non-gendered grammar (p.ej.,
Inglés). Our method can be applied to any such
idioma.

2.3 Debiasing Methods (Learning-based)

Zhao et al. (2018b) developed a word vector
training approach, called Gender-Neutral Global
Vectors (GN-GloVe) based on the modification
of GloVe. They proposed a modified objective
function that aims to confine gender-related
information to a sub-vector. During the optimi-
proceso de zación, the objective function of GloVe is
minimized while simultaneously, the square of
Euclidean distance between the gender-related
sub-vectors is maximized. Más, it is empha-
sized that the representation of gender-neutral
words is perpendicular to the gender direction.
Being a retraining approach, this method cannot
be used on pre-trained word embeddings.

Lu et al. (2018) proposed a counterfactual data-
augmentation (CDA) approach to show that
gender bias in language modeling and coreference
resolution can be mitigated through balancing the
corpus by exchanging gender pairs like ‘‘she’’
and ‘‘he’’ or ‘‘mother’’ and ‘‘father’’. Similarmente,
Hall Maudslay et al. (2019) proposed a learning-
a
based approach with two enhancements
CDA—a counterfactual data substitution method
which makes substitutions with a probability of
0.5 and a method for processing first names based
upon bipartite graph matching.

Bordia and Bowman (2019) proposed a gender-
bias reduction method for word-level language
modelos. They introduced a regularization term
that penalizes the projection of word embeddings
on the gender direction. Más, they proposed
metrics to measure bias at embedding and corpus
nivel. Their study revealed considerable gender
bias in Penn Treebank (Marcus et al., 1993) y
WikiText-2 (Merity et al., 2018).

2.4 Word Embeddings Specialization

Mrkˇsi´c et al. (2017) defined semantic special-
ization as the process of refining word vectors

488

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

to improve the semantic content. Similar to the
debiasing procedures, semantic specialization
procedures can also be divided into post-
Procesando (Ono et al., 2015; Faruqui and Dyer,
2014) and learning-based (Rothe and Sch¨utze,
2015; Mrkˇsi´c et al., 2016; Nguyen et al., 2016)
approaches. The performance of post-processing
based approaches is shown to be better than
learning-based approaches (Mrkˇsi´c et al., 2017).

Similar to the ‘‘repulsion’’ and ‘‘attraction’’
terminologies used in RAN-Debias, Mrkˇsi´c
et al. (2017) defined ATTRACT-REPEL algorithm, a
post-processing semantic specialization process
which uses antonymy and synonymy constraints
drawn from lexical resources. aunque
es
superficially similar to RAN-Debias, there are a
number of differences between the two ap-
se acerca. En primer lugar, the ATTRACT-REPEL algorithm
operates over mini-batches of synonym and
antonym pairs, while RAN-Debias operates on
a set containing gender-neutral and gender-biased
palabras. En segundo lugar, the ‘‘attract’’ and ‘‘repel’’ terms
carry different meanings with respect
to the
algoritmos. In ATTRACT-REPEL, for each of the pairs
in the mini-batches of synonyms and antonyms,
negative examples are chosen. The algorithm then
forces synonymous pairs to be closer to each other
(attract) than from their negative examples and
antonymous pairs further away from each other
(repel) than from their negative examples. On
the other hand, for a given word vector, RAN-
Debias forces it away from its neighboring word
vectores (repel) which have a high indirect bias
while simultaneously forcing the post-processed
word vector and the original word vector together
(attract) to preserve its semantic properties.

3 Proposed Approach

i}|V |

i=1 → { ~w′

Given a set of pre-trained word vectors { ~wi}|V |
yo=1
over a vocabulary set V, we aim to create a
transformación { ~wi}|V |
i=1 such that
the stereotypical gender information present in
the resulting embedding set are minimized with
minimal semantic offset. We first define the
categories into which each word w ∈ V is
classified in a mutually exclusive manner. Mesa 2
summarizes important notations used throughout
the paper.

• Preserve set (Vp): This set consists of
words for which gender carries semantic

489

Denotation

Notation
~w
~wd
V
Vp

Vector corresponding to a word w
Debiased version of ~w
Vocabulary set
The set of words which are preserved
during the debiasing procedure
The set of words which are subjected
to the debiasing procedure
Set of dictionaries
A particular dictionary from the set D
Gender direction
Db( ~w) Direct bias of a word w.

D
di
~g

Vd

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

b( ~w1, ~w2) Indirect bias between a pair of words

w1 and w2.

η( ~w) Gender-based proximity bias

of a word w
Set of neighboring words of a word w

Nw

Fr( ~wd) Repulsion objective function
Fa( ~wd) Attraction objective function
Fn( ~wd) Neutralization objective function
F ( ~wd) Multi-objective optimization function
KBC Knowledge Based Classifier
BBN Bias Based Network
GIP E Gender-based Illicit Proximity

Estimate

Mesa 2: Important notations and denotations.

importance; such as names, gendered pro-
nouns and words like ‘‘beard’’ and ‘‘bikini’’
that have a meaning closely associated with
género. Además, words that are non-
alphabetic are also included as debiasing
them will be of no practical utility.

• Debias set (Vd): This set consists of all the
words in the vocabulary that are not pre-
sent in Vp. These words are expected to be
gender-neutral in nature and hence subjected
to debiasing procedure. Note that Vd not
only consists of gender-stereotypical words
(‘‘nurse’’, ‘‘warrior’’, ‘‘receptionist’’, etc.),
(‘‘sky’’,
but also gender-neutral words
‘‘table’’, ‘‘keyboard’’, etc.).

3.1 Word Classification Methodology

Prior to the explanation of our method, we present
the limitations of previous approaches for word
clasificación. Bolukbasi et al. (2016) trained a
linear SVM using a set of gender-specific seed
palabras, which is then generalized on the whole
embedding set to identify other gender-specific

Método
SVM
RIPA
KBC

Prec
97.20
60.60
89.65

Rec
59.37
53.40
81.25

F1
73.72
56.79
85.24

AUC-ROC
83.98
59.51
86.25

Acc
78.83
59.35
85.93

Mesa 3: Comparison between our proposed method (KBC), RIPA- (Ethayarajh et al.,
2019), and SVM- (Bolukbasi et al., 2016) based word classification methods via
precisión (Prec), recordar (Rec), F1-score (F1), AUC-ROC, and accuracy (Acc).

palabras. Sin embargo, such methods rely on training a
supervised classifier on word vectors, cuales son
themselves gender-biased. Because such classi-
fiers are trained on biased data, they catch onto
the underlying gender-bias cues and often mis-
classify words. Por ejemplo, the SVM classifier
trained by Bolukbasi et al. (2016) misclassifies
the word ‘‘blondes’’ as gender-specific, entre
otros. Más, we empirically show the inabil-
ity of a supervised classifier (SVM) to generalize
over the whole embedding using various metrics
en mesa 3.

Taking into consideration this limitation, nosotros
propose the Knowledge Based Classifier (KBC)
that relies on knowledge bases instead of word
embeddings, thereby circumventing the addition
of bias in the classification procedure. Además,
unlike RIPA (Ethayarajh et al., 2019), our ap-
proach does not rely on creating a biased direction
that may be difficult to determine. Esencialmente,
KBC relies on the following assumption.

Assumption 1 If there exists a dictionary d such
that it stores a definition d[w] corresponding to a
word w, then w can be defined as gender-specific
or not based on the existence or absence of a
gender-specific reference s ∈ seed in the defini-
tion d[w], where the set seed consists of gender-
specific references such as {‘‘man’’, ‘‘woman’’,
‘‘boy’’, ‘‘girl’’}.

Algoritmo 1 formally explains KBC. We denote
each if condition as a stage and explain it below:

• Stage 1: This stage classifies all stop words
and non-alphabetic words as Vp. Debiasing
such words serve no practical utility; hence
we preserve them.

• Stage 2: This stage classifies all words
that belong to either names set or seed set
as Vp. Set names is collected from open

Algoritmo 1: Knowledge Based Classifier
(KBC)
Input

: V : vocabulary set, isnonaphabetic(w):

checks for non-alphabetic words
seed: set of gender-specific words
stw: set of stop words
names: set of gender-specific names
D: set of dictionaries, where for a
dictionary di ∈ D, di[w] represents the
definition of a word w.

Output: Vp: set of words that will be preserved,

Vd: set of words that will be debiased

1 Vp = {}, Vd = {}
2 for w ∈ V do
3

if w ∈ stw or isnonalphabetic(w) entonces

4

5

6

7

8

Vp ← Vp ∪ {w}

else if w ∈ names ∪ seed then

Vp ← Vp ∪ {w}
else if |{di : di ∈ D &

w ∈ di & ∃s : s ∈ seed ∩ di[w]}| > |D|/2
entonces

Vp ← Vp ∪ {w}

9 Vd ← Vd ∪ {w : w ∈ V \ Vp}
10 return Vp, Vd

source knowledge base.2 Set seed consists of
gender-specific reference terms. We preserve
names, as they hold important gender infor-
formación (Pilcher, 2017).

• Stage 3: This stage uses a collection of
dictionaries to determine whether a word
is gender-specific using Assumption 1. A
counter the effect of biased definitions arising
from any particular dictionary, we make a
decision based upon the consensus of all
dictionaries. A word is classified as gender-
specific and added to Vp if and only if
more than half of the dictionaries classify
it as gender-specific. En nuestros experimentos,
we employ WordNet (Molinero, 1995) y

2https://github.com/ganoninc/fb-

gender-json.

490

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

the Oxford dictionary. As pointed out by
Bolukbasi et al. (2016), WordNet consists of
few definitions that are gender-biased such
as the definition of ‘‘vest’’; por lo tanto, por
utilizing our approach, we counter such cases
as the final decision is based upon consensus.

due to gender-based constructs. For a given
word wi ∈ Vd, the gender-based proximity
bias ηwi is defined as:

ηwi =

|N b
Wisconsin|
|Nwi|

(1)

The remaining words that are not preserved by
KBC are categorized into Vd. It is the set of words
that are debiased by RAN-Debias later.

3.2 Types of Gender Bias

Primero, we briefly explain two types of gender bias
as defined by Bolukbasi et al. (2016) y luego
introduce a new type of gender bias resulting from
illicit proximities in word embedding space.

• Direct Bias (Db): For a word w, the direct

bias is defined by

Db( ~w, ~g) = |porque( ~w, ~g)|C

dónde, ~g is the gender direction measured by
taking the first principal component from the
ten gender
principal component analysis of
pair difference vectors, como ( ~he − ~she) como
mentioned in (Bolukbasi et al., 2016), and c
represents the strictness of measuring bias.

• Indirect Bias (b): The indirect bias between
a given pair of words w and v is defined by

b( ~w, ~v) =

( ~w.~v − cos( ~w⊥, ~v⊥))
~w.~v

Aquí, ~w and ~v are normalized. ~w⊥ is orthogonal
to the gender direction ~g: ~w⊥ = ~w − ~wg, and ~wg
is the contribution from gender: ~wg = ( ~w.~g)~g.
Indirect bias measures the change in the inner
product of two word vectors as a proportion of
the earlier inner product after projecting out the
gender direction from both the vectors. A higher
indirect bias between two words indicates a strong
association due to gender.

• Gender-based Proximity Bias (η): Gonen
and Goldberg (2019) observed that the exist-
ing debiasing methods are unable to com-
pletely debias word embeddings because the
relative spatial distribution of word embed-
dings after the debiasing process still encap-
sulates bias-related information. Por lo tanto,
we propose gender-based proximity bias that
aims to capture the illicit proximities arising
between a word and its closest k neighbors

dónde

Nwi = argmax
V ′:|V ′|=k

(porque( ~wi, ~wk) : wk ∈ V ′ ⊆ V ),

N b

wi = {Wisconsin : b( ~wi, ~wk) > θs, wk ∈ Nwi}, and θs

is a threshold for indirect bias.

The intuition behind this is as follows. El
set Nwi consists of the top k neighbors of wi
calculated by finding the word vectors having
the maximum cosine similarity with wi. Más,
N b
wi ⊆ Nwi is the set of neighbors having indirect
bias β greater than a threshold θs, which is a
hyperparameter that controls neighbor deselec-
tion on the basis of indirect bias. The lower is
the value of θs, the higher is the cardinality of set
N b
Wisconsin| compared to |Nwi|
indicates that the neighborhood of the word is
gender-biased.

Wisconsin. A high value of |N b

3.3 Proposed Method–RAN-Debias

We propose a multi-objective optimization based
solution to mitigate both direct3 and gender-based
proximity bias while adding minimal impact to
the semantic and analogical properties of the word
incrustar. For each word w ∈ Vd and its vector
~w ∈ Rh, where h is the embedding dimension,
we find its debiased counterpart ~wd ∈ Rh by
solving the following multi-objective optimiza-
tion problem:

argmin
~wd

(cid:0)

Fr( ~wd), Fa( ~wd), Fn( ~wd)

(2)

(cid:1)

We solve this by formulating a single objective
F ( ~wd) and scalarizing the set of objectives using
the weighted sum method as follows:

F ( ~wd) = λ1.Fr( ~wd) + λ2.Fa( ~wd) + λ3.Fn( ~wd)

such that λi ∈ [0, 1] y

λi = 1

Xi

(3)
F ( ~wd) is minimized using the Adam (Kingma and
Ba, 2015) optimized gradient descent to obtain the
optimal debiased embedding ~wd.

3Though not done explicitly, reducing direct bias also

reduces indirect bias as stated by Bolukbasi et al. (2016).

491

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

As shown in the subsequent sections, the range
of objective functions Fr, Fa, Fn (defined later) es
[0, 1]; thus we use the weights λi for determining
the relative importance of one objective function
over another.

3.3.1 Repulsion
For any word w ∈ Vd, we aim to minimize the
gender bias based illicit associations. Por lo tanto,
our objective function aims to ‘‘repel’’ ~wd from
the neighboring word vectors which have a high
value of indirect bias (b) with it. Como consecuencia,
we name it ‘‘repulsion’’ (Fr) and primarily define
the repulsion set Sr to be used in Fr as follows.

Definición 1 For a given word w, the repulsion
set Sr is defined as Sr = {ni : ni ∈ Nw and
b( ~w, ~ni) > θr}, where Nw is the set of top
100 neighbors obtained from the original word
vector ~w.

Because we aim to reduce the unwanted
semantic similarity between ~wd and the set of
vectors Sr, we define the objective function Fr as
follows.

Fr( ~wd) = 

XniǫSr

Fr( ~wd) ∈ [0, 1]

porque( ~wd, ~ni)
(cid:12)
(cid:12)
(cid:12)
(cid:12)


(cid:12)
(cid:12)
(cid:12)

(cid:12)

|Sr| ,

(cid:30)

For our experiments, we find that θr = 0.05
is the appropriate threshold to repel majority of
gender-biased neighbors.

3.3.2 Attraction
For any word w ∈ Vd, we aim to minimize the
loss of semantic and analogical properties for its
debiased counterpart ~wd. Por lo tanto, our objective
function aims to attract ~wd towards ~w in the
word embedding space. Como consecuencia, we name
it ‘‘attraction’’ (Fa) and define it as follows:

Fa( ~wd) = | porque( ~wd, ~w) − cos( ~w, ~w)|/2

= | porque( ~wd, ~w) − 1|/2, Fa( ~wd) ∈ [0, 1]

3.3.3 Neutralization
For any word w ∈ Vd, we aim to minimize its
bias towards any particular gender. Por lo tanto, el
objective function Fn represents the absolute value
of dot product of word vector ~wd with the gender
direction ~g (as defined by Bolukbasi et al., 2016).

492

Como consecuencia, we name it ‘‘neutralization’’ (Fn)
and define it as follows:

Fn( ~wd) = |porque( ~wd, ~g)|, Fn ∈ [0, 1]

3.3.4 Time Complexity of RAN-Debias
Computationally, there are two major components
of RAN-Debias:

1. Calculate neighbors for each word w ∈ Vd
and store them in a hash table. This has a
time complexity of O(n2) where n = |Vd|.

2. Debias each word using gradient descent,

whose time complexity is O(norte).

The overall complexity of RAN-Debias is O(n2),
eso es, quadratic with respect to the cardinality of
debias set Vd.

3.4 Gender-based Illicit Proximity Estimate

(GIPE)

En la sección 3.2, we defined the gender proximity
inclinación (η). En esta sección, we extend it
to the
embedding level for generating a global estimate.
Intuitivamente, an estimate can be generated by simply
taking the mean of ηw, ∀w ∈ Vd. Sin embargo, este
computation assigns equal importance to all ηw
valores, which is an oversimplification. A word
w may itself be in the proximity of another
′ ∈ Vd through gender-biased associations,
word w
thereby increasing ηw′ . Such cases in which w
increases ηw′
for other words should also be
tenido en cuenta. Por lo tanto, we use a weighted
average of ηw, ∀w ∈ V for determining a global
estimate. We first define a weighted directed
network, called Bias Based Network (BBN). El
use of a graph data structure makes it easier to
understand the intuition behind GIPE.

Definición 2 Given a set of non gender-specific
words W , bias based network is a directed graph
G = (V, mi), where nodes represent word vectors
and weights of directed edges represent the indi-
rect bias (b) between them. The vertex set V and
edge set E are obtained according to Algorithm 2.
For each word wi in W , we find N , the set
of top n word vectors having the highest cosine
similarity with ~wi (we keep n to be 100 to reduce
computational overhead without compromising on
quality). Para cada par ( ~wi, ~wk), where wk ∈ N , a
directed edge is assigned from wi to wk with the
edge weight being β( ~wi, ~wk). In case the given

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Cifra 1: (a): A sub-graph of BBN formed by Algorithm 2 for GloVe (Pennington et al., 2014) trained on
2017-January dump of Wikipedia; we discuss the structure of the graph with respect to the word ‘‘nurse’’. Nosotros
illustrate four possible scenarios with respect to their effect on GIPE, with θs = 0.05: (b) An edge with β < θs may not contribute to γi or ηwi ; (c) An outgoing edge may contribute to ηwi only; (d) An incoming edge may contribute to γi only; (e) Incoming and outgoing edges may contribute to γi and ηwi respectively. Every node pair association can be categorized as one of the aforementioned four cases. Algorithm 2: Compute BBN for the given set of word vectors Input : ξ: word embedding set, W : set of non gender-specific words, n: number of neighbors Output: G: bias based network (cos( ~xi, ~xk) : xk ∈ ξ′ ⊆ ξ) 1 V = [ ], E = [ ] 2 for xi ∈ W do 3 N = argmax ξ′:|ξ′|=n V.insert(xi) for xk ∈ N do 4 5 6 E.insert (xi, xk, β ( ~xi, ~xk)) V.insert (xk) 7 G = (V, E) 8 return G embedding is a debiased version, we use the non- debiased version of the embedding for computing β( ~wi, ~wk). Figure 1 portrays a sub-graph in BBN. By representing the set of non gender-specific words as a weighted directed graph we can use the number of outgoing and incoming edges for a node (word wi) for determining ηwi and its weight respectively, thereby leading to the formalization of GIPE as follows. 493 Definition 3 For a BBN G, the Gender-based Illicit Proximity Estimate of G, indicated by GIP E(G) is defined as: GIP E(G) = |V | i=1 γiηwi |V | i=1 γi P P where, for a word wi, ηwi is the gender-based proximity bias as defined earlier, ǫ is a (small) positive constant, and γi is the weight, defined as: γi = 1 + |{vi:(vi,wi) ∈ E,β( ~vi, ~wi)> θs}|

ǫ+|{vi:(vi,Wisconsin) ∈ E}|

(4)

The intuition behind the metric is as follows.
For a bias based network G, GIP E(GRAMO) es el
weighted average of gender-based proximity bias
(ηwi) for all nodes wi ∈ W , where the weight
of a node is γi, which signifies the importance
of the node in contributing towards the gender-
based proximity bias of other word vectors. γi
takes into account the number of incoming edges
having β higher than a threshold θs. Por lo tanto,
we take into account how the neighborhood of
a node contributes towards illicit proximities
(having high β values for outgoing edges) como
well as how a node itself contributes towards
illicit proximities of other nodes (having high β
values for incoming edges). For illustration, nosotros

analyze a sub-graph in Figure 1. By incorporating
γi, we take into account both dual (Figure 1e)
and incoming (Figure 1d) bordes, which would
not have been the case otherwise. In GloVe
(2017-January dump of Wikipedia),
la palabra
‘‘sweetheart’’ has ‘‘nurse’’ in the set of its top
100 neighbors and β > θs; sin embargo, ‘‘nurse’’
does not have ‘‘sweetheart’’ in the set of its top
100 neighbors. Por eso, while ‘‘nurse’’ contributes
towards gender-based proximity bias of the word
‘‘sweetheart’’, vice versa is not true. Similarmente,
if dual-edge exists, then both γi and ηwi are
tenido en cuenta. Por lo tanto, GIPE considers
all possible cases of edges in BBN, making it a
holistic metric.

4 Experiment Results

We conduct the following performance evaluation
pruebas:

• We

compare KBC with

SVM-based
(Bolukbasi et al., 2016) and RIPA-based
(Ethayarajh et al., 2019) methods for word
clasificación.

• We evaluate the capacity of RAN-Debias
on GloVe (aka RAN-GloVe) for the gender
relational analogy dataset–SemBias (zhao
et al., 2018b).

• We demonstrate the ability of RAN-GloVe to
mitigate gender proximity bias by computing
and contrasting the GIPE value.

• We evaluate RAN-GloVe on several bench-
mark datasets for similarity and analogy
tareas, showing that RAN-GloVe introduces
minimal semantic offset to ensure quality of
the word embeddings.

• We demonstrate that RAN-GloVe success-
fully mitigates gender bias in a downstream
applicationcoreference resolution.

y

analizar

Although we

el
informe
performance of RAN-GloVe in our experiments,
we also applied RAN-Debias to other popular non-
contextual and monolingual word embedding,
Word2vec (Mikolov et al., 2013a) to create RAN-
Word2vec. As expected, we observed similar
resultados (hence not reported for the sake of brevity),
emphasizing the generality of RAN-Debias. Nota
that the percentages mentioned in the rest of the
section are relative unless stated otherwise.

4.1 Training Data and Weights

We use GloVe (Pennington et al., 2014) entrenado
on the 2017-January dump of Wikipedia, consist-
ing of 322,636 unique word vectors of 300 dimen-
siones. We apply KBC on the vocabulary set V
obtaining Vp and Vd of size 47,912 y 274,724
respectivamente. Más,
judging upon the basis
of performance evaluation tests as discussed
arriba, we experimentally select the weights in
Ecuación 3 as λ1 = 1/8, λ2 = 6/8, and λ3 = 1/8.

4.2 Baselines for Comparisons

We compare RAN-GloVe against the following
word embedding models, each of which is trained
on the 2017-January dump of Wikipedia.

• GloVe: A pre-trained word embedding model
as mentioned earlier. This baseline represents
the non-debiased version of word embed-
dings.

• Hard-GloVe: Hard-Debias GloVe; we use
the debiasing method4 proposed by Bolukbasi
et al., 2016 on GloVe.

• GN-GloVe: Gender-neutral GloVe; we use
the original5 debiased version of GloVe re-
leased by Zhao et al. (2018b).

• GP-GloVe: Gender-preserving GloVe; nosotros
use the original6 debiased version of GloVe
released by Kaneko and Bollegala (2019).

4.3 Word Classification

We compare KBC with RIPA-based (unsuper-
vised) (Ethayarajh et al., 2019) and SVM-based
(supervised) (Bolukbasi et al., 2016) approaches
for word classification. We create a balanced
labeled test set consisting of a total of 704
palabras, con 352 words for each category—gender-
specific and non gender-specific. For the non
gender-specific category, we select all the 87 neu-
tral and biased words from the SemBias dataset
(Zhao et al., 2018b). Más, we select all 320, 40
y 60 gender-biased occupation words released
by Bolukbasi et al. (2016); Zhao et al. (2018a) y
Rudinger et al. (2018), respectivamente. After com-
bining and removing duplicate words, we obtain

4https://github.com/tolga-b/debiaswe.
5https://github.com/uclanlp/gn_GloVe.
6https://github.com/kanekomasahiro/gp_

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

debias.

494

Dataset

SemBias

Embedding Definition ↑
GloVe
Hard-GloVe
GN-GloVe
GP-GloVe
RAN-GloVe

80.2
84.1
97.7
84.3
92.8

Stereotype ↓ None ↓

10.9
6.4
1.4
8.0
1.1

8.9
9.5
0.9
7.7
6.1

Mesa 4: Comparison for the gender relational analogy test on the
SemBias dataset. () indicates that higher (más bajo) value is better.

352 non gender-specific words. For the gender-
specific category, we use a list of 222 masculino y
222 female words provided by Zhao et al. (2018b).
We use stratified sampling to under-sample 444
words into 352 words for balancing the classes.
The purpose of creating this diversely sourced
dataset is to provide a robust ground-truth for eval-
uating the efficacy of different word classification
algoritmos.

Mesa 3 shows precision,

recordar, F1-score,
AUC-ROC, and accuracy by considering gender-
specific words as the positive class and non
gender-specific words as the negative class. De este modo,
for KBC, we consider the output set Vp as the
positive and Vd as the negative class.

The SVM-based approach achieves high pre-
cision but at the cost of a low recall. A pesar de
the majority of the words classified as gender-
specific are correct, it achieves this due to the
limited coverage of the rest of gender-specific
resulting in them being classified as
palabras,
non gender-specific, thereby reducing the recall
drastically.

The RIPA approach performs fairly with respect
to precision and recall. Unlike SVM, RIPA is
not biased towards a particular class and results
in rather fair performance for both the classes.
Almost similar to SVM, KBC also correctly
classifies most of the gender-specific words but
in an exhaustive manner, thereby leading to much
fewer misclassification of gender-specific words
as non gender-specific. Como resultado, KBC achieves
sufficiently high recall.

En general, KBC outperforms the best baseline by
an improvement of 2.7% in AUC-ROC, 15.6% en
F1-score, y 9.0% in accuracy. Además,
because KBC entirely depends on knowledge
bases, the absence of a particular word in them
may result in misclassification. This could be the
reason behind the lower precision of KBC as
compared to SVM-based classification and can be

improved upon by incorporating more extensive
knowledge bases.

4.4 Gender Relational Analogy

(Definición;

To evaluate the extent of gender bias in RAN-
GloVe, we perform gender relational analogy
test on the SemBias
(Zhao et al., 2018b)
conjunto de datos. Each instance of SemBias contains four
types of word pairs: a gender-definition word
‘‘headmaster-headmistress’’),
pair
(Stereotype;
a gender-stereotype word pair
‘‘manager-secretary’’) and two other word pairs
which have similar meanings but no gender-based
relation (None; ‘‘treblebass’’). There are a
total of 440 instances in the semBias dataset,
created by the cartesian product of 20 género-
stereotype word pairs and 22 gender-definition
word pairs. From each instance, we select a
word pair (a, b) from the four word pairs such
that using the word embeddings under evaluation,
cosine similarity of the word vectors ( ~he − ~she)
y (~a − ~b) would be maximum. Mesa 4 muestra
an embedding-wise comparison on the SemBias
conjunto de datos. The accuracy is measured in terms of the
percentage of times each type of word pair is
selected as the top for various instances. RAN-
GloVe outperforms all other post-processing
debiasing methods by achieving at least 9.96% y
82.8% better accuracy in gender-definition and
gender-stereotype, respectivamente. We attribute this
performance to be an effect of superior vocabulary
selection by KBC and the neutralization objective
of RAN-Debias. KBC classifies the words to be
debiased or preserved with high accuracy, mientras
the neutralization objective function of RAN-
Debias directly minimizes the preference of a
biased word between ‘‘he’’ and ‘‘she’’; reduciendo
the gender cues that give rise to unwanted gender-
biased analogies (Mesa 10). Por lo tanto, a pesar de
RAN-GloVe achieves lower accuracy for gender-
definition type as compared to (learning-based)

495

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Input Embedding

Vd

Hd

GloVe
Hard-GloVe
GN-GloVe
GP-GloVe
RAN-GloVe
GloVe
Hard-GloVe
GN-GloVe
GP-GloVe
RAN-GloVe

θs = 0.03
0.115
0.069
0.142
0.145
0.040
0.129
0.075
0.155
0.157
0.056

GIPE
θs = 0.05
0.038
0.015
0.052
0.048
0.006
0.051
0.020
0.065
0.061
0.018

θs = 0.07
0.015
0.004
0.022
0.018
0.002
0.024
0.007
0.031
0.027
0.011

Mesa 5: GIPE (range: 0–1) for different values of θs (lower value is better).

GN-GloVe, it outperforms the next best baseline
in Stereotype by at least 21.4%.

4.5 Gender-based Illicit Proximity Estimate

GIPE analyzes the extent of undue gender
bias based proximity between word vectors. Un
embedding-wise comparison for various values of
θs is presented in Table 5. For a fair comparison,
we compute GIPE for a BBN created upon our
debias set Vd as well as for Hd, the set of words
debiased by Bolukbasi et al. (2016).

Aquí, θs represents the threshold as defined
earlier in Equation 4. As it may be inferred from
Ecuaciones 1 y 4, upon increasing the value of
θs, for a word wi, the value of both ηwi and γi
decreases, as a lesser number of words qualifies the
threshold for selection in each case. Por lo tanto, como
evident from Table 5, the value of GIPE decreases
with the increase of θs.

For the input set Vd, RAN-GloVe outperforms
the next best baseline (Hard-GloVe) by at least
42.02%. We attribute this to the inclusion of the
repulsion objective function Fr in Equation 2,
the unwanted gender-biased
which reduces
associations between words and their neighbors.
For the input set Hd, RAN-GloVe performs better
than other baselines for all values of θs except for
θs = 0.07 where it closely follows Hard-GloVe.
Además, Hd consists of many misclassified
gender-specific words, as observed from the low
recall performance at the word classification test
en la sección 4.3. Por lo tanto, the values of GIPE
corresponding to every value of θs for the input
Hd is higher as compared to the values for Vd.

Although there is a significant reduction in
GIPE value for RAN-GloVe as compared to
other word embedding models, word pairs with

noticeable β values still exist (as indicated by non-
zero GIPE values), which is due to the tradeoff
between semantic offset and bias reduction. Como un
resultado, GIPE for RAN-GloVe is not perfectly zero
but close to it.

4.6 Analogy Test

The task of analogy test is to answer the following
pregunta: ‘‘p is to q as r is to ?''. Mathematically,
it aims at finding a word vector ~ws which has the
maximum cosine similarity with ( ~wq − ~wp + ~wr).
Sin embargo, Schluter (2018) highlights some critical
issues with word analogy tests. Por ejemplo,
there is a mismatch between the distributional
hypothesis used for generating word vectors
and the word analogy hypothesis. Sin embargo,
following the practice of using word analogy
test to ascertain the semantic prowess of word
vectores, we evaluate RAN-GloVe to provide a fair
comparison with other baselines.

We use Google (Mikolov et al., 2013a) (seman-
tic [Sem] and syntactic [Syn] analogies, containing
a total 19,556 preguntas) and MSR (Mikolov
et al., 2013b) (containing a total 7,999 syntactic
preguntas) datasets for evaluating the performance
of word embeddings. We use 3COSMUL (Levy and
Goldberg, 2014) for finding ~ws.

Mesa 6(a) shows that RAN-GloVe outperforms
other baselines on the Google (Sem and Syn) datos-
set while closely following on the MSR dataset.
The improvement in performance can be attributed
to the removal of unwanted neighbors of a word
vector (having gender bias based proximity), mientras
enriching the neighborhood with those having
empirical utility, leading to a better performance
in analogy tests.

496

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Embedding

GloVe
Hard-GloVe
GN-GloVe
GP-GloVe
RAN-GloVe

(a) Analogy

(b) Semántico

Google-Sem Google-Syn MSR RG MTurk RW MEN SimLex999 AP
60.70
59.95
61.19
57.71
61.69

51.49 75.29 64.27 31.63 72.19
51.59 76.50 64.26 31.45 72.19
49.29 74.11 66.36 36.20 74.49
48.88 75.30 63.46 27.64 69.78
50.98 76.22 64.09 31.33 72.09

34.86
35.17
37.12
34.02
34.36

79.02
80.26
76.13
79.15
80.29

52.26
62.76
51.00
51.55
62.89

Mesa 6: Comparison of various embedding methods for (a) analogy tests (performance is measured
in accuracy) y (b) word semantic similarity tests (performance is measured in terms of Spearman
rank correlation).

4.7 Word Semantic Similarity Test

A word semantic similarity task is a measure of
how closely a word embedding model captures
the similarity between two words as compared to
human-annotated ratings. For a word pair, nosotros
compute the cosine similarity between the word
embeddings and its Spearman correlation with the
human ratings. The word pairs are selected from
the following benchmark datasets: RG (Ruben-
stein and Goodenough, 1965), MTurk (Radinsky
et al., 2011), RW (Luong et al., 2013), MEN (Bruni
et al., 2014), SimLex999 (Hill et al., 2015), y
AP (Almuhareb and Poesio, 2005). The results for
these tests are obtained from the word embedding
benchmark package (Jastrzebski et al., 2017).7
Note that it is not our primary aim to achieve
a state-of-the-art result in this test. It is only
considered to evaluate semantic loss. Mesa 6(b)
shows that RAN-GloVe performs better or fol-
lows closely to the best baseline. This shows
that RAN-Debias introduces minimal semantic
disturbance.

4.8 Coreference Resolution

Finalmente, we evaluate the performance of RAN-GloVe
on a downstream application task—coreference
resolution. The aim of coreference resolution is to
identify all expressions which refer to the same
entity in a given text. We evaluate the embedding
models on the OntoNotes 5.0 (Weischedel et al.,
2012) and the WinoBias (Zhao et al., 2018a)
benchmark datasets. WinoBias comprises sen-
tences constrained by two prototypical templates
(Tipo 1 and Type 2), where each template is
further divided into two subsets (PRO and ANTI).
Such a construction facilitates in revealing the

7https://github.com/kudkudak/word-

embeddings-benchmarks.

extent of gender bias present
in coreference
resolution models. Although both templates are
designed to assess the efficacy of coreference
exceedingly
resolution models, Tipo 1 es
challenging as compared to Type 2 as it has no
syntactic cues for disambiguation. Each template
consists of
two subsets for evaluation—pro-
stereotype (PRO) and anti-stereotype (ANTI).
PRO consists of sentences in which the gendered
pronouns refer to occupations biased towards the
same gender. Por ejemplo, consider the sentence
‘‘The doctor called the nurse because he wanted a
vaccine.’’ Stereotypically, ‘‘doctor’’ is considered
to be a male-dominated profession, and the gender
of pronoun referencing it (‘‘he’’) is also male.
Por lo tanto, sentences in PRO are consistent with
societal stereotypes. ANTI consists of the same
sentences as PRO, but the gender of the pronoun
is changed. Considering the same example but by
replacing ‘‘he’’ with ‘‘she’’, we get: ‘‘The doctor
called the nurse because she wanted a vaccine.’’
En este caso,
the gender of pronoun (‘‘she’’)
which refers to ‘‘doctor’’ is female. Por lo tanto,
sentences in ANTI are not consistent with societal
stereotypes. Due to such construction, gender bias
in the word embeddings used for training the
coreference model would naturally perform better
in PRO than ANTI and lead to a higher absolute
diferencia (Diff ) between them. While a lesser
gender bias in the model would attain a smaller
Diff, the ideal case produces an absolute difference
of zero.

Following the coreference resolution testing
methodology used by Zhao et al. (2018b), nosotros
train the coreference resolution model proposed
by Lee et al. (2017) on the OntoNotes train
dataset for different embeddings. Mesa 7 muestra
F1-score on OntoNotes 5.0 test set, WinoBias
PRO and ANTI test set for Type 1 template, a lo largo de

497

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Embedding
GloVe
Hard-GloVe
GN-GloVe
GP-GloVe
RAN-GloVe

OntoNotes
66.5
66.2
66.2
66.2
66.2

PRO
76.2
70.6
72.4
70.9
61.4

ANTI
46.0
54.9
51.9
52.1
61.8

Diff
30.2
15.7
20.5
18.8
0.4

Mesa 7: F1-Score (en %) in the task of coreference
resolution. Diff denotes the absolute difference between
F1-score on PRO and ANTI datasets.

Input

Embedding

Vd

AN-GloVe
RA-GloVe
RAN-GloVe

θs = 0.03
0.069
0.060
0.040

GIPE
θs = 0.05
0.015
0.014
0.006

θs = 0.07
0.004
0.007
0.002

Mesa 8: Ablation study—GIPE for AN-GloVe and RA-GloVe.

with the absolute difference (Diff ) of F1-scores
on PRO and ANTI datasets for different word
embeddings. The results for GloVe, Hard-GloVe,
and GN-GloVe are obtained from Zhao et al.
(2018b).

Mesa 7 shows that RAN-GloVe achieves the
smallest absolute difference between scores on
PRO and ANTI subsets of WinoBias, de modo significativo
outperforming other embedding models and
achieving 97.4% better Diff
(ver tabla 7 para
the definition of Diff ) than the next best baseline
(Hard-GloVe) y 98.7% better than the original
GloVe. This lower Diff
is achieved by an
improved accuracy in ANTI and a reduced
accuracy in PRO. We hypothesise that the high
performance of non-debiased GloVe in PRO is due
to the unwanted gender cues rather than the desired
coreference resolving ability of the model. Más,
the performance reduction in PRO for the other
debiased versions of GloVe also corroborates
this hypothesis. Despite debiasing GloVe, a
considerable amount of gender cues remain in
the baseline models as quantified by a lower,
yet significant Diff. A diferencia de, RAN-GloVe is
able to remove gender cues dramatically, thereby
achieving an almost ideal Diff. Además, el
performance of RAN-GloVe on the OntoNotes
5.0 test set
is comparable with that of other
embeddings.

4.9 Ablation Study

To quantitatively and qualitatively analyze the
effect of neutralization and repulsion in RAN-
Debias, we perform an ablation study. Nosotros

examine the following changes in RAN-Debias
independientemente:

1. Nullify the effect of repulsion by setting

λ1 = 0, thus creating AN-GloVe.

2. Nullify the effect of neutralization by setting

λ3 = 0, thus creating RA-GloVe.

We demonstrate the effect of the absence of
neutralization or repulsion through a comparative
analysis on GIPE and the SemBias analogy test.

The GIPE values for AN-GloVe, RA-GloVe,
and RAN-GloVe are presented in Table 8. Nosotros
observe that in the absence of repulsion (UN-
GloVe), the performance is degraded by at least
72% compared to RAN-GloVe. It indicates the
efficacy of repulsion in our objective function
as a way to reduce the unwanted gender-biased
associations between words and their neighbors,
thereby reducing GIPE. Más, even in the ab-
sence of neutralization (RA-GloVe), GIPE is
worse by at least 50% as compared to RAN-
GloVe. De hecho, the minimum GIPE is observed for
RAN-GloVe, where both repulsion and neutraliza-
tion are used in synergy as compared to the ab-
sence of any one of them.

To illustrate further, Mesa 9 shows the rank
of neighbors having illicit proximities for three
professions, using different version of debiased
embeddings. It can be observed that the ranks
in RA-GloVe are either close to or further away
from the ranks in AN-GloVe, highlighting the
importance of repulsion in the objective function.
Más, the ranks in RAN-GloVe are the farthest,

498

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Word

Neighbor

Captain

Nurse

Farmer

Señor
james
women
madre
father
son

AN-GloVe
28
26
57
49
22
45

Embedding
RA-GloVe
22
30
56
74
54
90

RAN-GloVe
52
75
97
144
86
162

Mesa 9: For three professions, we compare the ranks of their
neighbors due to illicit proximities (the values denote the ranks).

Dataset

SemBias

Embedding
AN-GloVe
RA-GloVe
RAN-GloVe

Definition ↑
93.0
83.2
92.8

Stereotype ↓
0.2
7.3
1.1

None ↓
6.8
9.5
6.1

Mesa 10: Comparison for the gender relational analogy test on the SemBias
conjunto de datos. () indicates that higher (más bajo) value is better.

corroborating the minimum value of GIPE as
observed in Table 8.

Mesa 10 shows that in the absence of neutra-
lization (RA-GloVe), the tendency of favouring
stereotypical analogies increases by an absolute
diferencia de 6.2% as compared to RAN-
GloVe. Por otro lado, through the presence
of neutralization, AN-GloVe does not
favor
stereotypical analogies. This suggests that reduc-
ing the projection of biased words on gender
direction through neutralization is an effective
measure to reduce stereotypical analogies within
the embedding space. Por ejemplo, consider the
following instance of word pairs from the SemBias
conjunto de datos: {(widower, widow), (libro, revista),
(dog, cat), (doctor, nurse)}, dónde (widower,
widow) is a gender-definition word pair while
(doctor, nurse) is a gender-stereotype word pair
and the remaining are of none type as explained in
Sección 4.4. During the evaluation, RA-GloVe
incorrectly selects the gender-stereotype word
pair as the closest analogy with (él, she), mientras
AN-GloVe and RAN-GloVe correctly select the
gender-definition word pair. Más, we observe
that RAN-GloVe is able to maintain the high
performance of AN-GloVe, and the difference
is less (0.2% compared to 1.1%) cual es
compensated by the superior performance of
RAN-GloVe over other metrics like GIPE.

Through this ablation study, we understand
the importance of repulsion and neutralization
in the multi-objective optimization function of

RAN-Debias. The superior performance of RAN-
GloVe can be attributed to the synergistic
interplay of repulsion and neutralization. Por eso,
in RAN-GloVe we attain the best of both worlds.

4.10 Case Study: Neighborhood of Words

Here we highlight the changes in the neighborhood
(collection of words sorted in the descending
order of cosine similarity with the given word)
of words before and after the debiasing process.
To maintain readability while also demonstrating
the changes in proximity, we only analyze a few
selected words. Sin embargo, our proposed metric
GIPE quantifies this for an exhaustive vocabulary
colocar.

We select a set of gender-neutral professions
having high values of gender-based proximity
bias ηwi as defined earlier. For each of these
professions, en mesa 11, we select a set of four
words from their neighborhood for two classes:

• Class A: Neighbors arising due to gender-

based illicit proximities.

• Class B: Neighbors whose proximities are

not due to any kind of bias.

For the words in class A, the debiasing proce-
dure is expected to increase their rank, thereby
decreasing the semantic similarity, while for
words belonging to class B, debiasing procedure
is expected to retain or improve the rank for main-
taining the semantic information.

We observe that RAN-GloVe not only main-
tains the semantic information by keeping the

499

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Word

Clase

Neighbor

GloVe Hard-GloVe GN-GloVe GP-GloVe

Embedding

Captain

Nurse

Socialite

Farmer

A

B

A

B

A

B

A

B

Señor
james
brother
father
lieutenant
colonel
comandante
officer
woman
madre
housekeeper
girlfriend
nurses
midwife
nursing
practitioner
businesswoman
heiress
niece
actress
philanthropist
aristocrat
wealthy
socialites
father
son
boy
hombre
rancher
farmers
farm
landowner

19
20
34
39
1
2
3
4
25
27
29
32
1
2
3
4
1
2
12
19
3
4
5
6
12
21
50
51
1
2
3
4

32
22
83
52
1
2
3
5
144
71
54
74
1
3
2
5
1
2
18
16
3
4
5
15
28
84
67
50
2
1
3
4

34
26
98
117
1
2
4
10
237
127
28
60
1
2
3
4
1
2
14
38
3
4
7
5
37
77
115
146
1
4
5
2

20
18
39
40
1
2
3
4
16
25
29
31
1
3
2
5
1
2
17
14
3
4
5
9
13
26
45
60
2
1
4
5

RAN-GloVe
52
75
323
326
1
2
3
15
97
144
152
178
1
2
9
3
6
9
78
120
1
3
4
10
84
162
105
212
3
1
2
5

Mesa 11: For four professions, we compare the ranks of their class A and class B neighbors with respect to
each embedding. Aquí, rank represents the position in the neighborhood of a profession, and is shown by
the values under each embedding.

rank of words in class B close to their initial
value but unlike other debiased embeddings, él
drastically increases the rank of words belonging
to class A. Sin embargo, in some cases like the
word ‘‘Socialite’’, we observe that the ranks of
words such as ‘‘businesswoman’’ and ‘‘heiress’’,
despite belonging to class A, are close to their
initial values. This can be attributed to the high
semantic dependence of ‘‘Socialite’’ on these
palabras, resulting in a bias removal and semantic
information tradeoff.

5 Conclusión

en este documento, we proposed a post-processing
gender debiasing method called RAN-Debias.

Our method not only mitigates direct bias of a
word but also reduces its associations with other
words that arise from gender-based predilections.
We also proposed a word classification method,
called KBC, for identifying the set of words to
be debiased. Instead of using ‘‘biased’’ word
embeddings, KBC uses multiple knowledge bases
for word classification. Además, we proposed
Gender-based Illicit Proximity Estimate (GIPE), a
metric to quantify the extent of illicit proximities
in an embedding. RAN-Debias
de modo significativo
outperformed other debiasing methods on a suite
of evaluation metrics, along with the downstream
application task of coreference resolution while
introducing minimal semantic disturbance.

500

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

En el futuro, we would like to enhance KBC by
utilizing machine learning methods to account for
the words which are absent in the knowledge base.
Actualmente, RAN-Debias is directly applicable
to non-contextual word embeddings for non-
gendered grammatical languages. In the wake of
recent work such as Zhao et al. (2019), we would
like to extend our work towards contextualized
embedding models and other languages with
grammatical gender like French and Spanish.

Acknowledgment

The work was partially supported by the
Ramanujan Fellowship, DST (ECR/2017/00l691).
t. Chakraborty would like to acknowledge the
support of the Infosys Center for AI, IIIT-Delhi.

Referencias

Abdulrahman Almuhareb and Massimo Poesio.
2005. Concept learning and categorization from
the Web. In Proceedings of the Annual Meeting
of the Cognitive Science Society, volumen 27,
pages 103–108.

Tolga Bolukbasi, Kai-Wei Chang, James Y. Zou,
Venkatesh Saligrama, and Adam T. Kalai. 2016.
Man is to computer programmer as woman is
to homemaker? Debiasing word embeddings.
In Advances in Neural Information Processing
Sistemas, pages 4349–4357.

Shikha Bordia and Samuel R. Bowman. 2019.
Identifying and reducing gender bias in word-
level language models. CORR, abs/1904.03035.

Elia Bruni, Nam-Khanh Tran, and Marco Baroni.
semantics.
Intelligence Research,

2014. Multimodal distributional
Journal of Artificial
49:1–47.

Aylin Caliskan, Joanna J. Bryson, and Arvind
Narayanan. 2017. Semantics derived automati-
cally from language corpora contain human-like
prejuicios. Ciencia, 356(6334):183–186.

Jacob Devlin, Ming-Wei Chang, Kenton Lee, y
Kristina Toutanova. 2019. Bert: Pre-training of
deep bidirectional transformers for language
comprensión. Actas de
el 2019
Conference of the North American Chapter of
la Asociación de Lingüística Computacional:
Tecnologías del lenguaje humano, Volumen 1
(Artículos largos y cortos), páginas 4171–4186.

Kawin Ethayarajh, David Duvenaud, and Graeme
primero. 2019. Understanding undesirable word
embedding associations. Actas de
el
57ª Reunión Anual de la Asociación de
Ligüística computacional, pages 1696–1705.

Manaal Faruqui and Chris Dyer. 2014. Improving
vector
space word representations using
multilingual correlation. En Actas de la
14th Conference of the European Chapter of
la Asociación de Lingüística Computacional,
pages 462–471.

Hila Gonen and Yoav Goldberg. 2019. Lipstick on
a pig: Debiasing methods cover up systematic
gender biases in word embeddings but do
not remove them. Actas de la 2019
Conference of the North American Chapter of
la Asociación de Lingüística Computacional:
Tecnologías del lenguaje humano, Volumen 1
(Artículos largos y cortos), pages 609–614.

Rowan Hall Maudslay, Hila Gonen, ryan
Cotterell, and Simone Teufel. 2019. It’s all
in the name: Mitigating gender bias with
name-based counterfactual data substitution.
el 2019 Conferencia sobre
En procedimientos de
in Natural Language
Empirical Methods
Procesamiento y IX Conjunción Internacional
Conferencia sobre procesamiento del lenguaje natural
(EMNLP-IJCNLP), pages 5267–5275. hong
kong, Porcelana. Asociación de Computación
Lingüística.

Felix Hill, Roi Reichart, and Anna Korhonen.
2015. Simlex-999: Evaluating semantic models
con (genuine) similarity estimation. Computa-
lingüística nacional, 41(4):665–695.

Alexander Miserlis Hoyle, Lawrence Wolf-
Sonkin, Hanna Wallach, Isabelle Augenstein,
and Ryan Cotterell. 2019. Unsupervised dis-
covery of gendered language through latent-
variable modeling. In Proceedings of the 57th
Annual Meeting of the Association for Compu-
lingüística nacional, pages 1706–1716.

Stanisław Jastrzebski, Damian Le´sniak, y
Wojciech Marian Czarnecki. 2017. Cómo
evaluate word embeddings? On importance of
data efficiency and simple supervised tasks.
arXiv preimpresión arXiv:1702.02170.

501

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Masahiro Kaneko and Danushka Bollegala.
2019. Gender-preserving debiasing for pre-
trained word embeddings. Actas de la
57ª Reunión Anual de la Asociación de
Ligüística computacional, pages 1641–1650.

Diederik P. Kingma and Jimmy Ba. 2015.
Adán: A method for stochastic optimization.
3rd International Conference on Learning
Representaciones, ICLR, pages 1–15.

Kenton Lee, Luheng He, mike lewis, y
Lucas Zettlemoyer. 2017. End-to-end neural
coreference resolution. Actas de la 2017
Jornada sobre Métodos Empíricos en Natural
Procesamiento del lenguaje, pages 188–197.

Omer Levy and Yoav Goldberg. 2014. Linguis-
tic regularities in sparse and explicit word
representaciones. In Proceedings of the Eigh-
teenth Conference on Computational Natural
Aprendizaje de idiomas, pages 171–180.

Kaiji Lu, Piotr Mardziel, Fangjing Wu, Preetam
Amancharla, and Anupam Datta. 2018. Gender
language processing.
bias in neural natural
arXiv preimpresión arXiv:1807.11714.

Thang Luong, Richard Socher, and Christopher
Manning. 2013. Better word representations
with recursive neural networks for morphology.
In Proceedings of the Seventeenth Conference
sobre el aprendizaje computacional del lenguaje natural,
pages 104–113.

Thomas Manzini, Lim Yao Chong, Alan W.
Negro, and Yulia Tsvetkov. 2019. Black is to
criminal as caucasian is to police: Detecting and
removing multiclass bias in word embeddings.
Actas de la 2019 Conference of the
North American Chapter of the Association for
Ligüística computacional: Human Language
Technologies, Volumen 1 (Long and Short
Documentos), pages 615–621.

Santorini,

Mitchell Marcus, Beatrice

y
Mary Ann Marcinkiewicz. 1993. Building a
large annotated corpus of English: The Penn
Treebank. Ligüística computacional, 19:
313–330.

Stephen Merity, Nitish Shirish Keskar, y
Richard Socher. 2018. Regularizing and
optimizing lstm language models. Internacional
Conferencia sobre Representaciones del Aprendizaje,
pages 1–13.

502

Tomas Mikolov, Kai Chen, Greg Corrado,
and Jeffrey Dean. 2013a. Efficient estimation
of word representations
espacio.
1st
International Conference on Learning
Representaciones, ICLR 2013,Workshop Track
Actas, pages 1–12.

in vector

Tomas Mikolov, Wen-tau Yih, and Geoffrey
Zweig. 2013b. Lingüístico
regularities
en
En
continuous space word representations.
Actas de la 2013 Conference of the
North American Chapter of the Association for
Ligüística computacional: Human Language
Technologies, pages 746–751.

George A. Molinero. 1995. Wordnet: A lexical
database for English. Communications of the
ACM, 38(11):39–41.

Nikola Mrkˇsi´c, Diarmuid ´O. S´eaghdha, Blaise
Thomson, Milica Gaˇsi´c, Lina M. Rojas-
Barahona, Pei-Hao Su, David Vandyke,
Tsung-Hsien Wen, and Steve Young. 2016.
Counter-fitting word vectors to linguistic con-
tensiones. Actas de la 2016 Conferencia
of the North American Chapter of the Asso-
ciation for Computational Linguistics: Humano
Language Technologies, pages 142–148.

Nikola Mrkˇsi´c,

Ivan Vulic, Diarmuid

´O.
S´eaghdha, Ira Leviant, Roi Reichart, Milica
Gaˇsi´c, Anna Korhonen, and Steve Young.
2017. Semantic specialization of distributional
word vector spaces using monolingual and
cross-lingual constraints. Transactions of the
Asociación de Lingüística Computacional,
5:309–324.

lexical

Kim Anh Nguyen, Sabine Schulte im Walde,
and Ngoc Thang Vu. 2016.
Integrating
into word
contrast
distributional
embeddings for antonym-synonym distinction.
Proceedings of the 54th Annual Meeting of
la Asociación de Lingüística Computacional
(Volumen 2: Artículos breves), pages 454–459.

Masataka Ono, Makoto Miwa, and Yutaka
Sasaki. 2015. Word embedding-based antonym
detection using thesauri and distributional
el 2015
En procedimientos de
información.
Conferencia del Capítulo Norteamericano
for Computational
de
Lingüística: Tecnologías del lenguaje humano,
pages 984–989.

Asociación

el

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3

Jahna Otterbacher, Jo Bates, and Paul Clough.
2017. Competent men and warm women:
Gender stereotypes and backlash in image
search results. En Actas de la 2017 CHI
Conference on Human Factors in Computing
Sistemas, pages 6620–6631.

jeffrey

Socher,

Pennington, Ricardo

y
Christopher Manning. 2014. GloVe: Global
vectors for word representation. En procedimientos
del 2014 Conferencia sobre métodos empíricos
en procesamiento del lenguaje natural (EMNLP),
pages 1532–1543.

Jane Pilcher. 2017. Names and ‘‘doing gender’’:
How forenames and surnames contribute to
gender identities, diferencia, and inequalities.
Sex Roles, 77(11–12):812–822.

Kira Radinsky, Eugene Agichtein, Evgeniy
Gabrilovich, and Shaul Markovitch. 2011. A
word at a time: Computing word relatedness
using temporal semantic analysis. En curso-
ings of the 20th International Conference on
World Wide Web, pages 337–346. ACM.

Sascha Rothe

para

and Hinrich Sch¨utze. 2015.
Autoextend: Extending word embeddings
synsets and lexemes.
to embeddings
Proceedings of the 53rd Annual Meeting of
la Asociación de Lingüística Computacional
and the 7th International Joint Conference on
Natural Language Processing (Volumen 1: Largo
Documentos), pages 1793–1803.

Herbert Rubenstein and John B. Goodenough.
1965. Contextual correlates of synonymy.
Communications of the ACM, 8(10):627–633.

Rachel Rudinger,

en

inclinación

Jason Naradowsky, brian
leonardo, and Benjamin Van Durme. 2018.
Gender
resolution.
Actas de la 2018 Conference of the
North American Chapter of the Association for
Ligüística computacional: Human Language
Technologies, Volumen 2 (Artículos breves).

correferencia

Natalie Schluter. 2018. The word analogy
testing caveat. En procedimientos de
el 2018
Conference of the North American Chapter of
la Asociación de Lingüística Computacional:
Tecnologías del lenguaje humano, Volumen 2
(Artículos breves). Nueva Orleans, Luisiana.
Asociación de Lingüística Computacional.

503

Ralph Weischedel, Sameer Pradhan, Lance
Ramshaw, Jeff Kaufman, Michelle Franchini,
Mohammed El-Bachouti, Nianwen Xue,
Martha Palmer, Jena D. Hwang, Claire Bonial,
Jinho Choi, Aous Mansouri, Maha Foster,
Abdel-aati Hawwary, Mitchell Marcus, Ann
taylor, Craig Greenberg, Eduard Hovy, Roberto
Belvin, and Ann Houston. 2012. Ontonotes
release 5.0.

Adina Williams, Damian Blasi, Lawrence Wolf-
Sonkin, Hanna Wallach, and Ryan Cotterell.
2019. Quantifying the semantic core of gender
sistemas. En Actas de la 2019 Conferencia
sobre métodos empíricos en lenguaje natural
Procesamiento y IX Conjunción Internacional
Conferencia sobre procesamiento del lenguaje natural
(EMNLP-IJCNLP), pages 5734–5739.

Jieyu Zhao, Tianlu Wang, Mark Yatskar, ryan
Cotterell, Vicente Ordonez, and Kai-Wei
Chang. 2019. Gender bias in contextualized
word embeddings. En Actas de la 2019
Conference of the North American Chapter of
la Asociación de Lingüística Computacional:
Tecnologías del lenguaje humano, Volumen 1
(Artículos largos y cortos), pages 629–634,
Mineápolis, Minnesota.

Jieyu Zhao, Tianlu Wang, Mark Yatskar, Vicente
Ordonez, and Kai-Wei Chang. 2018a. Gender
bias in coreference resolution: Evaluation and
debiasing methods. Actas de la 2018
Conference of the North American Chapter of
la Asociación de Lingüística Computacional:
Tecnologías del lenguaje humano, Volumen 2
(Artículos breves), pages 8–14.

Jieyu Zhao, Yichao Zhou, Zeyu Li, Wei
Wang, and Kai-Wei Chang. 2018b. Aprendiendo
gender-neutral word embeddings. Actas
de
on Empirical
Métodos en el procesamiento del lenguaje natural,
pages 4847–4853.

2018 Conferencia

el

Pei Zhou, Weijia Shi, Jieyu Zhao, Kuan-Hao
Huang, Muhao Chen, Ryan Cotterell, y
Kai-Wei Chang. 2019. Examining gender
bias in languages with grammatical gender.
el 2019 Conferencia sobre
En procedimientos de
Empirical Methods
in Natural Language
Procesamiento y IX Conjunción Internacional
Conferencia sobre procesamiento del lenguaje natural
(EMNLP-IJCNLP), pages 5276–5284.

yo

D
oh
w
norte
oh
a
d
mi
d

F
r
oh
metro
h

t
t

pag

:
/
/

d
i
r
mi
C
t
.

metro

i
t
.

mi
d
tu

/
t

a
C
yo
/

yo

a
r
t
i
C
mi

pag
d

F
/

d
oh

i
/

.

1
0
1
1
6
2

/
t

yo

a
C
_
a
_
0
0
3
2
7
1
9
2
3
7
4
4

/

/
t

yo

a
C
_
a
_
0
0
3
2
7
pag
d

.

F

b
y
gramo
tu
mi
s
t

t

oh
norte
0
8
S
mi
pag
mi
metro
b
mi
r
2
0
2
3Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased image

Descargar PDF