Monitor on Digital Violence Against Women in Uruguay
Overview
The monitor on digital violence against women provides accessible real-time evidence regarding the level of aggressions and insults received by Uruguayan public figures such as women politicians, journalists, communicators, activists and artists on Twitter. It includes the analysis of the number of aggressions received, the most frequent types of aggressions and the most used insults.
Methodology
The monitor uses data from Twitter (currenlty, X), a social network that used to provide free access to all tweets made by public users through an API. Only anonymized public data was accesed (private profiles are not available), so no personal information was disclosed in the making of the monitor. Data was collected and stored in real time every hour for the period 1 March 2022-14 March 2023.
The monitor took a sample of 180 active Twitter accounts of women with leadership in opinion and public visibility, and more than 3,000 followers:
• 69 journalists
• 41 politicians
• 27 artists
• 27 communicators
• 16 social leaders
For the automatic and scaled detection of digital violence in networks, a neural network model (machine learning) was trained using local data and annotators.
In particular, a model called BETO was used, a BERT-type transformer trained by a team from the Computer Science Department of the University of Chile on a large corpus of Spanish texts (*). The BETO model was trained with 7,697 tweets classified in digital violence and digital non-violence, annotated by 6 local annotators. Thus, the model learned to classify new, unpublished and previously unseen tweets. After six successive rounds of annotation and training, the final model obtained an accuracy of 79.7% on the test sample, and a macro F1 of 76%.
Once the violent tweets were detected, they were classified into different categories using a list of 350 insults. The list of insults was built from one used in the Argentitian Project (**) and updated based on the inspection of 9,000 noted tweets of Uruguayan origin.
Categories of expressions of digital violence against women
Comments, opinions about personality or qualities that underestimate their knowledge or skills. These comments respond to their ability to do politics, journalism, communication, social leadership.
Comments that allude to the body and/or sexuality, linked to physical appearance, body stereotypes, qualifications and evaluations of their figure or complexion.
Derogatory expressions towards a person or group for their voluntary affiliation to a political party, ideology, militant group, professional or other type of group.
Disdainful comments regarding fundamental aspects of gender identity, sexual orientation, religion, social class, age, ethnic and racial origin, among others.
Violent, lewd and/or aggressive expressions and content that manifest intent to sexually harm women or their loved ones.
Words, comments, accusations about a woman's actions invalidate her opinion/speech.
Accusatory or violent content related to illegal actions in the workplace, professional or political sphere.
* This categorization was developed jointly by UNDP and InMujeres, using as a basis a project carried out in Argentina (Office of Special Projects (Planning and Management Control Unit) of the Honorable Chamber of Deputies). Previous categorizations made by ELA (2019), the Alan Turing Institute (2020) and UNESCO (2015) were also adapted and expanded.
Sources
(*) https://huggingface.co/dccuchile/bert-base-spanish-wwm-cased
(**) Project developed and maintained by WitPact and led by Laila Sprejer, with technical support from Fundar's Gender and Data areas, and with the support of the British Embassy in Argentina. Available at https://www.conductaenredes.org/