Two balanced datasets have been created for each collection, that is in Spanish and in English. The data was manually labelled by two annotators according to three levels, namely Misogyny Identification, Misogynistic Category Classification and Target Classification. Cases in disagreement were solved by a third annotator.
The files containing the training and testing sets are given through the mailing list email@example.com.
The English and Spanish training sets are available here.
The English and Spanish testing sets are available here.