HASOC (2021)

Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages

All datasets are password protected. Kindly register here for the key to unlock the zip files.
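Once the key has been received, the archives can also be unpacked from a script. The following is a minimal sketch, not an official HASOC tool: the function name `extract_protected_zip`, the archive file name, and the output folder are all hypothetical, and it assumes the archives use legacy zip encryption, which is the only kind Python's standard `zipfile` module can decrypt. If extraction fails, a desktop archive tool may be needed instead.

```python
import zipfile
from pathlib import Path


def extract_protected_zip(zip_path, password, out_dir):
    """Extract a password-protected zip archive and return its member names."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        # zipfile expects the password as bytes, not str.
        archive.extractall(path=out, pwd=password.encode("utf-8"))
        return archive.namelist()


if __name__ == "__main__":
    # Hypothetical file name and key; substitute the downloaded archive
    # and the key obtained through registration.
    archive_file = Path("hasoc_subtask1_english_train.zip")
    if archive_file.exists():
        for name in extract_protected_zip(archive_file, "YOUR-KEY", "hasoc_data"):
            print(name)
```

The same call works for each language and subtask archive; only the file name changes.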

Subtask 1 Dataset


Category           Train Dataset    Test Dataset
English Dataset    Download         Download
Hindi Dataset      Download         Download
Marathi Dataset    Download         Download

Subtask 2 Dataset


Category                          Train Dataset    Test Dataset
English-Hindi Code-Mix Dataset    Download         Download

Category           Link
English Dataset    Download
Hindi Dataset      Download
German Dataset     Download

Category           Link
English Dataset    Download
Hindi Dataset      Download
German Dataset     Download