
SIDATA

Saudi-Dialect-Irony-Dataset

URL: https://github.com/iwan-rg/Saudi-Dialect-Irony-Dataset

Description: The Saudi irony dataset was collected using the Twitter API and consists of 19,810 tweets, of which 8,089 are labeled as ironic.

Dataset

Additional Information

How the datasets were created

The Saudi irony dataset (Sa`7r ساخر) was collected using the Twitter API. It consists of 19,810 tweets, out of which 8,089 are labeled as ironic. This dataset was created as part of a study on irony detection in sentiment analysis.
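The label counts above imply a moderately imbalanced dataset, which is worth knowing before training a classifier. The following is a minimal sketch of that arithmetic, not code from the repository. It assumes that every tweet not labeled as ironic counts as non-ironic; the function name `class_balance` and the dictionary keys are hypothetical.

```python
# Counts taken from the dataset description.
TOTAL_TWEETS = 19_810
IRONIC_TWEETS = 8_089


def class_balance(total: int, positive: int) -> dict:
    """Return counts and shares for a binary-labeled dataset.

    Assumption: every tweet that is not labeled ironic is non-ironic.
    """
    negative = total - positive
    return {
        "ironic": positive,
        "non_ironic": negative,
        "ironic_share": positive / total,
        "non_ironic_share": negative / total,
    }


balance = class_balance(TOTAL_TWEETS, IRONIC_TWEETS)
print(f"ironic:     {balance['ironic']} ({balance['ironic_share']:.1%})")
print(f"non-ironic: {balance['non_ironic']} ({balance['non_ironic_share']:.1%})")
```

Under that assumption, roughly 41% of the tweets are ironic, so a classifier that always predicts the majority class would already reach about 59% accuracy. Any reported accuracy is best read against that baseline.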

Training methods applied

The dataset was used to train several machine learning models for irony detection.

Results obtained

Read the full paper.