Newcastle University
Browse

Polarity and topic supervised classification of LastQuake app user's comments - Aegean 2020 earthquake

Download (251.17 kB)
Version 4 2022-04-07, 10:59
Version 3 2021-05-27, 13:04
Version 2 2021-05-27, 11:50
Version 1 2021-05-26, 14:23
dataset
posted on 2021-05-26, 14:23 authored by Diana ContrerasDiana Contreras, Sean Wilkinson, Laure Fallou, Matthieu Landès, Rémy Bossu, Yasemin Didem AktasYasemin Didem Aktas

This database contains the comparison of results between the supervised and unsupervised polarity classification of the tweets related to the 2019 Albania earthquake. The database was constructed with the aim to test the accuracy of the algorithm developed by MonkeyLearn for polarity classification.

675 Tweets with the hashtags: #Albania #AlbanianEarthquake #albanianearthquake from the 26th November 2019 to the 3rd February 2020 were collected by the third-party vendor: TweetBinder. The social media department of Newcastle University provided us with 1001 tweets with the hashtags: #Albania collected from the 31st January to the 2nd February 2020. After removing repeated tweets from the database, we obtained a dataset made up of 255 original tweets. This database only contains text data of originat tweets (no retweets). Attributes and data contained:

§ N: Number of the tweet

§ Tweet: Text data

§ ML-Classification: Unsupervised polarity classification performed by the algorithm developed by MonkeyLearn.

§ Confidence: Percentage of trust that the predicted polarity is right

§ RB-Classification: Supervised polarity classification performed by experts.

§ Accuracy: Coincidence in polarity predicted by the unsupervised classification and the supervised classification


Funding

Learning from Earthquakes: Building Resilient Communities Through Earthquake Reconnaissance, Response and Recovery

Engineering and Physical Sciences Research Council

Find out more...

History

UoA

  • Engineering