Analyzing COVID-Related Social Discourse on Twitter using Emotion, Sentiment, Political Bias, Stance, Veracity and Conspiracy Theories

Abstract

Online misinformation has become a major concern in recent years, and it has been further emphasized during the COVID-19 pandemic. Social media platforms, such as Twitter, can be serious vectors of misinformation online. In order to better understand the spread of these fake-news, lies, deceptions, and rumours, we analyze the correlations between the following textual features in tweets: emotion, sentiment, political bias, stance, veracity and conspiracy theories. We train several transformer-based classifiers from multiple datasets to detect these textual features and identify potential correlations using conditional distributions of the labels. Our results show that the online discourse regarding some topics, such as COVID-19 regulations or conspiracy theories is highly controversial and reflects the actual U.S. political landscape.

Code

This repository contains notebooks to train the three different models (sentiment, emotion, political bias) and to perform some data analysis.

./src/Data-Stats.ipynb contains some code to perform data analysis on the datasets, and to split them into stratified folds. It also convert the original datasets into normalized csv files with tweet content and label.
./src/Train-Inference.ipynb contains some code to train models on the datasets, to use them on other datasets (inference), and to visualise the results into heatmaps. The Load Data cells specify which dataset will be loaded.

Datasets should be copied to a new folder named ./data at the root of the repository.

Citations

Please cite this work as below:

@inproceedings{Peskine_BeyondFacts_2023,
author = {Peskine, Youri and Troncy, Raphael and Papotti, Paolo},
title = {{Analyzing COVID-Related Social Discourse on Twitter using Emotion, Sentiment, Political Bias, Stance, Veracity and Conspiracy Theories}},
proceedings = {3rd International Workshop on Knowledge Graphs for Online Discourse Analysis (BeyondFacts)},
doi = {10.1145/3543873.3587622},
year = {2023}
}

Results

The heatmaps obtained with the different models can be seen on the following figures.

Sentiment:

Emotion:

Political bias:

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
figures		figures
src		src
CITATION.cff		CITATION.cff
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Analyzing COVID-Related Social Discourse on Twitter using Emotion, Sentiment, Political Bias, Stance, Veracity and Conspiracy Theories

Abstract

Code

Citations

Results

About

Releases

Packages

Contributors 3

Languages

D2KLab/covid-twitter-discourse-analysis

Folders and files

Latest commit

History

Repository files navigation

Analyzing COVID-Related Social Discourse on Twitter using Emotion, Sentiment, Political Bias, Stance, Veracity and Conspiracy Theories

Abstract

Code

Citations

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages