The nature of political communication has been fundamentally altered by the emergence of social media. In earlier eras, social scientists, journalists, and citizens could focus on static statements by politicians and candidates in order to understand the nature of political discourse. Social scientists studying political communication would design surveys and focus groups to understand which messages were received by citizens, and with what effect. Today, as news moves to digital platforms and as political figures increasingly rely on social media, political communication is fundamentally dynamic. Studying patterns of communication among politicians, their supporters, and their critics requires scholarly focus on the content, sentiment, and framing of posts on various social media platforms. Similarly, making sense of the contours of an election campaign requires that scholars explore the interplay among online messages and real-world events such as endorsements, scandals, and standing in the polls. All of this means that the tools of political analysis must be combined with the tools of data science.
Over the past several years a team of researchers from the Department of Politics, the School of Data Science, and Research Computing have collaborated to create a system to capture and store every Tweet to and from every major presidential candidate since the 2016 primaries, continuing with every Tweet to and from President Trump since Election Day. This collaboration has produced a unique database of approximately one billion tweets (currently 5 TB). While the analysis of tweets from candidates and office holders is commonplace, collecting, organizing, and analyzing tweets to these figures, both from supporters and opponents, requires significant effort and sustained collaboration.
PI: Paul Freedman (Department of Politics)
|
projects
data-science, hpc, hpc-computing, R, text analysis