Harvard CSCI S-109A - Twitter Bot Detection

Project Team: Eumar Assis, Andrew Caide, Mark Carlebach, and Jiang Yusheng

Introduction

Project Background and Purpose

Most people who use twitter are aware of the possibility that tweets received are tweets generated by computer algorithms or ‘bots’. There are concerns broadly that bots can cause societal damage by propagating ‘fake news’ that can influence people in a number of ways. The most prominent potential impact of ‘fake news’ is on how recipients of fake news vote in elections, both in the US and abroad.

To mitigate risks associated with bot activity, Twitter takes many steps that include the use of machine learning algorithms to detect the bots (and then terminate the accounts).

This project fits in this context and has its goal the development of machine learning algorithms to detect bots based on tweet activity. We used techniques taught in CSCI S-109A as demonstrated throughout this report.

Project Overview

The following is an outline of how we approached this project:

Go to next page