Programs

Community Health

We are designing qualitative and quantitative methods to identify and target harassment in Wikimedia projects.

Project overview

Harassment is a pervasive issue in many online communities. A 2014 Pew survey found that 73% of internet users have witnessed online harassment, and 40% have experienced it. In 2015, the Wikimedia Foundation conducted its own survey and found about 38% of responding contributors had experienced some form of harassment. Over half felt a decrease in their motivation to contribute to Wikimedia projects in the future.

The Wikimedia Board of Trustees has identified the problem of harrassment in Wikimedia projects as a threat to "our ability to collect, share, and disseminate free knowledge." In a resolution on "healthy community culture, inclusivity, and safe spaces," the Wikimedia Board of Trustees identified responses to harassment as a priority for the movement.

Our team is working with other departments at the Wikimedia Foundation and outside research collaborators to better understand and combat harrassment in Wikimedia projects and discussion spaces. We have designed algorithms to help detect toxic behavior, and we are learning more about how this behavior affects contributors to Wikimedia projects. We have released data sets and open source tools to support open and reproducible research on online harassment.

Recent updates

Project team

Dario Taraborelli, Jonathan Morgan, Diego Sáez-Trumper

Collaborators

Jonathan Chang (Cornell University), Cristian Danescu-Niculescu-Mizil (Cornell University), Lucas Dixon (Jigsaw), Yiqing Hua (Cornell University), Srijan Kumar (Stanford University), Jure Leskovec (Stanford University), Tilen Marc (Stanford University), Caroline Sinders (Wikimedia Foundation), Nithum Thain (Jigsaw), Justine Zhang (Cornell University), Ellery Wulczyn (Wikimedia Foundation)

Publications

Yiqing Hua, Cristian Danescu-Niculescu-Mizil, Dario Taraborelli, Nithum Thain, Jeffery Sorensen, Lucas Dixon. 2018. WikiConv: A Corpus of the Complete Conversational History of a Large Online Collaborative Community. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP ’18), pp. 2818–2823. http://aclweb.org/anthology/D18-1305
Justine Zhang, Jonathan P. Chang, Cristian Danescu-Niculescu-Mizil, Lucas Dixon, Yiqing Hua, Nithum Thain, and Dario Taraborelli. 2018. Conversations Gone Awry: Detecting Early Signs of Conversational Failure. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL '18).
Ellery Wulczyn, Nithum Thain, and Lucas Dixon. 2017. Ex machina: Personal attacks seen at scale. Proceedings of the 26th International Conference on World Wide Web (WWW '17). ACM, New York, NY, USA, 1785-1799. DOI: https://doi.org/10.1145/3038912.3052591

Wikimedia Research

Community Health

Project overview

Recent updates

Release of WikiConv dataset

Presentation video available for Conversations Gone Awry

Research showcase for Conversations Gone Awry

Machine learning is helping computers spot arguments online before they happen

Scientists are building a detector for conversations likely to go bad

Paper accepted at ACL '18: Conversations Gone Awry

Characterizing Wikihounding on Wikipedia

Toxic Comment Classification Challenge

Conversation corpora, emotional robots, and battles with bias

Sockpuppet detection in Wikimedia projects

Collection of 13,500 Nastygrams Could Advance War on Trolls

Scaling up our understanding of harassment on Wikipedia

Detecting Personal Attacks on Wikipedia

Project team

Collaborators

Publications

Resources and links