by Kris McGuffie and Alex Newhouse

In 2020, OpenAI developed GPT-3, a neural language model capable of sophisticated natural language generation and of completing tasks like classification, question-answering, and summarization. While OpenAI has not open-sourced the model’s code or pre-trained weights at the time of writing, it has built an API for experimenting with the model’s capabilities. When we researched GPT-3’s predecessor, GPT-2, last year, we found that language models have the potential to be used as potent generators of ideologically consistent extremist content.

In a new report, CTEC evaluated whether GPT-3’s revolutionary improvements increase the risk of weaponization by extremists, who may attempt to use GPT-3 or hypothetical unregulated models to amplify their ideologies and recruit to their communities.

Experimenting with prompts representative of different types of extremist narrative, structures of social interaction, and radical ideologies, we find:

  • GPT-3 demonstrates significant improvement over its predecessor, GPT-2, in generating extremist texts.

  • GPT-3 shows strength in generating text that accurately emulates interactive, informational, and influential content that could be utilized for radicalizing individuals into violent far-right extremist ideologies and behaviors.

  • While OpenAI’s preventative measures are strong, the possibility of unregulated copycat technology represents a significant risk for large-scale online radicalization and recruitment. In the absence of safeguards, successful and efficient weaponization that requires little experimentation is likely.

  • AI stakeholders, the policymaking community, and governments should begin investing as soon as possible in building social norms, public policy, and educational initiatives to preempt an influx of machine-generated disinformation and propaganda. Mitigation will require effective policy and partnerships across industry, government, and civil society.

Read and download our full report here.

This project was made possible by the OpenAI API Academic Access Program. 



