Raw data and analysis code for the study “Project 2025 as a technocratic blueprint: A corpus-based linguistic analysis of conservative governance discourse” (doi:10.60507/FK2/BK14F8)

Document Description

Citation

Title:

Raw data and analysis code for the study “Project 2025 as a technocratic blueprint: A corpus-based linguistic analysis of conservative governance discourse”

Identification Number:

doi:10.60507/FK2/BK14F8

Distributor:

bonndata

Date of Distribution:

2025-11-11

Version:

1

Bibliographic Citation:

Schilling, Julia; Fuchs, Robert, 2025, "Raw data and analysis code for the study “Project 2025 as a technocratic blueprint: A corpus-based linguistic analysis of conservative governance discourse”", https://doi.org/10.60507/FK2/BK14F8, bonndata, V1

Study Description

Citation

Title:

Raw data and analysis code for the study “Project 2025 as a technocratic blueprint: A corpus-based linguistic analysis of conservative governance discourse”

Identification Number:

doi:10.60507/FK2/BK14F8

Authoring Entity:

Schilling, Julia (Universität Bonn)

Fuchs, Robert (Universität Bonn)

Software used in Production:

spaCy

Software used in Production:

R

Software used in Production:

RStudio

Software used in Production:

Python

Software used in Production:

LIWC

Distributor:

bonndata

Access Authority:

Schilling, Julia

Depositor:

Schilling, Julia

Date of Deposit:

2025-11-05

Holdings Information:

https://doi.org/10.60507/FK2/BK14F8

Study Scope

Keywords:

Arts and Humanities

Abstract:

This repository contains all data and scripts used for the study “Project 2025 as a Technocratic Blueprint: A Corpus-Based Linguistic Analysis of Conservative Governance Discourse” (Schilling & Fuchs, 2025). The study investigates the language of the Heritage Foundation’s Project 2025, a 900-page conservative policy blueprint, using methods from Corpus-Assisted Discourse Studies (CADS), Political Discourse Analysis (PDA), and psycholinguistic text analysis. The corpus includes Project 2025 and Democratic and Republican Party platforms (2016–2024).

The dataset includes:

Raw data (/data/raw_data/): full texts of Project 2025 and party platforms in CSV format.
Processed data (/data/raw_data/Project2025_lemmaPOS.csv): tokenized, lemmatized, and POS-tagged text.
Keyness results (/data/keyness/): unigram and bigram keyness calculations (log-likelihood, log-ratio).
Collocation results (/data/collocations/): top 10 adjective, noun, and verb collocates per node.
LIWC results (/data/liwc/): LIWC-22 category scores for each corpus.
Analysis scripts (/code/): R Markdown file (analysis_project2025.Rmd) and two Python scripts for collocation and keyness analysis.

All files are in UTF-8 plain-text format. The dataset contains no personal, sensitive, or proprietary data and derives entirely from publicly accessible political documents.
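The abstract names log-likelihood and log-ratio as the keyness measures but does not give their formulas. The sketch below uses the two-corpus formulations that are common in corpus linguistics; it is not taken from keyness_analysis.py. The function names, the 0.5 zero-frequency adjustment, and the toy counts are all assumptions made for illustration.

```python
import math


def log_likelihood(freq_target, freq_ref, size_target, size_ref):
    """Two-corpus log-likelihood (G2) for one item.

    Expected frequencies assume the item is spread across both corpora
    in proportion to corpus size.
    """
    total = freq_target + freq_ref
    combined_size = size_target + size_ref
    expected_target = size_target * total / combined_size
    expected_ref = size_ref * total / combined_size
    g2 = 0.0
    for observed, expected in ((freq_target, expected_target),
                               (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2.0 * g2


def log_ratio(freq_target, freq_ref, size_target, size_ref, zero_adjust=0.5):
    """Binary log of the ratio of relative frequencies.

    zero_adjust replaces a zero count so the ratio stays defined
    (an assumption; the value used in the study is not stated).
    """
    rel_target = (freq_target or zero_adjust) / size_target
    rel_ref = (freq_ref or zero_adjust) / size_ref
    return math.log2(rel_target / rel_ref)


if __name__ == "__main__":
    # Toy counts, not values from the dataset.
    print(log_likelihood(200, 100, 1000, 1000))
    print(log_ratio(200, 100, 1000, 1000))
```

Log-likelihood indicates how much evidence there is for a frequency difference, while log-ratio indicates how large the difference is, which is why the two are often reported side by side.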

Methodology and Processing

Sources Statement

Data Access

Notes:

CC BY 4.0 (http://creativecommons.org/licenses/by/4.0)

Other Study Description Materials

Related Publications

Citation

Title:

Project 2025 as a Technocratic Blueprint: A Corpus-Based Linguistic Analysis of Conservative Governance Discourse

Bibliographic Citation:

Schilling, Julia & Fuchs, Robert (2025). Project 2025 as a Technocratic Blueprint: A Corpus-Based Linguistic Analysis of Conservative Governance Discourse. Submitted manuscript, PLOS ONE.

Other Study-Related Materials

Label:

README_Project2025.md

Notes:

text/markdown

Other Study-Related Materials

Label:

analysis_project2025.Rmd

Notes:

text/x-r-notebook

Other Study-Related Materials

Label:

collocation_analysis.py

Notes:

text/x-python-script

Other Study-Related Materials

Label:

keyness_analysis.py

Notes:

text/x-python-script

Other Study-Related Materials

Label:

P2025_colloc_ADJ_w7_top10_per_node.csv

Notes:

text/csv

Other Study-Related Materials

Label:

P2025_colloc_NOUN_w7_top10_per_node.csv

Notes:

text/csv

Other Study-Related Materials

Label:

P2025_colloc_VERB_w7_top10_per_node.csv

Notes:

text/csv

Other Study-Related Materials

Label:

keyness_1gram_pos_manifesto_dem_overuse.csv

Notes:

text/csv

Other Study-Related Materials

Label:

keyness_1gram_pos_manifesto_rep_overuse.csv

Notes:

text/csv

Other Study-Related Materials

Label:

keyness_2gram_pos_manifesto_dem_overuse.csv

Notes:

text/csv

Other Study-Related Materials

Label:

keyness_2gram_pos_manifesto_rep_overuse.csv

Notes:

text/csv

Other Study-Related Materials

Label:

Project2025_lemmaPOS.csv

Notes:

text/csv

Other Study-Related Materials

Label:

Platforms_Democrats_LIWC_subset.csv

Notes:

text/csv

Other Study-Related Materials

Label:

Platforms_Republicans_LIWC_subset.csv

Notes:

text/csv

Other Study-Related Materials

Label:

Project2025_LIWC_subset.csv

Notes:

text/csv

Other Study-Related Materials

Label:

Platforms_Democrats.csv

Notes:

text/csv

Other Study-Related Materials

Label:

Platforms_Republicans.csv

Notes:

text/csv

Other Study-Related Materials

Label:

Project2025.csv

Notes:

text/csv
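
The collocation files listed above carry `w7` and `top10_per_node` in their names, and the abstract describes them as the top 10 adjective, noun, and verb collocates per node. The sketch below shows one way such a table could be produced. It assumes that `w7` means a window of seven tokens on each side of the node, that the input is a sequence of (lemma, POS) pairs like the one in Project2025_lemmaPOS.csv, and that collocates are ranked by raw co-occurrence count. None of this is confirmed by the record, and collocation_analysis.py may use a different window definition or an association measure instead of raw counts. The function name and toy tokens are hypothetical.

```python
from collections import Counter


def top_collocates(tokens, node, pos, window=7, top_n=10):
    """Return the top_n collocates of `node` that carry the POS tag `pos`.

    tokens: list of (lemma, pos) pairs in text order.
    window: number of tokens considered on each side of the node
            (assumed reading of "w7" in the file names).
    """
    counts = Counter()
    for i, (lemma, _) in enumerate(tokens):
        if lemma != node:
            continue
        start = max(0, i - window)
        end = min(len(tokens), i + window + 1)
        for j in range(start, end):
            if j == i:  # skip the node occurrence itself
                continue
            collocate, collocate_pos = tokens[j]
            if collocate_pos == pos:
                counts[collocate] += 1
    return counts.most_common(top_n)


if __name__ == "__main__":
    # Toy input, not taken from the corpus.
    toy = [("federal", "ADJ"), ("government", "NOUN"), ("should", "AUX"),
           ("reform", "VERB"), ("federal", "ADJ"), ("agency", "NOUN")]
    print(top_collocates(toy, node="government", pos="ADJ"))
```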