Replication data for "Analyzing the Potency of Pretrained Transformer Models for Automated Program Repair" (doi:10.60507/FK2/O54LWP)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link)

Document Description

Citation

Title:

Replication data for "Analyzing the Potency of Pretrained Transformer Models for Automated Program Repair"

Identification Number:

doi:10.60507/FK2/O54LWP

Distributor:

bonndata

Date of Distribution:

2024-07-15

Version:

2

Bibliographic Citation:

Leiwig, Maximilian; Swierzy, Ben; Bungartz, Christian; Meier, Michael, 2024, "Replication data for "Analyzing the Potency of Pretrained Transformer Models for Automated Program Repair"", https://doi.org/10.60507/FK2/O54LWP, bonndata, V2

Study Description

Citation

Title:

Replication data for "Analyzing the Potency of Pretrained Transformer Models for Automated Program Repair"

Identification Number:

doi:10.60507/FK2/O54LWP

Authoring Entity:

Leiwig, Maximilian (University of Bonn, Lamarr Institute)

Swierzy, Ben (University of Bonn)

Bungartz, Christian (University of Bonn, Lamarr Institute)

Meier, Michael (University of Bonn, Fraunhofer FKIE, Lamarr Institute)

Distributor:

bonndata

Access Authority:

Leiwig, Maximilian

Depositor:

Leiwig, Maximilian

Date of Deposit:

2024-07-08

Holdings Information:

https://doi.org/10.60507/FK2/O54LWP

Study Scope

Keywords:

Computer and Information Science, Computer and Information Science

Abstract:

This repository contains replication data for the paper "Analyzing the Potency of Pretrained Transformer Models for Automated Program Repair".<br> The dataset contains commits that indicate a bug fix from 200 repositories on Github. Scripts for fetching data associated with the commits are available (see Github repository linked under "Related Material"). <hr> File descriptions: <ul> <li><code>repositories.csv</code>: List of repository names with metadata</li> <li><code>commits.csv</code>: List of commit identifiers of the 200 repositories with most bug fixing commits</li> <li><code>commits_high_watch_count.csv</code>: List of commit identifiers of the 200 repositories with most bug fixing commits with a watch count of at least 50</li> </ul> <hr> Icon licensed under CC-BY by xinh.studio

Kind of Data:

quantitative

Notes:

The corresponding paper was be published at SEAA Aug 28 - Aug 30 2024.

Methodology and Processing

Sources Statement

Documentation and Access to Sources:

Original Sources are Google BigQuery and GitHub

Data Access

Other Study Description Materials

Related Materials

Code: https://github.com/Synrom/FixMe

Related Publications

Citation

Title:

M. Leiwig, B. Swierzy, C. Bungartz and M. Meier, "Analyzing the Potency of Pretrained Transformer Models for Automated Program Repair," 2024 50th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), Paris, France, 2024, pp. 72-79, doi: 10.1109/SEAA64295.2024.00020.

Identification Number:

10.1109/SEAA64295.2024.00020.

Bibliographic Citation:

M. Leiwig, B. Swierzy, C. Bungartz and M. Meier, "Analyzing the Potency of Pretrained Transformer Models for Automated Program Repair," 2024 50th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), Paris, France, 2024, pp. 72-79, doi: 10.1109/SEAA64295.2024.00020.

Other Study-Related Materials

Label:

commits.csv

Notes:

text/csv

Other Study-Related Materials

Label:

commits_high_watch_count.csv

Notes:

text/csv

Other Study-Related Materials

Label:

README.md

Notes:

text/markdown

Other Study-Related Materials

Label:

repositories.csv

Notes:

text/csv