This website requires JavaScript.

GIRT-Data: Sampling GitHub Issue Report Templates

Nafiseh NikeghbalAmir Hossein KargaranAbbas HeydarnooriHinrich Sch\"utze
Mar 2023
摘要
GitHub's issue reports provide developers with valuable information that isessential to the evolution of a software development project. Contributors canuse these reports to perform software engineering tasks like submitting bugs,requesting features, and collaborating on ideas. In the initial versions ofissue reports, there was no standard way of using them. As a result, thequality of issue reports varied widely. To improve the quality of issuereports, GitHub introduced issue report templates (IRTs), which pre-fill issuedescriptions when a new issue is opened. An IRT usually contains greetingcontributors, describing project guidelines, and collecting relevantinformation. However, despite of effectiveness of this feature which wasintroduced in 2016, only nearly 5% of GitHub repositories (with more than 10stars) utilize it. There are currently few articles on IRTs, and the availableones only consider a small number of repositories. In this work, we introduceGIRT-Data, the first and largest dataset of IRTs in both YAML and Markdownformat. This dataset and its corresponding open-source crawler tool areintended to support research in this area and to encourage more developers touse IRTs in their repositories. The stable version of the dataset contains1,084,300 repositories and 50,032 of them support IRTs. The stable version ofthe dataset and crawler is available here:https://github.com/kargaranamir/girt-data
展开全部
图表提取

暂无人提供速读十问回答

论文十问由沈向洋博士提出,鼓励大家带着这十个问题去阅读论文,用有用的信息构建认知模型。写出自己的十问回答,还有机会在当前页面展示哦。

Q1论文试图解决什么问题?
Q2这是否是一个新的问题?
Q3这篇文章要验证一个什么科学假设?
0
被引用
笔记
问答