YANG Jun, LIU Shi-fan, CHEN Xiang, CUI Zhan-qi
Online available: 2025-06-03
In open-source software and platforms, developers can submit issues to report software bugs or suggest new feature requests. Due to the lack of experience and limited professional skills, users may struggle to summarize the content of issues accurately and effectively, resulting in low-quality issue titles, which in turn decreases the efficiency of addressing issues. Additionally, existing automatic issue title generation methods are primarily designed for English open-source platforms, such as GitHub, and the performance are degraded when applied to Chinese open-source platforms, like Gitee. Furthermore, existing methods mainly use the issue body description as inputs, ignoring the code snippets in the issue. In this paper, we propose a method called GITG (Gitee Issue Title Generation) specifically designed for Gitee, an open-source platform. GITG addresses the challenge of generating issue titles for both Chinese and English text by fine-tuning the Chinese BART pre-trained model on a constructed Gitee issue dataset. It leverages the bi-modal information from the issue body description and code snippets to generate informative and accurate issue titles. A dataset consisting of 18 242 Gitee issue samples is constructed to validate the effectiveness of GITG. Experimental results demonstrate that GITG outperforms iTAPE and iTiger by at least 13.09%, 10.18%, and 12.84% on the ROUGE-1, ROUGE-2, and ROUGE-L metrics, respectively. GITG also achieves improvements in BLEU and METEOR metrics. Human evaluation results also indicate that the average scores of the titles generated by GITG are improved by at least 26.7%, 20.8%, 24.2%, and 20.0% in overall score, fluency, informativeness, and conciseness, respectively, compared to iTAPE and iTiger.