Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/131348
Citations
Scopus Web of Science® Altmetric
?
?
Type: Conference paper
Title: Combining query reduction and expansion for text-retrieval-based bug localization
Author: Florez, J.M.
Chaparro, O.
Treude, C.
Marcus, A.
Citation: Proceedings of the Euromicro Conference on Software Maintenance and Reengineering, CSMR, 2021, pp.166-176
Publisher: IEEE
Publisher Place: online
Issue Date: 2021
ISBN: 9781728196305
ISSN: 1534-5351
Conference Name: IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) (9 Mar 2021 - 12 Mar 2021 : virtual online)
Statement of
Responsibility: 
Juan Manuel Florez, Oscar Chaparro, Christoph Treude, Andrian Marcus
Abstract: Automated text-retrieval-based bug localization (TRBL) techniques normally use the full text of a bug report to formulate a query and retrieve parts of the code that are buggy. Previous research has shown that reducing the size of the query increases the effectiveness of TRBL. On the other hand, researchers also found improvements when expanding the query (i.e., adding more terms). In this paper, we bring these two views together to reformulate queries for TRBL. Specifically, we improve discourse-based query reduction strategies, by adopting a combinatorial approach and using task phrases from bug reports, and combine them with a state-of-the-art query expansion technique, resulting in 970 query reformulation strategies. We investigate the benefits of these strategies for localizing buggy code elements and define a new approach, called QREX, based on the most effective strategy. We evaluated the reformulation strategies, including QREX, on 1,217 queries from different software systems to retrieve buggy code artifacts at three code granularities, using five state-of-the-art automated TRBL approaches. The results indicate that QREX increases TRBL effectiveness by 4% - 12.6%, compared to applying query reduction and expansion in isolation, and by 32.1%, compared to the no-reformulation baseline.
Rights: ©2021 IEEE
DOI: 10.1109/SANER50967.2021.00024
Published version: https://ieeexplore.ieee.org/xpl/conhome/9425868/proceeding
Appears in Collections:Aurora harvest 4
Computer Science publications

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.