Developers’ perception matters: machine learning to detect developer-sensitive smells

Oliveira, Daniel; Assunção, Wesley K. G.; Garcia, Alessandro; Fonseca, Baldoino; Ribeiro, Márcio

doi:10.1007/s10664-022-10234-2

by Daniel Oliveira, Wesley K. G. Assunção, Alessandro Garcia, Baldoino Fonseca, Márcio Ribeiro

Abstract:

Code smells are symptoms of poor design that hamper software evolution and maintenance. Hence, code smells should be detected as early as possible to avoid software quality degradation. However, the notion of whether a design and/or implementation choice is smelly is subjective, varying for different projects and developers. In practice, developers may have different perceptions about the presence (or not) of a smell, which we call developer-sensitive smell detection. Although Machine Learning (ML) techniques are promising to detect smells, there is little knowledge regarding the accuracy of these techniques to detect developer-sensitive smells. Besides, companies may change developers frequently, and the models should adapt quickly to the preferences of new developers, i.e., using few training instances. Based on that, we present an investigation of the behavior of ML techniques in detecting developer-sensitive smells. We evaluated seven popular ML techniques based on their accuracy and efficiency for identifying 10 smell types according to individual perceptions of 63 developers, with some divergent agreement on the presence of smells. The results showed that five out of seven techniques had statistically similar behavior, being able to properly detect smells. However, the accuracy of all ML techniques was affected by developers’ opinion agreement and smell types. We also observed that the detection rules generated for developers individually have more metrics than in related studies. We can conclude that code smells detection tools should consider the individual perception of each developer to reach higher accuracy. However, untrained developers or developers with high disagreement can introduce bias in the smell detection, which can be risky for overall software quality. Moreover, our findings shed light on improving the state of the art and practice for the detection of code smells, contributing to multiple stakeholders.

View PDF

Reference:

Daniel Oliveira, Wesley K. G. Assunção, Alessandro Garcia, Baldoino Fonseca, Márcio Ribeiro, "Developers’ perception matters: machine learning to detect developer-sensitive smells", In: Empirical Software Engineering, Springer Science and Business Media LLC, vol. 27, no. 7, 2022.

Bibtex Entry:

@Article{Oliveira2022,
  author    = {Daniel Oliveira and Wesley K. G. Assunção and Alessandro Garcia and Baldoino Fonseca and Márcio Ribeiro},
  journal   = {Empirical Software Engineering},
  title     = {Developers’ perception matters: machine learning to detect developer-sensitive smells},
  year      = {2022},
  issn      = {1573-7616},
  month     = oct,
  number    = {7},
  volume    = {27},
  abstract  = {Code smells are symptoms of poor design that hamper software evolution and maintenance. Hence, code smells should be detected as early as possible to avoid software quality degradation. However, the notion of whether a design and/or implementation choice is smelly is subjective, varying for different projects and developers. In practice, developers may have different perceptions about the presence (or not) of a smell, which we call developer-sensitive smell detection. Although Machine Learning (ML) techniques are promising to detect smells, there is little knowledge regarding the accuracy of these techniques to detect developer-sensitive smells. Besides, companies may change developers frequently, and the models should adapt quickly to the preferences of new developers, i.e., using few training instances. Based on that, we present an investigation of the behavior of ML techniques in detecting developer-sensitive smells. We evaluated seven popular ML techniques based on their accuracy and efficiency for identifying 10 smell types according to individual perceptions of 63 developers, with some divergent agreement on the presence of smells. The results showed that five out of seven techniques had statistically similar behavior, being able to properly detect smells. However, the accuracy of all ML techniques was affected by developers’ opinion agreement and smell types. We also observed that the detection rules generated for developers individually have more metrics than in related studies. We can conclude that code smells detection tools should consider the individual perception of each developer to reach higher accuracy. However, untrained developers or developers with high disagreement can introduce bias in the smell detection, which can be risky for overall software quality. Moreover, our findings shed light on improving the state of the art and practice for the detection of code smells, contributing to multiple stakeholders.},
  doi       = {10.1007/s10664-022-10234-2},
  publisher = {Springer Science and Business Media LLC},
  url       = {https://link.springer.com/article/10.1007/s10664-022-10234-2},
}