Mining Your Own Evidence
by Kim Herzig, Andreas Zeller

Andy Oram, Greg Wilson (Ed.), Making Software, Chapter 27, O'Reilly Media, Inc., October 2010.

ISBN: 9780596808327

See also

More information is available at


Throughout this book, you will find examples of how to gather evidence - evidence on the effectiveness of testing, the quality of bug reports, the role of complexity metrics, and so on. But do these findings actually apply to your project? The definite way to find this out is to repeat the appropriate study on your data, in your environment. This way, you will not only gather lots of insight into your own project; you will also experience the joys of experimental research. Unfortunately, you may also encounter the downside: empirical studies can be very expensive, in particular if they involve experiments with developers.
Fortunately, there is a relatively inexpensive way to gather lots of evidence about your project. Software archives, such as version or bug repositories, record much of the activity around your product, in terms of problems occurring, changes made, and problems fixed. By mining these archives automatically, you can obtain lots of initial evidence about your product - evidence that already is worthy in itself, but which may also pave the path toward further experiments and further insights. In this chapter, we give a hands-on tutorial into mining software archives, covering both the basic technical steps and possible pitfalls that you may encounter on the way.

BibTeX Entry

    title = "Mining Your Own Evidence",
    author = "Kim Herzig and Andreas Zeller",
    year = "2010",
    month = oct,
    booktitle = "Making Software",
    chapter = "27",
    editors = "Andy Oram and Greg Wilson",
    publisher = "O'Reilly Media, Inc.",
    ISBN = "9780596808327",

Show all publications of the Software Engineering Chair.