Dies ist ein Archiv des alten Softwaretechnik Lehrstuhls der Universität des Saarlandes. Es ist nicht länger aktuell.


Preprocessing CVS Data for Fine-Grained Analysis
Thomas Zimmermann · Peter Weißgerber

Lehrstuhl für Softwaretechnik (Prof. Zeller)
Universität des Saarlandes – Informatik
Informatik Campus des Saarlandes
Campus E9 1 (CISPA)
66123 Saarbrücken
E-mail: zeller @ cs.uni-saarland.de
Telefon: +49 681 302-70970

Deutschsprachige Startseite Page d'acceuil en franšais English home page
   Thomas Zimmermann and Peter Weißgerber. Preprocessing CVS Data for Fine-Grained Analysis. Proc. 1st International Workshop on Mining Software Repositories (MSR), Edinburgh, UK, May 2004.
Preprocessing is a prerequisite for any analysis of CVS data.

Get the paper in PDF format (5 pages, 212k).


All analyses of version archives have one phase in common: the preprocessing of data. Preprocessing has a direct impact on the quality of the results returned by an analysis. In this paper we discuss four essential preprocessing tasks necessary for a fine-grained analysis of CVS archives:
  1. data extraction,
  2. transaction recovery,
  3. mapping of changes to fine-grained entities, and
  4. data cleaning.
We formalize the concept of sliding time windows and show how commit mails can relate revisions to transactions. We also present two approaches that map changes to the affected building blocks of a file, e.g. functions or sections.


  1. Introduction
  2. Data Extraction
  3. Restoring Transactions
    • Fixed Time Windows
    • Sliding Time Windows
  4. Mapping Changes to Entities
  5. Data Cleaning
    • Large Transactions
    • Merge Transactions
  6. Related Work
  7. Conclusion
  8. References


See Also...

Impressum Datenschutzerklärung

<webmaster@st.cs.uni-saarland.de> · http://www.st.cs.uni-saarland.de/papers/msr2004/ · Stand: 2018-04-05 13:41