The Distribution of Faults in a Large Industrial Software System - PowerPoint PPT Presentation

1 / 22
About This Presentation
Title:

The Distribution of Faults in a Large Industrial Software System

Description:

3. Persistence of faults (stages and releases) 4. New files VS old files. 9/10/09. CISC879-UDEL ... Old/New. Fault Persistence(releases) No trend at all. Module ... – PowerPoint PPT presentation

Number of Views:32
Avg rating:3.0/5.0
Slides: 23
Provided by: eecis
Category:

less

Transcript and Presenter's Notes

Title: The Distribution of Faults in a Large Industrial Software System


1
The Distribution of Faults in a Large Industrial
Software System
  • Thomas J. Ostrand
  • ostrand_at_research.att.com
  • ATT Labs - Research
  • Elaine J. Weyuker
  • weyuker_at_research.att.com
  • ATT Labs - Research

2
Introduction
  • What is the Problem?
  • Identify the characteristics of files that can be
    used as predictors of fault-proneness
  • What is Motivation?
  • Where to focus limited test resource
  • What is the subject program?
  • Case study of a large industrial inventory system

3
The questions
  • 1. Distribution of faults over different files
    (release, lifecycle stage, severity)
  • 2. Density of faults among different files
  • 3. Persistence of faults (stages and releases)
  • 4. New files VS old files

4
Topics of Discussion
  • Approach
  • Common Belief
  • Their Experimental Study
  • Conclusion
  • Related Work
  • Limitations Future Work

5
Approach
  • Statistical Analysis Investigate data collected
    from an inventory system
  • Related Work Compared with Work of Fenton
    Ohlsson
  • Thirteen Releases VS two Releases
  • Nine phases VS four phases
  • Severity VS none
  • Basic Component File (1974 files, 500000 lines
    of code, Removed non-code files from study)

6
Pareto Distribution of Faults(1-1)
  • Common belief Pareto Distribution of fault
  • What is a Pareto Distribution?
  • Experimental Work
  • By Release
  • By Stage
  • By Severity

7
Pareto Distribution of Faults(1-2)
  • Fault Concentration By Release
  • Supportive table
  • Conclusion concentrate on small number of fault
    prone files

8
Pareto Distribution of Faults(1-3)
9
Pareto Distribution of Faults(1-4)
  • Fault Concentration By Stage
  • Early-pre-release
  • Late-pre-release
  • Post-release
  • Conclusion

10
Pareto Distribution of Faults(1-5)
11
Pareto Distribution of Faults(1-6)
  • Fault Concentration By Severity
  • Severity 1 faults and Severity 4 faults accounted
    for only 4 of the faults
  • With remaining 15 being Severity 2 faults
  • Severity 3 faults accounted for 81 of the faults
  • Conclusion

12
Effect of Module Size (2-1)
  • Common belief large modules are much more
    fault-prone than small ones?
  • Earlier empirical studies contrary
  • Experimental Work
  • Hatton
  • Fenton and Ohlsson
  • Limitations Future Work

13
Effect of Module Size (2-2)
14
Persistence of Faults (3-1)
  • Common belief
  • Files with high concentration of faults detected
    during pre-release also tend to have high
    concentration of faults detected during
    post-release.
  • Faultiness persists between releases.
  • Application identify files that are unusually
    fault-prone and focus test resources on them.

15
Persistence of Faults (3-2)
  • Experimental work
  • Fault persistence between stages
    Late-pre-release and post-release
  • Fault persistence between releases (13)
  • Results
  • 72-94 of pre-release faults with no
    post-release faults
  • 100 of post-release faults in files with 6 to
    28 pre-release faults

16
Persistence of Faults (3-3)
  • In contrast with common belief
  • Each release has 584 to 1772 -gt 20 faults
  • Conclusion Not enough data

17
Persistence of Faults (3-4)
  • Related work Fenton and Ohlsson
  • Two successive releases pre-release and
    post-release
  • Many of the post-release faults occur in modules
    with no pre-release faults
  • 100 of post-release faults in modules with 7
    or 23 pre-release faults
  • Conclusion
  • In contrast with common belief

18
Persistence of Faults (3-5)
Persistence of High-Fault Files
  • Conclusion Supports common belief

19
Old Files, New Files (4-1)
  • Common belief
  • New files have more faults
  • Experimental work
  • Compute the percentage of faulty new files and
    the percentage of faulty pre-existing files

20
Old Files, New Files (4-2)
  • Conclusion allocate more resources for testing
    new files than pre-existing ones

21
Discussion
22
Discussion Questions
  • 1. The evaluation of their work is certainly
    valuable to internal use, but how meaningful
    would it be to the external use?
  • 2. Would it be more interesting or more important
    if they elaborate their methodology rather than
    just generalizing the result?
Write a Comment
User Comments (0)
About PowerShow.com