A System Dynamics Model of Software Evolution (Paper)

A System Dynamics Model of Software Evolution
(revised version of charts 15 May 2001)

University of Sannio
3 May 2001
Juan F Ramil
(presenting a modelling work undertaken jointly with Dr Goel Kahen)
Department of Computing
Imperial College
180 Queen's Gate
London SW7 2BZ
tel. +44 (0) 20 7594 8216
fax. +44 (0) 20 7594 8215
ramil@doc.ic.ac.uk
http://www.doc.ic.ac.uk/~ramil
15 May 2001 (c) All rights reserved. Juan F. Ramil
-1-
89c.lect2[charts]
Objectives
To introduce the application of

system dynamics simulation models as a tool
to plan, manage and control software evolution processes
To provide an example of evolution process model developed in FEAST
To illustrate the top-down modelling approach followed in FEAST, which is based

on successive refinement starting with a simplified high-level process view
-2-
89c.lect2[charts]
Early Models of Evolution Dynamics
Study of OS/360 and other systems during late 60s and early 70s led to
- dynamic models of program growth such as:
Belady and Lehman 1972,75; Riordan 1977 and Woodside 1979
Such models replicated observed growth trends over time and releases
- suggested management guidelines
In late 80s and early 90s that dynamic modelling of software processes gains wider interest
but with focus on ab initio development
- triggered, at least in part, by Abdel-Hamids book on Software Project Dynamics
Not wide interest in dynamic modelling of long-term evolution, with a few exceptions
This contrasts with the fact that evolution - not ab initio development - consumes largest
portion of effort in software organisations
Process models developed in FEAST project (http://www.dse.doc.ic.ac.uk/~mml.feast)

address situation by focusing on long-term evolution
- models are generally consistent with observations in laws of software evolution
- reflect input from industrial collaborators
-3-
89c.lect2[charts]
Evolution: Closing the Loop

Application
Concept
Exogenous
Change
Application Domain
Predictive
Views
Operational
Program
Evolving
Understanding
and Structure
Program
Theories, Models,
Procedures, Laws
of Application and
System Domains
Computational
Procedures
and
Algorithms
Program
Definition
Requirements
Analysis
Installation, introduction into usage closes major feedback loop

Driver of continuing software system evolution
This loop is starting point for process modelling in FEAST

-4-
89c.lect2[charts]
Progressive and Anti-regressive Activity
Several classifications of maintenance and evolution activity have been

proposed over the years
- Focus has been on types such as fixing, adaptation, and enhancement
- See, for example, Chapin N, Hale JE, Khan KM, Ramil JF and Tan WG, Types of Software
Evolution and Software Maintenance, Journal of Software Maintenance and Evolution: Res. and
Practice, Volume 13, Issue # 1, January-February, 2001, pp. 1-30
Lehmans proposal (1974) includes 2 classes of activity (or effort)

Progressive activity - directed to achieve increase functionality, better performance and in
general to change the behaviour of the software as perceived by users
Anti-regressive activity - undertaken to control complexity and its growth and in principle
not altering properties of software as experience by users
Examples of anti-regressive work:

restructuring, rewriting, re-engineering, re-documentation, removing dead code or
duplicates, refactoring, moving to components and/or higher level languages, etc, etc.
Basis for a dynamic model exploring role of increasing complexity
-5-
89c.lect2[charts]
A Model of Lehmans 2nd law - Increasing Complexity
Complexity seen as anti-regressive work deficit - a process metrics, becomes operational

definition of software complexity
Some assumptions:
Total_effort (e.g. person-hours per month) = Progressive_effort + Anti_Regressive_effort
Anti-Regressive Effort = Total Effort * Anti_Regressive_Work_Fraction
0 <= Anti_Regressive_Work_Fraction <= 1.0
Insufficient anti-regressive work will lead to complexity increase

Excessive anti-regressive work will lead to resource waste
Optimum value for Anti_Regressive_Work_Fraction somewhere in-between
A system dynamic model to explore this issue follows
It is presented in a step-by-step fashion to demonstrate successive refinement
approach, see for example, Zurcher FW and Randell B, Iterative Multi-Level Modeling - A
Methodology for Computer System Design, IBM Res. Div. Rep. RC-1938, Nov. 19678. Also in Proc. IFIP
Congress 1968, Edinburgh, Aug 5 - 10, 1968, pp D-138 - 142
-6-
89c.lect2[charts]
Base Model
Work Waiting
Exogenous work requests
Implementation
Work
Implemented
-7-
89c.lect2[charts]
First Refinement
Work Waiting
Implementation
USER CHANGE
REACTION
MULTIPLIER
Endogenous work requests

Work
Implemented
USER CHANGE
REACTION
DELAY
<TIME
STEP>
Cumulative
Work
Released
Release
RELEASE TIMING
Endogenous work request =

delay3(USER CHANGE REACTION MULTIPLIER *Release,USER CHANGE
REACTION DELAY)
Vensim provides several functions, such as delay3, to simulate dynamic effects. See reference manual for details
This equation is an starting point to model impact of the release work in the usage domain. It awaits empirical
validation - in the simulation results presented later, the actual value of Work_Waiting has no impact
-8-
89c.lect2[charts]
Individual Productivity vs Team Size - A Proposed Formulation

Following Abdel-Hamid and Madnick:
Productivity = Potential_Productivity*Communication_Losses
Communication Losses
In a group of size n, number of communication channels is n(n-1)/2
Assuming that the loss per communication link is constant impact on software development rate of team
can be modelled, for example, by function that decreases as the size of the team increases:
Communication Losses in percent = {1-[k(n-1)n]/100}
~ [1- (k n2/100)]
where n is the team size (number of people in the team) and k is a suitable constant, obtained from
empirical data
Abdel-Hamid and Madnick, for example, have suggested that for a team of 30 people,
50 percent of the effort is subsumed by communication activity and hence unavailable for
software development - this observations leads to k = 0.06
-9-
89c.lect2[charts]
Productivity vs Team Size - A Proposed Formulation

Productivity = Potential Productivity*Communication_Losses
Potential Productivity
Many factors can be considered here - e.g. motivation, learning
We refer to Abdel-Hamid and Madnicks book for detailed discussion
Consider here only one aspect, the increase of individual productivity due to build-up of the
experience base as a team grows in size - we refer to it here as synergy
Assuming that the gain in potential productivity due to synergy
can be modelled, for example, by function that increases as the size of the team increases:
Synergy = {a + b(n/n+c)}
where n is the team size (number of people in the team) and a,b and c are suitably selected constants,
obtained from empirical data
Many factors can be considered here

- 10 -
89c.lect2[charts]
Individual Productivity vs Team Size - A Proposed Formulation
10
20
30
Team Size
40
Impact of Synergy on Productivity
20
Team Size
40
Productivity of an Individual
Team Member
Impact of Comunication Losses
Combined Effect
Potential Productivity
Productivity of an Individual Team
Member
Productivity of an Individual Team

Member
Communication Losses
Synergy and Communication Losses

Combined Impact
10
20
Team Size
30
40
Behaviour is qualitatively as expected - quantitative detail will certainly vary from project to
project and organisation to organisation
Of course, one will need to be calibrate behaviour to real data - for the moment, an starting
point
- 11 -
89c.lect2[charts]
Third Refinement
Work Waiting
Implementation

USER CHANGE
REACTION
MULTIPLIER
Work
Implemented
USER CHANGE
REACTION
DELAY
Productivity
Progressive effort
<TIME
STEP>
Cumulative
Work
Released
TEAM SIZE
NOMINAL
PRODUCTIVITY
Release
RELEASE TIMING
Productivity =
NOMINAL PRODUCTIVITY * (0.67+(0.7*TEAM SIZE/ (TEAM SIZE + 0.7))) *

max(0,1-(0.0006*TEAM SIZE*(TEAM SIZE-1)))
Implementation = Progressive Effort * Productivity

Progressive Effort = TEAM SIZE
- 12 -
89c.lect2[charts]
Productivity vs Anti-Regressive Deficit - A Proposed Formulation

Anti_regressive_deficit = Cumulative_Work_Implemented - Cumulative_Anti_regressive_Activity
What is the relationship between Productivity_loss and Anti_regressive_deficit?

Impact_on_Productivity
One may expect that productivity loss

increases as Anti_regressive_deficit
grows
Look Up Table
If Anti_regressive_deficit < 0 then

Productivity_loss = 0
otherwise
Productivity_loss = f(Anti_regressive_deficit)
Loss in Productivity (perc.)
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
f() defined using Vensim Look Up Table feature
0
0
1000
2000
3000
Anti Regressive Deficit
Of course, one would need to be calibrated to real data - for the moment, an starting point
- 13 -
89c.lect2[charts]
Fourth Refinement
Work Waiting
Cumulative Work
Implemented
Implementation
Anti Regressive Deficit
USER CHANGE
REACTION
MULTIPLIER
TEAM SIZE
Work
Implemented
USER CHANGE
REACTION
DELAY
NOMINAL
PRODUCTIVITY
Release
ANTI REGRESSIVE
WORK FRACTION
RELEASE TIMING
Productivity Loss =
f(Anti Regressive
Deficit)
Productivity Loss
Anti regressive effort
<TIME
STEP>
Cumulative
Work
Released
Productivity
Progressive effort
Cumulative Anti
regressive Activity
Anti regressive work
Productivity = NOMINAL PRODUCTIVITY *

(0.67+(0.7*TEAM SIZE/ (TEAM SIZE + 0.7))) * MAX(0,1-(0.0006*TEAM SIZE*(TEAM SIZE-1)))
*Productivity Loss
Progressive Effort = (TEAM SIZE-Anti regressive effort)
Anti regressive effort = TEAM SIZE*ANTI REGRESSIVE WORK FRACTION
- 14 -
89c.lect2[charts]
Example of Model Output

Exploring consequences of various values Anti_Regressive_Work_Fraction (arwf)
Rate of functionality released to users

- a possible indicator of evolution success
- represented in model by Cumulative Fielded Functionality CMF
Models enables to study effects of different allocation of resources, e.g.

arwf 0 pc: no resources allocated to anti regressive work
arwf 40 pc: 40 percent of resources allocated to anti regressive work
arwf 60 pc: 60 percent of resources allocated to anti regressive work
- 15 -
89c.lect2[charts]
An Example of Models Output (cont.)

Anti_Regressive_Work_Fraction (arwf) = 40 percent provides largest growth after 200 months
Anti_Regressive_Work_Fraction (arwf) = 0 percent maximises short-term behaviour
Graph for Cumulative Work Released
6,000
4,500
3,000
1,500
0
0
20
40
60
80
100
120
Time (Month)
Cumulative Work Released : arwf 0 pc

140
160
180
200
Cu mulative Module Changes

Simulation results support the convenience of an appropriate level of antiregressive activity in a software process
Next step, model refinement to reflect higher level of process detail
- 16 -
89c.lect2[charts]
Further extended and refined model

ACCEPTED
TARGET
PREPARATION
PRODUCTIVITY FACTOR
PREPARATION EFFORT
MULTIPLIER
Acceptance Flow
Work
Accepted
Preparation
flow
Work Prepared for

Implementation
Submision Flow
IN PROGRESS
TARGET
Work Identified
Preparation effort
In Progress
Other additions and
changes identified
Demand obsolescense
Progressive effort
NORMAL
PRODUCTIVITY
Change plus defect

discovery factor
Productivity
Implementation
flow
SYSTEM TYPE
MULTIPLIER
Requirement
Change flow
TEAM SIZE
VALIDATION AND
INTEGRATION
EFFORT
MULTIPLIER
Cumulative
Fielded
Functionality
RELEASE
POLICY
F
PROGRESSIVE
FRACTION
INTEGRATION
PRODUCTIVITY
FACTOR
TIME STEP
Validation and Integration effort
<Time>
Fielded Functionality
Satisfying Current
Needs
Software
release
Work Ready
to Release
Work Implemented
Integration flow
- 17 -
89c.lect2[charts]
Further refinement
ACCEPTED
TARGET
PREPARATION
PRODUCTIVITY FACTOR
PREPARATION EFFORT
MULTIPLIER
Acceptance Flow
Submision Flow
Work Prepared for

Implementation
Work
Accepted
Preparation
Flow
IN PROGRESS
TARGET
Work Identified
Preparation effort
In Progress
Additions and
Changes Identified
by Others
Cumulative
Progressive
Work
Demand obsolescense
Progressive effort
NORMAL
PRODUCTIVITY
Change plus defect

discovery factor
F
PROGRESSIVE
FRACTION
Productivity
Implemented
SYSTEM TYPE
MULTIPLIER
Requirement
Change flow
TEAM SIZE
Cumulative Anti
Regressive Work
INTEGRATION
SUCCESS
FACTOR
Cumulative
Fielded
Functionality
Rejected as
needing
rework
Anti regressive work

INTEGRATION
PRODUCTIVITY
FACTOR
RELEASE
POLICY
VALIDATION AND
INTEGRATION
EFFORT
MULTIPLIER
Anti regressive effort
Validation and Integration effort
TIME STEP
<Time>
Fielded Functionality
Satisfying Current
Needs
Software
release
Work Ready
to Release
Work Implemented
Successfully
integrated
- 18 -
89c.lect2[charts]
Some Modelling Tips
Avoid, if and when possible, construction of very large models

- large models involve risk of getting out of intellectual control
One needs externally imposed discipline

- identify sub-systems in the model
- refine step-by-step as more detail needs to be reflected in model
- achieve model realism with the smallest possible number of variables and relationships
Quantification of soft factors - not straight forward

- examples of soft factors: work pressure, motivation, knowledge, expertise
- look for what has change or is likely to change - constant aspects can be in general
factored out from the model
Warning - Large differences in quantitative detail may be expected from process to process
Thus, a model must be calibrated to real data reflecting a particular software evolution
process before its use as policy assessment tool
Start data collection as soon as possible, making use of cheap, available records, first
- data collection is expensive and must be well justified
- modelling unprecedented situations and/or without reference modes is difficult
- 19 -
89c.lect2[charts]
Simulation, Process Improvement and Process Maturity Level
Process maturity
- higher maturity presumes higher degree of repeatability, improved performance
Example: CMM - Capability Maturity Model, developed at SEI, CMU

- considers level 1 - less mature- to level 5 - most mature processes
Christie argues that simulation can play a role at any level

For further details see:
Christie A, Simulation in Support of CMM-based Process Improvement, Journal of Systems and

Software, Vol. 46, Nos. 2/3, 1999, pp 107 - 112
- 20 -
89c.lect2[charts]
Role of Process Modelling - a Caveat
According to Tom DeMarco - in a recent presentation at Imperial College - these are some
of the skills that a manager in a software organisation requires -note that most of them
are people skills, such as
- hiring
- thanksgiving
- praising
- conflict resolution
- team work - in particular, works as a team with other managers
- product and process management - e.g. simulation techniques
Simulation models, metrics, estimation techniques offer only a tool

- Barry Boehm recommends to rely on more than one model or tool
- Our view is that use of process models should nor lead to a mechanistic view of
process neither diminish human role, but understand implications of policies,
mitigate risks
- 21 -
89c.lect2[charts]

A System Dynamics Model of Software Evolution (Paper)

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

A System Dynamics Model of Software Evolution (Paper)

Uploaded by

Copyright:

Available Formats

A System Dynamics Model of Software Evolution

(revised version of charts 15 May 2001)

To introduce the application of

To provide an example of evolution process model developed in FEAST

To illustrate the top-down modelling approach followed in FEAST, which is based

15 May 2001 (c) All rights reserved. Juan F. Ramil

Early Models of Evolution Dynamics

Process models developed in FEAST project (http://www.dse.doc.ic.ac.uk/~mml.feast)

Evolution: Closing the Loop

Installation, introduction into usage closes major feedback loop

This loop is starting point for process modelling in FEAST

Progressive and Anti-regressive Activity

Several classifications of maintenance and evolution activity have been

Lehmans proposal (1974) includes 2 classes of activity (or effort)

Examples of anti-regressive work:

Basis for a dynamic model exploring role of increasing complexity

15 May 2001 (c) All rights reserved. Juan F. Ramil

A Model of Lehmans 2nd law - Increasing Complexity

Complexity seen as anti-regressive work deficit - a process metrics, becomes operational

Insufficient anti-regressive work will lead to complexity increase

15 May 2001 (c) All rights reserved. Juan F. Ramil

Endogenous work requests

Endogenous work request =

Individual Productivity vs Team Size - A Proposed Formulation

Productivity vs Team Size - A Proposed Formulation

Many factors can be considered here

Individual Productivity vs Team Size - A Proposed Formulation

Impact of Synergy on Productivity

Impact of Comunication Losses

Productivity of an Individual Team

Synergy and Communication Losses

15 May 2001 (c) All rights reserved. Juan F. Ramil

Exogenous work requests

Endogenous work requests

NOMINAL PRODUCTIVITY * (0.67+(0.7*TEAM SIZE/ (TEAM SIZE + 0.7))) *

Implementation = Progressive Effort * Productivity

Productivity vs Anti-Regressive Deficit - A Proposed Formulation

What is the relationship between Productivity_loss and Anti_regressive_deficit?

One may expect that productivity loss

If Anti_regressive_deficit < 0 then

Loss in Productivity (perc.)

f() defined using Vensim Look Up Table feature

Anti Regressive Deficit

Anti Regressive Deficit

Anti regressive effort

Endogenous work requests

Anti regressive work

Productivity = NOMINAL PRODUCTIVITY *

Example of Model Output

Rate of functionality released to users

Models enables to study effects of different allocation of resources, e.g.

15 May 2001 (c) All rights reserved. Juan F. Ramil

An Example of Models Output (cont.)

Cumulative Work Released : arwf 0 pc

Cu mulative Module Changes

Further extended and refined model

Work Prepared for

Change plus defect

Validation and Integration effort

15 May 2001 (c) All rights reserved. Juan F. Ramil

Work Prepared for

Change plus defect

NOMINAL PRODUCTIVITY * (0.67+(0.7TEAM SIZE/ (TEAM SIZE + 0.7)))