Lessons Learned Validating 60,000 Pages of Api Documentation

LESSONS LEARNED
VALIDATING 60,000 PAGES OF

API DOCUMENTATION
Robert V. Binder
ACM Chicago Chapter Meeting
Chicago May 8, 2013
Copyright Robert V. Binder, 2013. All rights reserved.
Overview
Background
Microsoft Protocol QA Process
Scope and approach
Requirements engineering
Model-based testing
Non-Microsoft Applications
Q&A
The evil Dogbert mocks

Dilbert as Boron, the
most boring man in the
universe
BACKGROUND
What is a Protocol?
Rules for interaction between
(among) endpoints using
messages
Data
Content and format
Behavior
Request/Response
Acceptable sequences
Layers, Protocols, Stacks

Layer: level of abstraction
BING
XMP
Each layer is a protocol
SOAP
Stack of layers
L L-1 ok
XML
L L m NOT ok
Layer uses other protocols

HTTP over TCP or RPC
HTTP
TCP
IP over WiFi or LAN
IP
Protocol may define own data or use standard
format (XML)
UDP
802.11
802.2
Protocols Everywhere
Cellular: CDMA, GSM, SMS, MMS, WAP
Network: 802.11 (WiFi), 802.16 (WiMax)
Wireless: Bluetooth, Zigbee, ANT, ISO 14443

Routing: OSPF, IGP, RIP, CIDR, BGP
RFCs: HTTP, FTP, SOAP, TCP, IP, IPv4, IPv6 (1000s)
Corba: GIOP, IIOP, ESIOP, RMI, IDL
FIX (Financial Information eXchange)
Amazon API, REST, BING API, Netflix API, Google
Protocol Buffers,
SCOPE AND PROCESS
10
Publish or Perish
US Federal Court and EU order
Consent Decree
Microsoft to publish server side API
documentation
Goal: interoperability for third
parties
Hard milestones/deadlines imposed
by Federal Judge
Microsoft Open Specification
Initiative
11
Cast of Thousands
MSFT project management
100s of senior MSFT developers
wrote/revised TDs
TD publication staff
More than 350 test suite devs
(mostly in Hyderabad & Beijing)

~20 Independent Reviewers (5
System Verification Assoc.)
Process Architects (MSFT &
System Verification Assoc.)

MSFT Netmon and other tool
developers
MSFT Plugfest team
Technical Committee chartered
by court, with over 50 FTE

reviewers of published TDs
12
What is a Microsoft Protocol?

Remote API for a service
All product groups

Windows Server
Office
Exchange
SQL Server
Others
500+ protocols
Remote Desktop
Active Directory
File System
Security
Many others
13
Microsoft Technical Document (TD)

Publish protocols as Technical Documents
One TD for each protocol
Similar to RFC
Must strictly follow template
Black-box spec no internals
All data and behavior specified with text
OS version differences endnotes
14
All Technical Docs Public

http://msdn.microsoft.com/library/jj712081
15
Challenges
Validation of documentation, not as-built implementation
Is each TD well-formed?
Follows TD standards
Consistency, correctness, completeness
Is each TD all a third party needs to develop:
A client that interoperates with an existing service?
A service that interoperates with existing clients?
Only use over-the-wire messages
16
Test-Model Driven Protocol Verification

Technical Document
Analysis
Data and
behavior
statements
Approximates third party implementation
Validates consistency with actual Windows
implementation
WS 2008
WS 2003
WS 2000
Model assertions generate and

check response of actual
Windows Services
Requirements
Specification
Modeling
Model-based
Test Suite
Test Execution
Stobie et al, 2010 Microsoft. Adapted with permission.
17
Protocol Quality Assurance Process

TD v1
TD v2
TD vn
Authors
Study
Scrutinize
TD
Define Test
Strategy
Test
Suite
Developers
Review
TD ready?
Strategy?
Reviewers
Plan
Complete
Test Rqmts
High Level
Test Plan
Design
Final
Complete
Model
Complete
Adapters
Gen & Run

Test Suite
Prep User
Doc
Review
Review
Review
Test Rqmts?
Config ?
Plan ?
Model?
Adapters?
Coverage?
Test Code?
18
Results
Published 500+ TDs
60,000+ pages
50,000+ Technical Document Issues

Most identified before tests run
Many Plugfests, many 3rd party users

Released high interest test suites as open source
US DoJ case closed June 12, 2011
TEST REQUIREMENTS
ENGINEERING
20
TD Statements
Data Statement
2.2.1.2.3 DHCP_OPTION_ID
Behavior Statement
3.1.4.16 R_DhcpRemoveOptionValue
a unique integer that identifies the option

defined for a user class and a vendor class.
The option ID range for DHCPv4 options is
1 to 255, while the option ID range for
DHCPv6 options is 0 to 65536.
When processing this call, the DHCP server

MUST do the following If the
DHCPv4ClassedOptValue corresponding to
the OptionID parameter is not present, then
return ERROR_OPTION.<24> Otherwise,
Endnote
<24> Windows XP and Windows Server 2003 DHCP clients request only option code
249 in the Parameter Request List.
[MS-DHCPM] Microsoft Dynamic Host Configuration Protocol (DHCP)
21
TD Statements to TD Test Requirements

Req ID
Doc Sect
Description
Inform
Pos Neg Derived /Norm Verification
TSCH
_R142
2.4.1
The client MUST set the File Version (2bytes, it contains the Version of the .JOB file format)
field of the FIXDLEN_DATA structure to 0x0001.
R110 R113
2
1
TSCH
_R145
2.4.1
The server MUST ignore the value in the App Name Len Offset field of the FIXDLEN_DATA
structure.
2.4.1
The Trigger Offset (2 bytes) field of the FIXDLEN_DATA structure MUST contain the offset in
bytes within the .JOB file where the task triggers are located.
TSCH
_R1332
3.2.5.4.6
Upon receipt of the SchRpcGetSecurity call, the server MUST return S_OK on success.
Norm
Test Case
TSCH
_R1333
3.2.5.4.7
The SchRpcEnumFolders method MUST retrieve a list of folders on the server.
Norm
Adapter
TSCH
_R146
TSCH 3.2.5.4.7
_R1350
[Upon receipt of the SchRpcEnumFolders call, the server

MUST ] Return [the value 0x80070003] the HRESULT version
of the Win32 error ERROR_PATH_NOT_FOUND, if the path
argument does not name a folder in the XML task store, or if
the caller does not have either read or write access to that
folder.
Norm
Norm
R110 R113
2
1
Non-testable
Norm
Norm Test Case
Document Debugging
Bug
Template, MUST-SHOULD-MAY
Ambiguous, unclear, inconsistent
Missing or incorrect
SUT response inconsistent
Fix
TDI Author rewrite

TDI Author rewrite
TDI Author rewrite
TDI Code bug and/or rewrite
Implicit antecedent
Cause or effect too broad
No effect for corrupt/missing
Add antecedent []
Add derived, narrow domain
Add derived, negative effect
cause
Unobservable or uncontrollable
Infeasible
negative effect
Add derived, observable effect
Add derived, narrowed scope
23
Document Debugging
Every TD statement analyzed
Scrutinize
Categorize
Make context explicit
Trace dependencies
Assess testability
Allocate
24
Scrutinize
Ambiguous phrasing
Misuse of MUST, SHOULD, MAY
Inconsistent
Unclear
TD template violations
Write bug report for author correction
25
Categorize
Normative or Informative?
Like code comments (informative)?
Conceptually, cells are numbered in a dataset as if the

dataset were a p-dimensional array, where p is the number
of axes.
Or like code (normative)?
SVR_RESP (1 byte): A single byte whose value MUST be

0x05.
If removed, would that prevent 3rd party interop?
No modeling/testing for informative
26
Make Context Explicit

Add implicit antecedents
Use [ ] to indicate addition
Preserves meaning in code, test results, log files
Original TD
statements
If the computeByClause is present, one group is created for each unique

combination of values in the column or columns specified in the
computeByClause. Otherwise, all rows of the child RecordSet are treated as a
single group.
Test
Requirement 1
If the computeByClause is present, one group is created for each unique

combination of values in the column or columns specified in the
computeByClause.
Test
Requirement 2
Otherwise [if the computeByClause is not present], all rows of the child
RecordSet are treated as a single group [in the computeByClause.]
27
Trace Dependencies
Is there a stated observable effect:
For every cause?
When a cause is missing or corrupted?
Record analysis with linked requirements
Req ID
Description
R100
Actions: This part MUST be present and MUST

specify the action to be performed once the task is
started.
R110
The server MUST execute multiple actions

sequentially, in the order specified in the Actions field.
Pos
R110
Neg
???
Derived
Verification
Test Case
Test Case
Stobie et al, 2010 Microsoft. Adapted with
permission.
28
Assess Testability
A test requirement is testable if:
Sufficient to generate and/or evaluate in code
Observable over-the-wire
Non-testable if:
Unobservable
Uncontrollable
Infeasible
Excessive cost to develop test
29
Assess Testability
Unobservable or uncontrollable
All the structures MUST begin on 8-byte boundaries, although the
data that is contained within the structure need not be aligned to 8byte boundaries.
Cant detect mis-alignment at test endpoint
After close, the server MUST release all resources.

No way to check using protocol
Infeasible
The server MUST return a unique ID.
No way to conclusively determine uniqueness
30
Assess Testability
What to do about non-
Add derived test requirement
testable statements?
Punt?
Interpretation unpredictable
(testers and users)
Lowers coverage
Skip?
Taints credibility
lowers coverage
Rewrite non-testable
Strictly limited revision or
elaboration
Significant requirements
engineering innovation
31
Derived Test Requirements

Req ID
Description
R42
MUST accept any positive number
R1042
MUST accept 1024
R39
Ignored by the server on receipt
R1039
Reply is the same whether 0 or nonzero is used for Field
Derived
42:c
Verification
Comments
Non-testable
Infeasible
Test Case
Non-testable
39:p
Case: only an instance of a domain

Partial: Observable subset
Inferred: Indirect result of several causes
Test Case
Server internal
behavior
32
Fully Elaborated Test Requirements

Req ID
Description
Pos
Neg
R110
R1100
Derived
Verification
R100
Actions: This part MUST be present and MUST

specify the action to be performed once the task is
started.
R110
The server MUST execute multiple actions

sequentially, in the order specified in the Actions field.
Test Case
R113
pErrorInfo: If this parameter is non-NULL and the

XML task definition is invalid, the server MUST return
additional error information.
Test Case
R114
0x8004131A SCHED_E_MISSINGNODE: The task

XML is missing a required element or attribute.
Test Case
R1100
If Action is missing, SCHED_E_MISSINGNODE is

returned in pErrorInfo
Test Case
R100,
R113:i,
R114:i
Test Case
33
Allocate Test Requirements

To Test Case
Develop model contract and/or test code
Generate the condition and send a message
Evaluate response, pass or fail
To Adapter
Basic data structure and format checked as side-effect
Netmon parsing
Transport layer marshaling
MODEL-BASED TESTING
35
Model-Based Testing
Develop
Test Requirements
Missing, incorrect
Ambiguous, missing,
contradictory, incorrect,
obscured, incomplete
Run
Test Model
Model error, omission
Generate
Test Suite
Inputs
(Test Sequences)
Evaluate
Expected Outputs (Test

Oracle)
Control
Observe
SUT
Bug
Coverage
Requirements
Model
Code
36
Why Model-based Testing?

Effectiveness
Scope
Automate generation of huge number of tests
Mitigate brittleness/breakage risk
Highly structured behavior well-suited to modeling
Easier to assess model than huge test suite
Consistent and automatic transition coverage versus
arbitrary or ad hoc strategies
37
Spec Explorer
Model-based testing tool
Developed at Microsoft Research
Productized after extensive use
Visual Studio Power Tool
Development UI
Generates standalone executable test suite

Tests for any service or API not limited to Microsoft
38
Spec Explorer Personality

Entire model in C# - no pictures/UML
Inline calls to Spec Explorer framework
Include any programmable function or Dot Net capability
Bottom up modeling
Aggregate model synthesized
State machine slicing defines scenarios
Coverage strategy
Constraint Solver
All transitions of the explored model/scenario, short or long path
Combinational selection of transition parameter values
39
Spec Explorer Model Program

static class Model {
public enum TimerMode { Reset, Running, Stopped }
static bool displayTimer = false;
static TimerMode timerMode = TimerMode.Reset;
static bool timerFrozen = false;
[Rule]
static void StartStopButton() {
Condition.IsTrue(displayTimer);
if (timerMode == TimerMode.Running){
timerMode = TimerMode.Stopped;
timerFrozen = false;
}
else
timerMode = TimerMode.Running;
}
[Rule] static void ModeButton(){ ... }
[Rule] static void ResetLapButton(){ ... }
[AcceptingState] static bool IsTimerReset(){ ... }
...
}
Model state drives

exploration
[Rule] identifies
behaviors to be explored
Precondition rest of body
executed only if true
Body: update model state,
log, generate expected
result
Identifies goal state
stops exploration
40
Model Exploration
Scenarios and slices
Any subset of all [Rule]
methods
Uses reg-exp like syntax
Represent use cases
Data-driven slice
Manages state explosion
problem
Explore
Constraint solver finds
feasible paths using
initial data values and
symbolic execution
Supports iterative model
development
41
SEs Visualization of an Exploration

Rule enabled: SUT
message + feasible
data binding
Expect a response
from the SUT
Updated model
state
SE reached an
Accepting State
Each root to leaf path: test case(s)

Explores feasible pre/post bindings
User limits depth and action set
Generate code for each test case
Stobie et al, 2010 Microsoft. Used with permission.
42
Traceability in Model Programs

Helper method called
static void CaptureRequirement (User caller) {
if (!caller.isAdmin)
when Rule selected
{
RequiresCapture(1087, "In response to NetrJobGetInfo request the " +
"server MUST Return ERROR_ACCESS_DENIED if the caller does not have " +
"administrative privileges on the server.");
RequiresCapture(1091, "In response to NetrJobGetInfo request, the " +
"server MUST use Windows Error Codes as specified in [MS-ERREF].");
return TschErrorCode.ERROR_ACCESS_DENIED;
Requirement Id
}
Requirement text
else
hardcoded to make
{
//This action returns success if caller has admin privilege and
//The requested job exists in the job list
code clear and
if (atsvcJobsStore.ContainsKey(jobId))
logs readable
{
RequiresCapture(1025, "If the server implements the ATSvc " +
"interface, it MUST implement the NetrJobGetInfo (Opnum 3) method.");
RequiresCapture(1785, "NetrJobGetInfo method MUST have " +
"administrator privileges.");
return TschErrorCode.ERROR_SUCCESS;
Stobie et al, 2010 Microsoft. Used with permission.
}
}
43
Typical Test Configuration

Test Suite
Adapter
Adapter
Netmon
Transport
Transport
Client OS
Endpoint
Under Test
Transport
Transport
Server OS
44
Netmon Capture
Stobie et al, 2010

Microsoft. Adapted
with permission.
20540 TSAP TSAP:TestCase Name=....Test_ITask_RegisterFlagsS8, Message= MS-TSCH_R1215

20544 TSCH TSCH:ITaskSchedulerService::SchRpcDelete Response, ReturnValue=1 vstesthost.exe
20545 TSCH TSCH:ITaskSchedulerService::SchRpcGetTaskInfo Request, Path=CH\1223330325 Flags=0 (0x0)
20546 TSCH TSCH:ITaskSchedulerService::SchRpcGetTaskInfo Response, Enabled=0 State=0 ReturnValue=1
20550 TSAP TSAP:TestCase Name=....Test_ITask_RegisterFlagsS8, Message=Assert.IsTrue succeeded. The
SchRpcDelete method MUST delete a task in the task store.
20552 TSAP TSAP:TestCase Name=....Test_ITask_RegisterFlagsS8, Message=Assert.IsTrue succeeded. Upon receipt
of the SchRpcDelete call the server MUST delete the task from the XML task store.
Netmon free download

http://www.microsoft.com/en-us/download/details.aspx?id=4865
45
Complete Traceability
Technical Document
Requirements Spec
Model
Test Suite
Logs
Network Captures
Stobie et al, 2010 Microsoft.

Used with permission.
46
Productivity
Total effort: 250 person years (mostly junior SDETs)
Saved 50 person years with model-based testing
42% Less Time

Per Requirement
Model-based Testing
1.4 Days/Requirement
Traditional Testing
2.4 Days/Requirement
Requirements
Study
Modeling
Test
Coding
Adapter
Coding
Test
Execution
47
Lessons Learned Technology

Technical documentation for complex systems is meaningless (or worse)
without validation. Different mindset needed for doc validation

Many published standards (RFCs) have significant deficiencies
Shallow coverage (1 test/rqmt) was effective because existing services were
golden
MBT productivity and effectiveness even greater for deeper testing of new
services
Be practical: use hand-coded test logic when convoluted behavior defies
modeling
48
Lessons Learned - Process

Allow flexibility for improvement
Insist on compliance for results
Ongoing training crucial
Bottom-up evolution
Transparent community participation
Strict and consistent enforcement
Usable conformance test suites highly valuable
49
Q &A
rvbinder@tezzter.com
50
Resources and Sources

Microsoft Open Specification web site
http://www.microsoft.com/openspecifications
Technical Documents
http://msdn.microsoft.com/library/jj712081
Project Overview
http://queue.acm.org/detail.cfm?id=1996412
Spec Explorer
About http://msdn.microsoft.com/en-us/library/ee620411.aspx
Download http://visualstudiogallery.msdn.microsoft.com/enus/271d0904-f178-4ce9-956b-d9bfa4902745
Netmon and protocol parsers
http://blogs.technet.com/b/netmon/
Protocol Test Suites (must provide Live Id to login)
http://msdn.microsoft.com/en-us/openspecifications/cc816059.aspx
Credits
Protocol map, Copyright 2001, Agilent
Technologies.
Some slides adapted with permission from

Stobie, Kicillof, and Grieskamp,
Discretizing Technical Documentation for
End-to-End Traceability Tests, INRIA
2010.
Selected charts and figures from
Grieskamp, Kicillof, Stobie, and
Braberman, Model-based quality
assurance of protocol documentation:
tools and methodology, Journal of
Software Testing, Verification, Validation
and Reliability 21: 55-71 (March 2011).

Lessons Learned Validating 60,000 Pages of Api Documentation

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Lessons Learned Validating 60,000 Pages of Api Documentation

Uploaded by

Copyright:

Available Formats

LESSONS LEARNED

VALIDATING 60,000 PAGES OF

The evil Dogbert mocks

Layers, Protocols, Stacks

Each layer is a protocol

Layer uses other protocols

IP over WiFi or LAN

Protocol may define own data or use standard

Wireless: Bluetooth, Zigbee, ANT, ISO 14443

Amazon API, REST, BING API, Netflix API, Google

SCOPE AND PROCESS

(mostly in Hyderabad & Beijing)

System Verification Assoc.)

Process Architects (MSFT &

System Verification Assoc.)

by court, with over 50 FTE

What is a Microsoft Protocol?

All product groups

Microsoft Technical Document (TD)

OS version differences endnotes

All Technical Docs Public

Test-Model Driven Protocol Verification

Model assertions generate and

Protocol Quality Assurance Process

Gen & Run

50,000+ Technical Document Issues

Many Plugfests, many 3rd party users

a unique integer that identifies the option

When processing this call, the DHCP server

[MS-DHCPM] Microsoft Dynamic Host Configuration Protocol (DHCP)

TD Statements to TD Test Requirements

The SchRpcEnumFolders method MUST retrieve a list of folders on the server.

[Upon receipt of the SchRpcEnumFolders call, the server

Norm Test Case

TDI Author rewrite

Conceptually, cells are numbered in a dataset as if the

SVR_RESP (1 byte): A single byte whose value MUST be

Make Context Explicit

If the computeByClause is present, one group is created for each unique

If the computeByClause is present, one group is created for each unique

Actions: This part MUST be present and MUST

The server MUST execute multiple actions

After close, the server MUST release all resources.

Add derived test requirement

Derived Test Requirements

MUST accept any positive number

MUST accept 1024

Ignored by the server on receipt

Reply is the same whether 0 or nonzero is used for Field

Case: only an instance of a domain

Fully Elaborated Test Requirements

Actions: This part MUST be present and MUST

The server MUST execute multiple actions

pErrorInfo: If this parameter is non-NULL and the

0x8004131A SCHED_E_MISSINGNODE: The task

If Action is missing, SCHED_E_MISSINGNODE is

Allocate Test Requirements

Expected Outputs (Test

Why Model-based Testing?

arbitrary or ad hoc strategies

Generates standalone executable test suite

Spec Explorer Personality

Spec Explorer Model Program