Introduction to Computer-Aided Verification Rajeev Alur University of Pennsylvania CAV Mentoring Workshop, July 2015.

Introduction to Computer-Aided Verification

Rajeev Alur University of Pennsylvania

CAV Mentoring Workshop, July 2015

Systems Software

Can Microsoft Windows version X be bug-free?

Millions of lines of code

Types of bugs that cause crashes well-known

Enormous effort spent on debugging/testing code

Certifying third-party code (e.g. device drivers)

do{ KeAcquireSpinLock(); nPacketsOld = nPackets; if(request){

request = request->Next;KeReleaseSpinLock();nPackets++;

}}while(nPackets!=

nPacketsOld);KeReleaseSpinLock();

Do lock operations, acquire and release strictly alternate on every

program execution?

Concurrency Libraries

Exploiting concurrency efficiently and correctly

dequeue(queue_t *queue, value_t *pvalue){ node_t *head; node_t *tail; node_t *next;

while (true) { head = queue->head; tail = queue->tail; next = head->next; if (head == queue->head) { if (head == tail) { if (next == 0) return false; cas(&queue->tail, tail, next); } else { *pvalue = next->value; if (cas(&queue->head, head, next)) break; } } } delete_node(head); return true;}

Concurrent Queue (MS’92)

Can the code deadlock?

Is sequential semantics of a queuepreserved? (Sequential consistency)

Security Checks for Java Applets

How to certify applications fordata integrity / confidentiality ?

By listening to messages, can one infer whether a particular entry is in the addressbook?

https://java.sun.com/javame/

public Vector<String> phoneBook;public String number;public int Selected;public void sendEvent() { phoneBook = getPhoneBook(); selected = chhoseReceiver(); number=phoneBook.elementAt(selected); if ((number==null)|(number=“”)){ //output error } else{ String message = inputMessage(); sendMessage(number, message); }}

EventSharingMidlet from J2ME

Certification of Safety-Critical Software

How to verify that a pacemaker meets all the correctnessrequirements published by the FDA ?

Correctness is formalized as a mathematical claim to be proved or falsified rigorously

Always with respect to the given specification

Challenge: Impossibility results for automated verifierVerification problem is undecidable

Even approximate versions are computationally intractable (model checking is Pspace-hard)

Verifiersoftware/model

correctnessspecification

yes/proof

no/bug

In Search of the Holy Grail…

History of CAV (not comprehensive…)

Some guidelines for choosing a research problem

This Talk

BubbleSort (A : array[1..n] of int) { B = A : array[1..n] of int;

for (i=0; i<n; i++) {Permute(A,B)Sorted(B[n-i,n])for 0<k<=n-i-1 and n-i<=k’<=n B[k]<=B[k’]

for (j=0; j<n-i; j++) {Permute(A,B), Sorted(B[n-i,n],for 0<k<=n-i-1 and n-i<=k’<=n

B[k]<=B[k’]for 0<k<j B[k] <= B[j]

if (B[j]>B[j+1]) swap(B,j,j+1)}

}; return B;}

1970s: Proof calculi for program

correctness

BubbleSort (A : array[1..n] of int) { B = A : array[1..n] of int;

for (i=0; i<n; i++) {

for (j=0; j<n-i; j++) {

if (B[j]>B[j+1]) swap(B,j,j+1)}

}; return B;}

Key to proof:Finding suitable loop invariants

Deductive Program Verification

Powerful mathematical logic (e.g. first-order logic, Higher-order logics) needed for formalization Great progress in decision procedures Finding proof decomposition requires expertise, but modern

tools support many built-in proof tactics Contemporary theorem provers: Coq, PVS, ACL2, ESC-Java

In practice … User partially annotates the program with invariants, and the

tool infers remaining invariants needed to complete the proof Success story: CompCert: Fully verified optimizing compiler for

a subset of C

Current research: Automatic synthesis of loop invariants

1980s: Finite-state Protocol Analysis

Automated analysis of finite-state protocols with respect to temporal logic specifications Network protocols, Distributed algorithms

Specs: Is there a deadlock? Does every req get ack? Does a buffer overflow?

Tools: SPIN, Murphi, CADP …

Battling State-space Explosion

Analysis is basically a reachability problem in a HUGE graph Size of graph grows exponentially as

the number of bits required for state encoding

Graph is constructed only incrementally, on-the-fly

Many techniques for exploiting structure: symmetry, data independence, hashing, partial order reduction …

Great flexibility in modeling: Scale down parameters (buffer size, number of network nodes…)

State

Transition

Bad states

1990s: Symbolic Model Checking

Constraint-based analysis of Boolean systems Symbolic Boolean representations (propositional formulas,

OBDDs) used to encode system dynamics Success in finding high-quality bugs in hardware applications

(VHDL/Verilog code)

M P

UICUIC

UIC

M P

Global bus

Cluster bus

Read-shared/read-owned/write-invalid/write-shared/…

Deadlock found in cache coherency protocol Gigamax by model checker SMV

Symbolic Reachability Problem

Model variables X ={x1, … xn} Each var is of finite type, say, boolean

Initialization: I(X): a formula over X e.g. (x1 && ~x2)

Update: T(X,X’)How new vars X’ are related to old vars X as a result of executing one step of the program: Disjunction of clauses obtained by compiling individual instructions e.g. (x1 && x1’ = x1 && x2’ = ~x2 && x3’ = x3)

Target set: F(X) e.g. (x2 && x3)

Computational problem: Can F be satisfied starting with I by repeatedly applying T ?

K-step reachability reduces to propositional satisfiability (SAT): Bounded Model CheckingI(X0) && T(X0,X1) && T(X1,X2) && --- && T(Xk-1,Xk) && F(Xk)

The Story of SAT

2001Chaff

10k var

1986BDDs

100 var

1992GSAT

300 var

1996Stålmarck 1000 var

1996GRASP1k var

1960DP

10 var

1988SOCRATES

3k var

1994Hannibal 3k var

1962DLL

10 var

1952Quine

10 var1996SATO1k var

2002Berkmin10k var

Propositional Satisfiability: Given a formula over Boolean variables, is there an assignment of 0/1’s to vars which makes the formula true Canonical NP-hard problem (Cook 1973) Enormous progress in tools that can solve instances with

thousands of variables and millions of clauses Extensions to richer classes of constraints (SMT solvers)

2000s: Model Checking of C code

Phase 1: Given a program P, build an abstract finite-state (Boolean) model A such that set of behaviors of P is a subset of those of A (conservative abstraction)

Phase 2: Model check A wrt specification: this can prove P to be correct, or reveal a bug in P, or suggest inadequacy of A

Shown to be effective on Windows device drivers in Microsoft Research project SLAM (follow-up: SDV)

do{ KeAcquireSpinLock(); nPacketsOld = nPackets; if(request){

request = request->Next;KeReleaseSpinLock();nPackets++;

}}while(nPackets!=

nPacketsOld);KeReleaseSpinLock();

Do lock operations, acquire and release, strictly alternate on every

program execution?

Software Model Checking

Tools for verifying source code combine many techniques Program analysis techniques such as slicing, range analysis Abstraction Model checking Refinement from counter-examples (CEGAR)

New challenges for model checking (beyond finite-state reachability analysis) Recursion gives pushdown control Pointers, dynamic creation of objects, inheritence….

Active research area Abstraction-based tools: SLAM, BLAST,…

Direct state encoding: F-SOFT, CBMC, CheckFence…

SMT Success Story

SMT-LIB Standardized Interchange Format (smt-lib.org)Problem classification + Benchmark repositoriesLIA, LIA_UF, LRA, QF_LIA, …

+ Annual Competition (smt-competition.org)

Z3 Yices CVC4 MathSAT5

CBMC SAGE VCC Spec#

Since 1990s: Cyber-Physical Systems

Discrete software interacting with a continuously evolving physical system

Need to model physical world using differential equations/timing delays

Models: Timed automata, Hybrid automata

Symbolic reachability analysis over sets of real-valued variables

Finite-state abstractions Beyond correctness: Stability, Timely

response Fruitful collaboration between control

theory and formal methods

Formal Methods for Cyber-Physical Systems

Tools for verifying timed/hybrid systems modelsUppaal, Taliro, Keymaera, dReal, Space-Ex …

Applications Medical devices (infusion pump, pacemaker) Autonomous driving (collision avoidance protocols)

Industrial technology transfer Model-based design tools (e.g. Hybrid automata as Simulink

domain)

Simulink Design Verifier (model-based testing, static analysis)

Industry research groups (Toyota, Ford…)

How to choose a research problem ?

Common Themes in CAV Success Stories Phase 1: Initial demonstration of a compelling match between

the capability of a research prototype and real-world need Phase 2: Sustained research on improving scalability

But the path to the promised land is unclear …

Incremental vs. Transformative

Symbolic model checking using binary decision diagrams (McMillan et al, 1990) Importance was immediately obvious and celebrated Critical for industrial adoption of hardware model checking

Chaff: Engineering an efficient SAT solver (Malik etal,2001) Low-level optimization exploiting cache perforamce

Played critical role in boosting performance of SAT solvers

Don’t keep searching for “big” ideas by dismissing research problems as incremental

Source: Existing Literature vs. Real-world Problems?

Hybrid automata (Alur, Henzinger et al, 1991) Started as a theoretical extension of timed automata

Now with significant research and adoption in CPS community

SAGE (Godefroid et al, CACM 2012) A response to pressing industrial need for effective testing for

discovering security vulnerabilities

Integration of many research ideas into a highly successful tool

Keep looking everywhere!

Theoretical Results vs. Prototype Tools

Nested depth-first search (CVWY, CAV 1990) Beautiful algorithm for on-the-fly detection of fair cycles

Key ingredient of all explicit-state LTL model checkers

SLAM (Ball and Rajamani, 2001) Integration of predicate abstraction, symbolic model checking,

and counter-example guided abstraction refinement

Prototype tool and evaluation essential to demonstrate utility

CAV offers many options for research: theoretical, practical, and theory in practice!

Advice 1: Be sure of the motivation

If you were to succeed in finding a good solution to the problem you are studying, what would be the consequence? Tool: who is a potential user?

Algorithm: which tool can use and why should it use?

Method: which design/analysis task can be done better?

Be convinced of the answer yourself first, and worry about reviewers later

Advice 2: Know the related work

Is your idea new? How does it fit into what people know and have tried earlier?

Vast literature, but there is no way around this question

Be an expert on work related to your thesis

Caution: this is not an excuse for inaction!

Advice 3: Don’t live in a silo!

Computer science is rapidly expanding in exciting directions Need to know at a high level what’s happening around you

Organization into conferences/sub-disciplines is artificial

Other fields can be a source of new ideas, applications, solution techniques How can statistical machine learning help CAV?

Can CAV techniques be applied to problems in system biology?

Goal: Become an expert in Formal Methods AND X

Introduction to Computer-Aided Verification Rajeev Alur University of Pennsylvania CAV Mentoring Workshop, July 2015.

Documents