ProVerif 2.05: Automatic Cryptographic Protocol Verifier, User

ProVerif 2.05:

Automatic Cryptographic Protocol Veriﬁer,

User Manual and Tutorial

Bruno Blanchet, Ben Smyth, Vincent Cheval, and Marc Sylvestre

[email protected], [email protected], [email protected],

[email protected]

October 17, 2023

Acknowledgements

This manual was written with support from the Direction G´en´erale pour l’Armement (DGA) and the

EPSRC project UbiVal (EP/D076625/2). ProVerif was developed while Bruno Blanchet was aﬃliated

with INRIA Paris-Rocquencourt, with CNRS, Ecole Normale Sup´erieure, Paris, and with Max-Planck-

Institut f¨ur Informatik, Saarbr¨ucken. This manual was written while Bruno Blanchet was aﬃliated with

INRIA Paris-Rocquencourt and with CNRS, Ecole Normale Sup´erieure, Paris, Ben Smyth was aﬃliated

with Ecole Normale Sup´erieure, Paris and with University of Birmingham, Vincent Cheval was aﬃliated

with CNRS and Inria Nancy, and Marc Sylvestre was aﬃliated with INRIA Paris. The development of

ProVerif would not have been possible without the helpful remarks from the research community; their

contributions are greatly appreciated and further feedback is encouraged.

iii

Contents

1 Introduction 1

1.1 Applications of ProVerif . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

1.2 Scope of this manual . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.3 Support . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.4 Installation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

1.4.1 Installation via OPAM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

1.4.2 Installation from sources (Linux/Mac/cygwin) . . . . . . . . . . . . . . . . . . . . 3

1.4.3 Installation from binaries (Windows) . . . . . . . . . . . . . . . . . . . . . . . . . . 4

1.4.4 Emacs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

1.4.5 Atom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

1.5 Copyright . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2 Getting started 7

3 Using ProVerif 11

3.1 Modeling protocols . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

3.1.1 Declarations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

3.1.2 Example: Declaring cryptographic primitives for the handshake protocol . . . . . . 13

3.1.3 Process macros . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

3.1.4 Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

3.1.5 Example: handshake protocol . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

3.2 Security properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

3.2.1 Reachability and secrecy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

3.2.2 Correspondence assertions, events, and authentication . . . . . . . . . . . . . . . . 19

3.2.3 Example: Secrecy and authentication in the handshake protocol . . . . . . . . . . 20

3.3 Understanding ProVerif output . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

3.3.1 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

3.3.2 Example: ProVerif output for the handshake protocol . . . . . . . . . . . . . . . . 23

3.4 Interactive mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

3.4.1 Interface description . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

3.4.2 Manual and auto-reduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

3.4.3 Execution of 0, P | Q, !P , new, let, if, and event . . . . . . . . . . . . . . . . . . . 32

3.4.4 Execution of inputs and outputs . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

3.4.5 Button “Add a term to public” . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

3.4.6 Execution of insert and get . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

3.4.7 Handshake run in interactive mode . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

3.4.8 Advanced features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

4 Language features 37

4.1 Primitives and modeling features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

4.1.1 Constants . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

4.1.2 Data constructors and type conversion . . . . . . . . . . . . . . . . . . . . . . . . . 37

4.1.3 Natural numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

4.1.4 Enriched terms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

vi CONTENTS

4.1.5 Tables and key distribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

4.1.6 Phases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

4.1.7 Synchronization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

4.2 Further cryptographic operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

4.2.1 Extended destructors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

4.2.2 Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

4.2.3 Function macros . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47

4.2.4 Process macros with fail . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

4.2.5 Suitable formalizations of cryptographic primitives . . . . . . . . . . . . . . . . . . 49

4.3 Further security properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

4.3.1 Complex correspondence assertions, secrecy, and events . . . . . . . . . . . . . . . 52

4.3.2 Observational equivalence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57

5 Needham-Schroeder: Case study 65

5.1 Simpliﬁed Needham-Schroeder protocol . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

5.1.1 Basic encoding . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

5.1.2 Security properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

5.2 Full Needham-Schroeder protocol . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

5.3 Generalized Needham-Schroeder protocol . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

5.4 Variants of these security properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

5.4.1 A variant of mutual authentication . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

5.4.2 Authenticated key exchange . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79

5.4.3 Full ordering of the messages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

6 Advanced reference 87

6.1 Proving correspondence queries by induction . . . . . . . . . . . . . . . . . . . . . . . . . 87

6.1.1 Single query . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

6.1.2 Group of queries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

6.2 Axioms, restrictions, and lemmas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

6.3 Predicates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

6.4 Referring to bound names in queries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100

6.5 Exploring correspondence assertions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101

6.6 ProVerif options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102

6.6.1 Command-line arguments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102

6.6.2 Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104

6.7 Theory and tricks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114

6.7.1 The resolution strategy of ProVerif . . . . . . . . . . . . . . . . . . . . . . . . . . . 114

6.7.2 Performance and termination . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

6.7.3 Alternative encodings of protocols . . . . . . . . . . . . . . . . . . . . . . . . . . . 121

6.7.4 Applied pi calculus encodings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122

6.7.5 Sources of incompleteness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123

6.7.6 Misleading syntactic constructs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125

6.8 Compatibility with CryptoVerif . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

6.9 Additional programs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128

6.9.1 test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128

6.9.2 analyze . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

6.9.3 addexpectedtags . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130

7 Outlook 131

A Language reference 133

B Semantics 141

List of Figures

3.1 Handshake protocol . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

3.2 Term and process grammar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

3.3 Pattern matching grammar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

3.4 Messages and events for authentication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

3.5 Handshake protocol attack trace . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

3.6 Handshake protocol - Initial simulator window . . . . . . . . . . . . . . . . . . . . . . . . 31

3.7 Handshake protocol - Simulator window 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

3.8 Handshake protocol - Simulator window 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

3.9 Handshake protocol - Simulator window 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

4.1 Natural number grammar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

4.2 Enriched terms grammar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

4.3 Grammar for correspondence assertions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53

A.1 Grammar for terms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134

A.2 Grammar for declarations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

A.3 Grammar for destructors (see Sections 3.1.1 and 4.2.1) and equations (see Section 4.2.2) . 136

A.4 Grammar for not, queries, and lemmas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136

A.5 Grammar for not, queries, and lemmas restricted after parsing . . . . . . . . . . . . . . . 137

A.6 Grammar for nounif . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138

A.7 Grammar for clauses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138

A.8 Grammar for processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139

B.1 Semantics of process terms and patterns . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144

B.2 Semantics of processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145

vii

viii LIST OF FIGURES

Chapter 1

Introduction

This manual describes the ProVerif software package version 2.05. ProVerif is a tool for automatically

analyzing the security of cryptographic protocols. Support is provided for, but not limited to, crypto-

graphic primitives including: symmetric and asymmetric encryption; digital signatures; hash functions;

bit-commitment; and non-interactive zero-knowledge proofs. ProVerif is capable of proving reachability

properties, correspondence assertions, and observational equivalence. These capabilities are particularly

useful to the computer security domain since they permit the analysis of secrecy and authentication

properties. Moreover, emerging properties such as privacy, traceability, and veriﬁability can also be

considered. Protocol analysis is considered with respect to an unbounded number of sessions and an

unbounded message space. Moreover, the tool is capable of attack reconstruction: when a property

cannot be proved, ProVerif tries to reconstruct an execution trace that falsiﬁes the desired property.

1.1 Applications of ProVerif

The applicability of ProVerif has been widely demonstrated. Protocols from the literature have been

successfully analyzed: ﬂawed and corrected versions of Needham-Schroeder public-key [NS78, Low96]

and shared key [NS78, BAN89, NS87]; Woo-Lam public-key [WL92, WL97] and shared-key [WL92,

AN95, AN96, WL97, GJ03]; Denning-Sacco [DS81, AN96]; Yahalom [BAN89]; Otway-Rees [OR87, AN96,

Pau98]; and Skeme [Kra96]. The resistance to password guessing attacks has been demonstrated for the

password-based protocols EKE [BM92] and Augmented EKE [BM93].

ProVerif has also been used in more substantial case studies:

Abadi & Blanchet [AB05b] use correspondence assertions to verify the certiﬁed email proto-

col [AGHP02].

Abadi, Blanchet & Fournet [ABF07] analyze the JFK (Just Fast Keying) [ABB

04] protocol, which

was one of the candidates to replace IKE as the key exchange protocol in IPSec, by combining

manual proofs with ProVerif proofs of correspondences and equivalences.

Blanchet & Chaudhuri [BC08] study the integrity of the Plutus ﬁle system [KRS

03] on untrusted

storage, using correspondence assertions, resulting in the discovery, and subsequent ﬁxing, of weak-

nesses in the initial system.

Bhargavan et al. [BFGT06, BFG06, BFGS08] use ProVerif to analyze cryptographic protocol im-

plementations written in F#; in particular, the Transport Layer Security (TLS) protocol has been

studied in this manner [BCFZ08].

Chen & Ryan [CR09] evaluate authentication protocols found in the Trusted Platform Module

(TPM), a widely deployed hardware chip, and discovered vulnerabilities.

Delaune, Kremer & Ryan [DKR09, KR05] and Backes, Hritcu & Maﬀei [BHM08] formalize and

analyze privacy properties for electronic voting using observational equivalence.

2 CHAPTER 1. INTRODUCTION

Delaune, Ryan & Smyth [DRS08] and Backes, Maﬀei & Unruh [BMU08] analyze the anonymity

properties of the trusted computing scheme Direct Anonymous Attestation (DAA) [BCC04, SRC07]

using observational equivalence.

K¨usters & Truderung [KT09, KT08] examine protocols with Diﬃe-Hellman exponentiation and

XOR.

Smyth, Ryan, Kremer & Kourjieh [SRKK10, SRK10] formalize and analyze veriﬁability properties

for electronic voting using reachability.

Bhargavan, Blanchet and Kobeissi verify Signal [KBB17] and TLS 1.3 [BBK17].

Blanchet veriﬁes the ARINC 823 avionic protocols [Bla17].

For further examples, please refer to: http://proverif.inria.fr/proverif-users.html.

1.2 Scope of this manual

This manual provides an introductory description of the ProVerif software package version 2.05. The

remainder of this chapter covers software support (Section 1.3) and installation (Section 1.4). Chapter 2

provides an introduction to ProVerif aimed at new users, advanced users may skip this chapter without

loss of continuity. Chapter 3 demonstrates the basic use of ProVerif. Chapter 4 provides a more com-

plete coverage of the features of ProVerif. Chapter 5 demonstrates the applicability of ProVerif with a

case study. Chapter 6 considers advanced topics and Chapter 7 concludes. For reference, the complete

grammar of ProVerif is presented in Appendix A. This manual does not attempt to describe the theo-

retical foundations of the internal algorithms used by ProVerif since these are available elsewhere (see

Chapter 7 for references); nor is the applied pi calculus [AF01, RS11, ABF17], which provides the basis

for ProVerif, discussed.

1.3 Support

Software bugs and comments should be reported by e-mail to:

proverif-[email protected]

User support, general discussion and new release announcements are provided by the ProVerif mailing

list. To subscribe to the list, send an email to [email protected] with the subject “subscribe proverif”

(without quotes). To post on the list, send an email to:

[email protected]

Non-members are not permitted to send messages to the mailing list.

1.4 Installation

ProVerif is compatible with the Linux, Mac, and Windows operating systems; it can be downloaded

from:

http://proverif.inria.fr/

The remainder of this section covers installation on Linux, Mac, and Windows platforms.

1.4. INSTALLATION 3

1.4.1 Installation via OPAM

ProVerif has been developed using Objective Caml (OCaml) and OPAM is the package manager of

OCaml. Installing via OPAM is the simplest, especially if you already have OPAM installed.

1. If you do not already have OPAM installed, download it from

https://opam.ocaml.org/

and install it.

2. If you already have OPAM installed, run

opam update

to make sure that you get the latest version of ProVerif.

3. Run

opam depext conf-graphviz

opam depext proverif

opam install proverif

The ﬁrst line installs graphviz, if you do not already have it. You may also install it using the

package manager of your Linux, OSX, or cygwin distribution, especially if opam fails to install it.

It is needed only for the graphical display of attacks.

The second line installs GTK+2 including development libraries, if you do not already have it.

You may also install it using the package manager of your distribution. You may additionally need

to install pkgconfig using the package manager of your distribution, if you do not already have

it and it is not installed by opam depext proverif. (This happens in particular on some OSX

installations.) GTK+2 is needed for the interactive simulator proverif interact.

The third line installs ProVerif itself and its OCaml dependencies. ProVerif executables are in

/.opam/⟨switch⟩/bin, which is in the PATH, examples are in

/.opam/⟨switch⟩/doc/proverif,

and various helper ﬁles are in

/.opam/⟨switch⟩/share/proverif. The directory ⟨switch⟩ is the

opam switch in which you installed ProVerif, by default system.

4. Download the documentation package proverifdoc2.05.tar.gz from

http://proverif.inria.fr/

and uncompress it (e.g. using tar -xzf proverifdoc2.05.tar.gz or using your favorite ﬁle

archive tool). That gives you the manual and a few additional examples.

1.4.2 Installation from sources (Linux/Mac/cygwin)

1. On Mac OS X, you need to install XCode if you do not already have it. It can be downloaded from

https://developer.apple.com/xcode/.

2. ProVerif has been developed using Objective Caml (OCaml), accordingly OCaml version 4.03 or

higher is a prerequisite to installation and can be downloaded from http://ocaml.org/, or installed

via the package manager of your distribution. OCaml provides a byte-code compiler (ocamlc) and

a native-code compiler (ocamlopt). Although ProVerif does not strictly require the native-code

compiler, it is highly recommended to achieve large performance gains.

3. The installation of graphviz is required if you want to have a graphical representation of the attacks

that ProVerif might ﬁnd. Graphviz can be downloaded from http://graphviz.org or installed

via the package manager of your distribution.

4. The installation GTK+2.24 and LablGTK2 is required if you want to run the interactive simulator

proverif interact. Use the package manager of your distribution to install GTK+2 including its

development libraries if you do not already have it and download lablgtk-2.18.6.tar.gz from

4 CHAPTER 1. INTRODUCTION

http://lablgtk.forge.ocamlcore.org/

and follow the installation instructions in their README ﬁle.

5. Download the source package proverif2.05.tar.gz and the documentation package

proverifdoc2.05.tar.gz from

http://proverif.inria.fr/

6. Decompress the archives:

(a) using GNU tar

tar -xzf proverif2.05.tar.gz

tar -xzf proverifdoc2.05.tar.gz

(b) using tar

gunzip proverif2.05.tar.gz

tar -xf proverif2.05.tar

gunzip proverifdoc2.05.tar.gz

tar -xf proverifdoc2.05.tar

This will create a directory proverif2.05 in the current directory.

7. You are now ready to build ProVerif:

cd proverif2.05

./build

(If you did not install LablGTK2, the compilation of proverif interact fails, but the executables

proverif and proveriftotex are still produced correctly, so you can use ProVerif normally, but

cannot run the interactive simulator.)

8. ProVerif has now been successfully installed.

1.4.3 Installation from binaries (Windows)

Windows users may install ProVerif using the binary distribution, as described below. They may also

install cygwin and install ProVerif from sources as explained in the previous section.

1. The installation of graphviz is required if you want to have a graphical representation of the at-

tacks that ProVerif might ﬁnd. Graphviz can be downloaded from https://graphviz.gitlab.io/

_pages/Download/Download_windows.html. Make sure that the bin subdirectory of the Graphviz

installation directory is in your PATH.

2. The installation GTK+2.24 is required if you want to run the interactive simulator

proverif interact. At

https://download.gnome.org/binaries/win32/gtk%2B/2.24/

download gtk+-bundle_2.24.10-20120208_win32.zip, unzip it in the directory C:\GTK, and add

C:\GTK\bin to your PATH.

3. Download the Windows binary package proverifbin2.05.tar.gz and the documentation package

proverifdoc2.05.tar.gz from

http://proverif.inria.fr/

4. Decompress the proverifbin2.05.tar.gz and proverifdoc2.05.tar.gz archives in the same

directory using your favorite ﬁle archive tool (e.g. WinZip).

5. ProVerif has now been successfully installed in the directory where the ﬁle was extracted.

1.5. COPYRIGHT 5

1.4.4 Emacs

If you use the emacs text editor for editing ProVerif input ﬁles, you can install the emacs mode provided

with the ProVerif distribution.

1. Copy the ﬁle emacs/proverif.el (if you installed by OPAM in the switch ⟨switch⟩, the ﬁle

.opam/⟨switch⟩/share/proverif/emacs/proverif.el) to a directory where Emacs will ﬁnd it

(that is, in your emacs load-path).

2. Add the following lines to your .emacs ﬁle:

(setq auto-mode-alist

(cons '("\\.horn$" . proverif-horn-mode)

(cons '("\\.horntype$" . proverif-horntype-mode)

(cons '("\\.pv[l]?$" . proverif-pv-mode)

(cons '("\\.pi$" . proverif-pi-mode) auto-mode-alist)))))

(autoload 'proverif-pv-mode "proverif" "Major mode for editing ProVerif code." t)

(autoload 'proverif-pi-mode "proverif" "Major mode for editing ProVerif code." t)

(autoload 'proverif-horn-mode "proverif" "Major mode for editing ProVerif code." t)

(autoload 'proverif-horntype-mode "proverif" "Major mode for editing ProVerif code." t)

1.4.5 Atom

There is also a ProVerif mode for the text editor Atom (https://atom.io/), by Vincent Cheval. It can

be downloaded from the Atom web site; the package name is language-proverif.

1.5 Copyright

The ProVerif software is distributed under the GNU general public license. For details see:

http://proverif.inria.fr/LICENSE

6 CHAPTER 1. INTRODUCTION

Chapter 2

Getting started

This chapter provides a basic introduction to ProVerif and is aimed at new users; experienced users may

choose to skip this chapter. ProVerif is a command-line tool which can be executed using the syntax:

./proverif [options] ⟨ﬁlename⟩

where ./proverif is ProVerif’s binary; ⟨ﬁlename⟩ is the input ﬁle; and command-line parameters

[options] will be discussed later (Section 6.6.1). ProVerif can handle input ﬁles encoded in several

languages. The typed pi calculus is currently considered to be state-of-the-art and ﬁles of this sort

are denoted by the ﬁle extension .pv. This manual will focus on protocols encoded in the typed

pi calculus. (For the interested reader, other input formats are mentioned in Section 6.6.1 and in

docs/manual-untyped.pdf.) The pi calculus is designed for representing concurrent processes that

interact using communications channels such as the Internet.

ProVerif is capable of proving reachability properties, correspondence assertions, and observational

equivalence. This chapter will demonstrate the use of reachability properties and correspondence as-

sertions in a very basic manner. The true power of ProVerif will be discussed in the remainder of this

manual.

Reachability properties. Let us consider the ProVerif script:

1 ( h e l l o . pv : H e l lo World S c r i p t )

3 free c : chan nel .

5 free Cocks : b i t s t r i n g [ private ] .

6 free RSA: b i t s t r i n g [ private ] .

10 process

11 out ( c ,RSA) ;

12 0

Line 1 contains the comment “hello.pv: Hello World Script”; comments are enclosed by ( comment ).

Line 3 declares the free name c of type channel which will later be used for public channel communication.

Lines 5 and 6 declare the free names Cocks and RSA of type bitstring , the keyword [private] excludes

the names from the attacker’s knowledge. Line 10 declares the start of the main process. Line 11 outputs

the name RSA on the channel c. Finally, the termination of the process is denoted by 0 on Line 12.

Names may be of any type, but we explicitly distinguish names of type channel from other types,

since the former may be used as a communications channel for message input/output. The concept of

bound and free names is similar to local and global scope in programming languages; that is, free names

are globally known, whereas bound names are local to a process. By default, free names are known by

the attacker. Free names that are not known by the attacker must be declared private with the addition

of the keyword [private]. The message output on Line 11 is broadcast using a public channel because

the channel name c is a free name; whereas, if c were a bound name or explicitly excluded from the

8 CHAPTER 2. GETTING STARTED

attacker’s knowledge, then the communication would be on a private channel. For convenience, the ﬁnal

line may be omitted and hence out(c,RSA) is an abbreviation of out(c,RSA);0.

Properties of the aforementioned script can be examined using ProVerif. For example, to test as to

whether the names Cocks and RSA are available derivable by the attacker, the following lines can be

included before the main process:

7 query attacker (RSA ) .

8 query attacker ( Cocks ) .

Internally, ProVerif attempts to prove that a state in which the names Cocks and RSA are known to the

attacker is unreachable (that is, it tests the queries not attacker(RSA) and not attacker(Cocks), and

these queries are true when the names are not derivable by the attacker). This makes ProVerif suitable

for proving the secrecy of terms in a protocol.

Executing ProVerif (./proverif docs/hello.pv) produces the output:

Process 0 (that is, the initial process):

{1}out(c, RSA)

-- Query not attacker(RSA[]) in process 0.

Translating the process into Horn clauses...

Completing...

Starting query not attacker(RSA[])

goal reachable: attacker(RSA[])

Derivation:

1. The message RSA[] may be sent to the attacker at output {1}.

attacker(RSA[]).

2. By 1, attacker(RSA[]).

The goal is reached, represented in the following fact:

attacker(RSA[]).

A more detailed output of the traces is available with

set traceDisplay = long.

out(c, ~M) with ~M = RSA at {1}

The attacker has the message ~M = RSA.

A trace has been found.

RESULT not attacker(RSA[]) is false.

-- Query not attacker(Cocks[]) in process 0.

Translating the process into Horn clauses...

Completing...

Starting query not attacker(Cocks[])

RESULT not attacker(Cocks[]) is true.

--------------------------------------------------------------

Verification summary:

Query not attacker(RSA[]) is false.

Query not attacker(Cocks[]) is true.

--------------------------------------------------------------

As can be interpreted from RESULT not attacker:(Cocks[]) is true, the attacker has not been able

to obtain the free name Cocks. The attacker has, however, been able to obtain the free name RSA as

denoted by the RESULT not attacker:(RSA[]) is false. ProVerif is also able to provide an attack

trace. In this instance, the trace is very short and denoted by

out(c, ~M) with ~M = RSA at {1}

The attacker has the message ~M = RSA.

which means that the name RSA is output on channel c at point {1} in the process and stored by the

attacker in ~M, where point {1} is annotated on Line 2 of the output. ProVerif concludes the trace

by saying that the attacker has RSA. ProVerif also provides an English language description of the

derivation denoted by

1. The message RSA[] may be sent to the attacker at output {1}.

attacker(RSA[]).

2. By 1, attacker(RSA[]).

The goal is reached, represented in the following fact:

attacker(RSA[]).

A derivation is the ProVerif internal representation of how the attacker may break the desired property,

here may obtain RSA. It generally corresponds to an attack as in the example above, but may sometimes

correspond to a false attack because of the internal approximations made by ProVerif. In contrast, when

ProVerif presents a trace, it always corresponds to a real attack. See Section 3.3 for more details. The

output ends with a summary of the results for all queries.

Correspondence assertions. Let us now consider an extended variant docs/hello ext.pv of the

script:

1 ( h e l l o e x t . pv : H e ll o Extended World S c r i p t )

3 free c : chan nel .

5 free Cocks : b i t s t r i n g [ private ] .

6 free RSA: b i t s t r i n g [ private ] .

8 event evCocks .

9 event evRSA .

11 query event ( evCocks ) ==> event (evRSA ) .

13 process

14 out ( c ,RSA) ;

15 in ( c , x : b i t s t r i n g ) ;

16 i f x = Cocks then

17 event evCocks ;

18 event evRSA

19 el se

20 event evRSA

Lines 1-7 should be familiar. Lines 8-9 declare events evCocks and evRSA. Intuition suggests that Line 11

is some form of query. Lines 13-14 should again be standard. Line 15 contains a message input of type

bitstring on channel c which it binds to the variable x. Lines 16-20 denote an if-then-else statement;

the body of the then branch can be found on Lines 17-18 and the else branch on Line 20. We remark

that the code presented is a shorthand for the more verbose

i f x = Cocks then event evCocks ; event evRSA ; 0 el se event evRSA ; 0

10 CHAPTER 2. GETTING STARTED

where 0 denotes the end of a branch (termination of a process). The statement event evCocks (similarly

event evRSA) declares an event and the query

query event ( evCocks ) ==> event (evRSA)

is true if and only if, for all executions of the protocol, if the event evCocks has been executed, then the

event evRSA has also been executed before. Executing the script produces the output:

Process 0 (that is, the initial process):

{1}out(c, RSA);

{2}in(c, x: bitstring);

{3}if (x = Cocks) then

{4}event evCocks;

{5}event evRSA

else

{6}event evRSA

-- Query event(evCocks) ==> event(evRSA) in process 0.

Translating the process into Horn clauses...

Completing...

Starting query event(evCocks) ==> event(evRSA)

RESULT event(evCocks) ==> event(evRSA) is true.

--------------------------------------------------------------

Verification summary:

Query event(evCocks) ==> event(evRSA) is true.

--------------------------------------------------------------

As expected, it is not possible to witness the event evCocks without having previously executed the event

evRSA and hence the correspondence event(evCocks) ==> event(evRSA) is true. In fact, a stronger

property is true: the event evCocks is unreachable. The reader can verify this claim with the addition of

query event(evCocks). (The authors remark that writing code with unreachable points is a common

source of errors for new users. Advice on avoiding such pitfalls will be presented in Section 4.3.1.)

Chapter 3

Using ProVerif

The primary goal of ProVerif is the veriﬁcation of cryptographic protocols. Cryptographic protocols

are concurrent programs which interact using public communication channels such as the Internet to

achieve some security-related objective. These channels are assumed to be controlled by a very powerful

environment which captures an attacker with “Dolev-Yao” capabilities [DY83]. Since the attacker has

complete control of the communication channels, the attacker may: read, modify, delete, and inject

messages. The attacker is also able to manipulate data, for example: compute the ith element of a

tuple; and decrypt messages if it has the necessary keys. The environment also captures the behavior

of dishonest participants; it follows that only honest participants need to be modeled. ProVerif’s input

language allows such cryptographic protocols and associated security objectives to be encoded in a formal

manner, allowing ProVerif to automatically verify claimed security properties. Cryptography is assumed

to be perfect; that is, the attacker is only able to perform cryptographic operations when in possession

of the required keys. In other words, it cannot apply any polynomial-time algorithm, but is restricted to

apply only the cryptographic primitives speciﬁed by the user. The relationships between cryptographic

primitives are captured using rewrite rules and/or an equational theory.

In this chapter, we demonstrate how to use ProVerif for verifying cryptographic protocols, by consid-

ering a na¨ıve handshake protocol (Figure 3.1) as an example. Section 3.1 discusses how cryptographic

protocols are encoded within ProVerif’s input language, a variant of the applied pi calculus [AF01, RS11]

which supports types; Section 3.2 shows the security properties that can be proved by ProVerif; and Sec-

tion 3.3 explains how to understand ProVerif’s output.

3.1 Modeling protocols

A ProVerif model of a protocol, written in the tool’s input language (the typed pi calculus), can be divided

into three parts. The declarations formalize the behavior of cryptographic primitives (Section 3.1.1); and

their use is demonstrated on the handshake protocol (Section 3.1.2). Process macros (Section 3.1.3) allow

sub-processes to be deﬁned, in order to ease development; and ﬁnally, the protocol itself can be encoded

as a main process (Section 3.1.4), with the use of macros.

3.1.1 Declarations

Processes are equipped with a ﬁnite set of types, free names, and constructors (function symbols) which

are associated with a ﬁnite set of destructors. The language is strongly typed and user-deﬁned types are

declared as

type t .

All free names appearing within an input ﬁle must be declared using the syntax

free n : t .

where n is a name and t is its type. Several free names of the same type t can be declared by

free n

, . . . , n

: t .

12 CHAPTER 3. USING PROVERIF

Figure 3.1 Handshake protocol

A na¨ıve handshake protocol between client A and server B is illustrated below. It is assumed that each

principal has a public/private key pair, and that the client A knows the server B’s public key pk(skB).

The aim of the protocol is for the client A to share the secret s with the server B. The protocol proceeds

as follows. On request from a client A, server B generates a fresh symmetric key k (session key), pairs it

with his identity (public key pk(skB)), signs it with his secret key skB and encrypts it using his client’s

public key pk(skA). That is, the server sends the message aenc(sign((pk(skB),k),skB),pk(skA)). When

A receives this message, she decrypts it using her secret key skA, veriﬁes the digital signature made by

B using his public key pk(skB), and extracts the session key k. A uses this key to symmetrically encrypt

the secret s. The rationale behind the protocol is that A receives the signature asymmetrically encrypted

with her public key and hence she should be the only one able to decrypt its content. Moreover, the

digital signature should ensure that B is the originator of the message. The messages sent are illustrated

as follows:

A → B : pk(skA)

B → A : aenc(sign((pk(skB),k),skB),pk(skA))

A → B : senc(s ,k)

Note that protocol narrations (as above) are useful, but lack clarity. For example, they do not specify any

checks which should be made by the participants during the execution of the protocol. Such checks include

verifying digital signatures and ensuring that encrypted messages are correctly formed. Failure of these

checks typically results in the participant aborting the protocol. These details will be explicitly stated

when protocols are encoded for ProVerif. (For further discussion on protocol speciﬁcation, see [AN96,

Aba00].)

Informally, the three properties we would like this protocol to provide are:

1. Secrecy: the value s is known only to A and B.

2. Authentication of A to B: if B reaches the end of the protocol and he believes he has shared the

key k with A, then A was indeed his interlocutor and she has shared k.

3. Authentication of B to A: if A reaches the end of the protocol with shared key k, then B proposed

k for use by A.

However, the protocol is vulnerable to a man-in-the-middle attack (illustrated below). If a dishonest

participant I starts a session with B, then I is able to impersonate B in a subsequent session the client

A starts with B. At the end of the protocol, A believes that she shares the secret s with B, while she

actually shares s with I.

I → B : pk(skI)

B → I : aenc(sign((pk(skB),k),skB),pk(skI))

A → B : pk(skA)

I → A : aenc(sign((pk(skB),k),skB),pk(skA))

A → B : senc(s ,k)

The protocol can easily be corrected by adding the identity of the intended client:

A → B : pk(skA)

B → A : aenc(sign((pk(skA),pk(skB),k),skB),pk(skA))

A → B : senc(s ,k)

With this correction, I is not able to re-use the signed key from B in her session with A.

3.1. MODELING PROTOCOLS 13

The syntax channel c. is a synonym for free c: channel. By default, free names are known by the

attacker. Free names that are not known by the attacker must be declared private:

free n : t [ private ] .

Constructors (function symbols) are used to build terms modeling primitives used by cryptographic

protocols; for example: one-way hash functions, encryptions, and digital signatures. Constructors are

deﬁned by

fun f(t

, . . . , t

) : t .

where f is a constructor of arity n, t is its return type, and t

, . . . , t

are the types of its arguments.

Constructors are available to the attacker unless they are declared private:

fun f(t

, . . . , t

) : t [ private ] .

Private constructors can be useful for modeling tables of keys stored by the server (see Section 6.7.3),

for example.

The relationships between cryptographic primitives are captured by destructors which are used to

manipulate terms formed by constructors. Destructors are modeled using rewrite rules of the form:

reduc f o r a l l x

1,1

: t

1,1

, . . . , x

1,n

: t

1,n

; g(M

1,1

, . . . , M

1,k

) = M

1,0

;

. . .

f o r a l l x

m,1

: t

m,1

, . . . , x

m,n

: t

m,n

; g(M

m,1

, . . . , M

m,k

) = M

m,0

where g is a destructor of arity k. The terms M

1,1

, . . . , M

1,k

, M

1,0

are built from the application of

constructors to variables x

1,1

, . . . , x

1,n

of types t

1,1

, . . . , t

1,n

respectively (and similarly for the other

rewrite rules). The return type of g is the type M

1,0

and M

1,0

, . . . , M

m,0

must have the same type. We

similarly require that the arguments of the destructor have the same type; that is, M

1,1

, . . . , M

1,k

have

the same types as M

i,1

, . . . , M

i,k

for i ∈ [2, m], and these types are the types of the arguments of g. When

the term g(M

1,1

, . . . , M

1,k

) (or an instance of that term) is encountered during execution, it is replaced

by M

1,0

, and similarly for the other rewrite rules. When no rule can be applied, the destructor fails,

and the process blocks (except for the let process, see Section 3.1.4). This behavior corresponds to real

world application of cryptographic primitives which include suﬃcient redundancy to detect scenarios in

which an operation fails. For example, in practice, encrypted messages may be assumed to come with

suﬃcient redundancy to discover when the ‘wrong’ key is used for decryption. It follows that destructors

capture the behavior of cryptographic primitives which can visibly fail.

When several variables have the same type, we can avoid repeating their type in the declaration,

writing for instance:

reduc f o r a l l x, y : t, z : t

′

; g(M

, . . . , M

) = M

Destructors must be deterministic, that is, for each terms (M

, . . . , M

) given as argument to g, when

several rewrite rules apply, they must all yield the same result and, in the rewrite rules, the variables

that occur in M

i,0

must also occur in M

i,1

, . . . , M

i,k

, so that the result of g(M

, . . . , M

) is entirely

determined.

In a similar manner to constructors, destructors may be declared private by appending [private].

The generic mechanism by which primitives are encoded permits the modeling of various cryptographic

operators.

It is possible to use let bindings within the declaration of each rewrite rule. For example, an abstract

zero knowledge proof used in some voting protocols could be declared as follows:

reduc f o r a l l r : rand , i : id , v : vote , pub : p u b l i c ke y ;

l et c i p h e r = ra enc ( v , r , pub ) in

checkzkp ( zkp ( r , i , v , c i p h e r ) , i , c i p h e r ) = ok .

3.1.2 Example: Declaring cryptographic primitives for the handshake pro-

tocol

We now formalize the basic cryptographic primitives used by the handshake protocol.

14 CHAPTER 3. USING PROVERIF

Symmetric encryption. For symmetric encryption, we deﬁne the type key and consider the binary

constructor senc which takes arguments of type bitstring , key and returns a bitstring .

1 type key .

3 fun s enc ( b i t s t r i n g , key ) : b i t s t r i n g .

Note that the type bitstring is built-in, and hence, need not be declared as a user-deﬁned type. The type

key is not built-in and hence we declare it on Line 1. To model the decryption operation, we introduce

the destructor:

4 reduc f o r a l l m: b i t s t r i n g , k : key ; sde c ( s enc (m, k ) , k ) = m.

where m represents the message and k represents the symmetric key.

Asymmetric encryption. For asymmetric cryptography, we consider the unary constructor pk, which

takes an argument of type skey (private key) and returns a pkey (public key), to capture the notion of

constructing a key pair. Decryption is captured in a similar manner to symmetric cryptography with a

public/private key pair used in place of a symmetric key.

5 type skey .

6 type pkey .

8 fun pk ( ske y ) : pkey .

9 fun aenc ( b i t s t r i n g , pkey ) : b i t s t r i n g .

11 reduc f o r a l l m: b i t s t r i n g , k : skey ; adec ( aenc (m, pk ( k ) ) , k ) = m.

Digital signatures. In a similar manner to asymmetric encryption, digital signatures rely on a pair of

signing keys of types sskey (private signing key) and spkey (public signing key). We will consider digital

signatures with message recovery:

12 type ss ke y .

13 type spkey .

15 fun spk ( s sk ey ) : spkey .

16 fun s i g n ( b i t s t r i n g , s sk ey ) : b i t s t r i n g .

18 reduc f o r a l l m: b i t s t r i n g , k : ss ke y ; getmess ( si g n (m, k ) ) = m.

19 reduc f o r a l l m: b i t s t r i n g , k : ss ke y ; c he c k si g n ( si g n (m, k ) , spk ( k ) ) = m.

The constructors spk, for creating public keys, and sign, for constructing signatures, are standard.

The destructors permit message recovery and signature veriﬁcation. The destructor getmess allows the

attacker to get the message m from the signature, even without having the key. The destructor checksign

checks the signature, and returns m only when the signature is correct. Honest processes typically use

only checksign. This model of signatures assumes that the signature is always accompanied with the

message m. It is also possible to model signatures that do not reveal the message m, see Section 4.2.5.

Tuples and typing. For convenience, ProVerif has built-in support for tupling. A tuple of length

n > 1 is deﬁned as (M

, . . . , M

) where M

, . . . , M

are terms of any type. Once in possession of a

tuple, the attacker has the ability to recover the ith element. The inverse is also true: if the attacker is

in possession of terms M

, . . . , M

, then it can construct the tuple (M

, . . . , M

). Tuples are always of

type bitstring. Accordingly, constructors that take arguments of type bitstring may be applied to tuples.

Note that the term (M) is not a tuple and is equivalent to M. (Parentheses are needed to override the

default precedence of inﬁx operators.) It follows that (M) and M have the same type and that tuples of

arity one do not exist.

3.1. MODELING PROTOCOLS 15

3.1.3 Process macros

To facilitate development, protocols need not be encoded into a single main process (as we did in

Chapter 2). Instead, sub-processes may be speciﬁed in the declarations using macros of the form

l et R(x

: t

, . . . , x

: t

) = P .

where R is the macro name, P is the sub-process being deﬁned, and x

, . . . , x

, of types t

, . . . , t

respectively, are the free variables of P . The macro expansion R(M

, . . . , M

) will then expand to P with

substituted for x

, . . . , M

substituted for x

. As an example, consider a variant docs/hello var.pv

of docs/hello.pv (previously presented in Chapter 2):

free c : ch anne l .

free Cocks : b i t s t r i n g [ private ] .

free RSA: b i t s t r i n g [ private ] .

query attacker ( Cocks ) .

l et R( x : b i t s t r i n g ) = out ( c , x ) ; 0 .

l et R ( y : b i t s t r i n g )= 0 .

process R(RSA) | R ( Cocks )

By inspection of ProVerif’s output (see Section 3.3 for details on ProVerif’s output), one can observe

that this process is identical to the one in which the macro deﬁnitions are omitted and are instead

expanded upon in the main process. It follows immediately that macros are only an encoding which we

ﬁnd particularly useful for development.

3.1.4 Processes

The basic grammar of the language is presented in Figure 3.2; advanced features will be discussed in

Chapter 4; and the complete grammar is presented in Appendix A for reference.

Terms M, N consist of names a, b, c, k, m, n, s; variables x, y, z; tuples (M

, . . . , M

) where j is the

arity of the tuple; and constructor/destructor application, denoted h(M

, . . . , M

) where k is the arity

of h and arguments M

, . . . , M

have the required types. Some functions use the inﬁx notation: M =

N for equality, M <> N for disequality (both equality and disequality work modulo an equational

theory; they take two arguments of the same type and return a result of type bool), M && M for the

boolean conjunction, M || M for the boolean disjunction. We use not(M ) for the boolean negation. In

boolean operations, all values diﬀerent from true (modulo an equational theory) are considered as false .

Furthermore, if the ﬁrst argument of M && M is not true, then the second argument is not evaluated

and the result is false . Similarly, if the ﬁrst argument of M || M is true, then the second argument is

not evaluated and the result is true.

Processes P, Q are deﬁned as follows. The null process 0 does nothing; P | Q is the parallel com-

position of processes P and Q, used to represent participants of a protocol running in parallel; and the

replication !P is the inﬁnite composition P | P | . . ., which is often used to capture an unbounded number

of sessions. Name restriction new n : t; P binds name n of type t inside P , the introduction of restricted

names (or private names) is useful to capture both fresh random numbers (modeling nonces and keys,

for example) and private channels. Communication is captured by message input and message output.

The process in(M, x : t); P awaits a message of type t from channel M and then behaves as P with the

received message bound to the variable x; that is, every free occurrence of x in P refers to the message

received. The process out(M, N); P is ready to send N on channel M and then run P . In both of these

cases, we may omit P when it is 0. The conditional if M then P else Q is standard: it runs P when

the boolean term M evaluates to true, it runs Q when M evaluates to some other value. It executes

nothing when the term M fails (for instance, when M contains a destructor for which no rewrite rule

applies). For example, if M = N then P else Q tests equality of M and N. For convenience, condi-

tionals may be abbreviated as if M then P when Q is the null process. The power of destructors can

be capitalized upon by let x = M in P else Q statements where M may contain destructors. When

16 CHAPTER 3. USING PROVERIF

Figure 3.2 Term and process grammar

M, N ::= terms

a, b, c, k, m, n, s names

x, y, z variables

, . . . , M

) tuple

h(M

, . . . , M

) constructor/destructor application

M = N term equality

M <> N term disequality

M && M conjunction

M || M disjunction

not(M) negation

P, Q ::= processes

0 null process

P | Q parallel composition

!P replication

new n : t; P name restriction

in(M, x : t); P message input

out(M, N); P message output

if M then P else Q conditional

let x = M in P else Q term evaluation

R(M

, . . . , M

) macro usage

Figure 3.3 Pattern matching grammar

T ::= patterns

x : t typed variable

x variable without explicit type

: t unnamed typed variable

unnamed variable without explicit type

, ..., T

) tuple

=M equality test

this statement is encountered during process execution, there are two possible outcomes. If the term M

does not fail (that is, for all destructors in M , matching rewrite rules exist), then x is bound to M and

the P branch is taken; otherwise (rather than blocking), the Q branch is taken. (In particular, when M

never fails, the P branch will always be executed with x bound to M .) For convenience, the statement

let x = M in P else Q may be abbreviated as let x = M in P when Q is the null process. Finally, we

have R(M

, . . . , M

), denoting the use of the macro R with terms M

, . . . , M

as arguments.

Pattern matching.

For convenience, ProVerif supports pattern matching and we extend the grammar to include patterns

(Figure 3.3). The variable pattern x : t matches any term of type t and binds the matched term to x. The

variable pattern x is similar, but can be used only when the type of x can be inferred from the context.

When the matched term is not used, the variable can be replaced with the symbol , which matches any

term (of a certain type) without binding the matched term to a variable. The tuple pattern (T

, . . . , T

)

matches tuples (M

, . . . , M

) where each component M

(i ∈ {1, . . . , n}) is recursively matched with T

Finally, the pattern =M matches terms N where M = N . (This is equivalent to an equality test.)

To make use of patterns, the grammar for processes is modiﬁed. We omit the rule in(M, x : t); P

and instead consider in(M, T ); P which awaits a message matching the pattern T and then behaves as

P with the free variables of T bound inside P . Similarly, we replace let x = M in P else Q with the

more general let T = M in P else Q. (Note that let x = M in P else Q is a particular case in which

3.1. MODELING PROTOCOLS 17

the type of x is inferred from M; users may also write let x : t = M in P else Q where t is the type of

M, ProVerif will produce an error if there is a type mismatch.)

Scope and binding.

Bracketing must be used to avoid ambiguities in the way processes are written down. For example,

the process ! P | Q might be interpreted as !(P | Q), or as (!P ) | Q. These processes are diﬀerent.

To avoid too much bracketing, we adopt conventions about the precedence of process operators. The

binary parallel process P | Q binds most closely; followed by the binary processes if M then P else Q,

let x = M in P else Q; ﬁnally, unary processes bind least closely. It follows that ! P | Q is interpreted

as !(P | Q). Users should pay particular attention to ProVerif warning messages since these typically

arise from misunderstanding ProVerif’s binding conventions. For example, consider the process

new n : t ; out ( c , n ) | new n : t ; in ( c , x : t ) ; 0 | i f x = n then 0 | out ( c , n )

which produces the message “Warning: identiﬁer n rebound.” Moreover, the process will never perform

the ﬁnal out(c,n) because the process is bracketed as follows:

new n : t ; ( out ( c , n) | new n : t ; ( in ( c , x : t ) ; 0 | i f x = n then (0 | out ( c , n ) ) ) )

and hence the ﬁnal output is guarded by a conditional which can never be satisﬁed. The authors

recommend the distinct naming of names and variables to avoid confusion. New users may like to

refer to the output produced by ProVerif to ensure that they have deﬁned processes correctly (see also

Section 3.3). Another possible ambiguity arises because of the convention of omitting else 0 in the

if-then-else construct (and similarly for let-in-else): it is not clear which if the else applies to in the

expression:

i f M = M

′

then i f N = N

′

then P els e Q

In this instance, we adopt the convention that the else branch belongs to the closest if and hence the

statement should be interpreted as if M = M

′

then (if N = N

′

then P else Q). The convention is

similar for let-in-else.

Remarks about syntax

The restrictions on identiﬁers (Figure 3.2) for constructors/destructors h, names a, b, c, k, m, n, s, types

t, and variables x, y, z are completely relaxed. Formally, we do not distinguish between identiﬁers and

let identiﬁers range over an unlimited sequence of letters (a-z, A-Z), digits (0-9), underscores (

), single-

quotes (’), and accented letters from the ISO Latin 1 character set where the ﬁrst character of the

identiﬁer is a letter and the identiﬁer is distinct from the reserved words. Note that identiﬁers are case

sensitive. Comments can be included in input ﬁles and are surrounded by (* and *). Nested comments

are supported.

Reserved words. The following is a list of keywords in the ProVerif language; accordingly, they cannot

be used as identiﬁers.

among, axiom, channel, choice, clauses, const, def, diﬀ, do, elimtrue, else, equation, equiva-

lence, event, expand, fail, for, forall, foreach, free, fun, get, if, implementation, in, inj-event,

insert, lemma, let, letfun, letproba, new, noninterf, noselect, not, nounif, or, otherwise, out,

param, phase, pred, proba, process, proof, public vars, putbegin, query, reduc, restriction,

secret, select, set, suchthat, sync, table, then, type, weaksecret, yield.

ProVerif also has built-in types any type, bitstring , bool, nat, sid, time, constants true, false of type

bool, destructor is nat , predicates attacker, mess, subterm; although these identiﬁers can be reused

as identiﬁers, the authors strongly discourage this practice.

18 CHAPTER 3. USING PROVERIF

3.1.5 Example: handshake protocol

We are now ready to present an encoding of the handshake protocol, available in docs/ex handshake.pv

(for brevity, we omit function/type declarations and destructors, for details see Section 3.1.1):

1 free c : chan nel .

3 free s : b i t s t r i n g [ private ] .

4 query attacker ( s ) .

6 le t c l i e nt A (pkA : pkey , skA : skey , pkB : spkey ) =

7 out ( c , pkA ) ;

8 in ( c , x : b i t s t r i n g ) ;

9 l e t y = adec ( x , skA ) in

10 l e t (=pkB , k : key ) = ch ec k s ig n ( y , pkB) in

11 out ( c , se nc ( s , k ) ) .

13 le t se rve rB (pkB : spkey , skB : s sk ey ) =

14 in ( c , pkX : pkey ) ;

15 new k : key ;

16 out ( c , aenc ( s i g n ( ( pkB , k ) , skB ) , pkX ) ) ;

17 in ( c , x : b i t s t r i n g ) ;

18 l e t z = s dec ( x , k ) in

19 0 .

21 process

22 new skA : skey ;

23 new skB : s sk ey ;

24 l e t pkA = pk ( skA ) in out ( c , pkA ) ;

25 l e t pkB = spk ( skB ) in out ( c , pkB ) ;

26 ( ( ! c l i e n t A (pkA , skA , pkB ) ) | ( ! se rve rB ( pkB , skB ) ) )

The ﬁrst line declares the public channel c. Lines 3-4 should be familiar from Chapter 2 and further

details will be given in Section 3.2. The client process is deﬁned by the macro starting on Line 6 and

the server process is deﬁned by the macro starting on Line 13. The main process generates the private

asymmetric key skA and the private signing key skB for principals A, B respectively (Lines 22-23). The

public key parts pk(skA), spk(skB) are derived and then output on the public communications channel c

(Lines 24-25), ensuring that they are available to the attacker. (Observe that this is done using handles

pkA, pkB for convenience.) The main process also instantiates multiple copies of the client and server

macros with the relevant parameters representing multiple sessions of the roles.

We assume that the server B is willing to run the protocol with any other principal; the choice

of her interlocutor will be made by the environment. This is captured by modeling the ﬁrst input

in(c,pkX:pkey) to serverB as his client’s public key pkX (Line 14). The client A on the other hand only

wishes to share his secret s with the server B; accordingly, B’s public key is hard-coded into the process

clientA. We additionally assume that each principal is willing to engage in an unbounded number of

sessions and hence clientA(pkA,skA,pkB) and serverB(pkB,skB) are under replication.

The client and server processes correspond exactly to the description presented in Figure 3.1 and we

will now describe the details of our encoding. On request from a client, server B starts the protocol

by selecting a fresh key k and outputting aenc(sign((pkB,k),skB),pkX) (Line 16); that is, her signature

on the key k paired with her identity spk(skB) and encrypted for his client using her public key pkX.

Meanwhile, the client A awaits the input of his interlocutor’s signature on the pair (pkB,k) encrypted

using his public key (Line 8). A veriﬁes that the ciphertext is correctly formed using the destructor

adec on Line 9, which will visibly fail if x is not a message asymmetrically encrypted for the client;

that is, the (omitted) else branch of the statement will be evaluated because there is no corresponding

rewrite rule. The statement let (=pkB,k:key) = checksign(y,pkB) in on Line 10 uses destructors and

pattern matching with type checking to verify that y is a signature under skB containing a pair, where

the ﬁrst element is the server’s public signing key and the second is a symmetric key k. If y is not a

3.2. SECURITY PROPERTIES 19

correct signature, then the (omitted) else branch of the statement will be evaluated because there is

no corresponding rewrite rule, so the client halts. Finally, the server inputs a bitstring x and recovers

the cleartext as variable z. (Observe that the failure of decryption is again detectable.) Note that the

variable z in the server process is not used.

3.2 Security properties

The ProVerif tool is able to prove reachability properties, correspondence assertions, and observational

equivalence. In this section, we will demonstrate how to prove the security properties of the handshake

protocol. A more complete coverage of the properties that ProVerif can prove is presented in Section 4.3.

3.2.1 Reachability and secrecy

Proving reachability properties is ProVerif’s most basic capability. The tool allows the investigation of

which terms are available to an attacker; and hence (syntactic) secrecy of terms can be evaluated with

respect to a model. To test secrecy of the term M in the model, the following query is included in the

input ﬁle before the main process:

query attacker ( M ) .

where M is a ground term, without destructors, containing free names (possibly private and hence

not initially known to the attacker). We have already demonstrated the use of secrecy queries on our

handshake protocol (see the code in Section 3.1.5).

3.2.2 Correspondence assertions, events, and authentication

Correspondence assertions [WL93] are used to capture relationships between events which can be ex-

pressed in the form “if an event e has been executed, then event e

′

has been previously executed.” More-

over, these events may contain arguments, which allow relationships between the arguments of events to

be studied. To reason with correspondence assertions, we annotate processes with events, which mark

important stages reached by the protocol but do not otherwise aﬀect behavior. Accordingly, we extend

the grammar for processes to include events denoted

event e(M

, . . . , M

) ; P

Importantly, the attacker’s knowledge is not extended by the terms M

, . . . , M

following the execution

of event e(M

, . . . , M

); hence, the execution of the process Q after inserting events is the execution

of Q without events from the perspective of the attacker. All events must be declared (in the list of

declarations in the input ﬁle) in the form event e(t

, . . . , t

). where t

, . . . , t

are the types of the event

arguments. Relationships between events may now be speciﬁed as correspondence assertions.

Correspondence

The syntax to query a basic correspondence assertion is:

query x

: t

, . . . , x

: t

; event (e(M

, . . . , M

)) ==> event (e

′

, . . . , N

) ) .

where M

, . . . , M

, N

, . . . , N

are terms built by the application of constructors to the variables x

, . . . ,

of types t

, . . . , t

and e, e

′

are declared as events. The query is satisﬁed if, for each occurrence of the

event e(M

, . . . , M

), there is a previous execution of e

′

, . . . , N

). Moreover, the parameterization

of the events must satisfy any relationships deﬁned by M

, . . . , M

, N

, . . . , N

; that is, the variables

, . . . , x

have the same value in M

, . . . , M

and in N

, . . . , N

In such a query, the variables that occur before the arrow ==> (that is, in M

, . . . , M

) are universally

quantiﬁed, while the variables that occur after the arrow ==> (in N

, . . . , N

) but not before are

existentially quantiﬁed. For instance,

query x : t

, y : t

, z : t

; event (e(x, y)) ==> event (e

′

(y, z) ) .

means that, for all x, y, for each occurrence of e(x, y), there is a previous occurrence of e

′

(y, z) for some

20 CHAPTER 3. USING PROVERIF

Injective correspondence

The deﬁnition of correspondence we have just discussed is insuﬃcient to capture authentication in cases

where a one-to-one relationship between the number of protocol runs performed by each participant is

desired. Consider, for example, a ﬁnancial transaction in which the server requests payment from the

client; the server should complete the transaction only once for each transaction started by the client. (If

this were not the case, the client could be charged for several transactions, even if the client only started

one.) The situation is similar for access control and other scenarios. Injective correspondence assertions

capture the one-to-one relationship and are denoted:

query x

: t

, . . . , x

: t

; inj−event (e(M

, . . . , M

)) ==> inj−event (e

′

, . . . , N

) ) .

Informally, this correspondence asserts that, for each occurrence of the event e(M

, . . . , M

), there is

a distinct earlier occurrence of the event e

′

, . . . , N

). It follows immediately that the number of

occurrences of e

′

, . . . , N

) is greater than, or equal to, the number of occurrences of e(M

, . . . , M

Note that using inj−event or event before the arrow ==> does not change the meaning of the query.

It is only important after the arrow.

3.2.3 Example: Secrecy and authentication in the handshake protocol

Authentication can be captured using correspondence assertions (additional applications of correspon-

dence assertions were discussed in 1.1). Recall that in addition to the secrecy property mentioned for

the handshake protocol in Figure 3.1, there were also authentication properties. The protocol is intended

to ensure that, if client A thinks she executes the protocol with server B, then she really does so, and

vice versa. When we say ‘she thinks’ that she executes it with B, we mean that the data she receives

indicates that fact. Accordingly, we declare the events:

event acceptsClient(key), which is used by the client to record the belief that she has accepted to

run the protocol with the server B and the supplied symmetric key.

event acceptsServer(key,pkey), which is used to record the fact that the server considers he has

accepted to run the protocol with a client, with the proposed key supplied as the ﬁrst argument

and the client’s public key as the second.

event termClient(key,pkey), which means the client believes she has terminated a protocol run

using the symmetric key supplied as the ﬁrst argument and the client’s public key as the second.

event termServer(key), which denotes the server’s belief that he has terminated a protocol run

with the client A with the symmetric key supplied as the ﬁrst argument.

Recall that the client is only willing to share her secret with the server B; it follows that, if she completes

the protocol, then she believes she has done so with B and hence authentication of B to A should hold.

In contrast, server B is willing to run the protocol with any client (that is, he is willing to learn secrets

from many clients), and hence at the end of the protocol he only expects authentication of A to B to

hold, if he believes A was indeed his interlocutor (so termServer(x) is executed only when pkX = pkA).

We can now formalize the two authentication properties (given in Figure 3.1) for the handshake protocol.

They are, respectively:

query x : key , y : spkey ; event ( term Cl ie nt ( x , y))==>event ( a c c ep t s S e r v e r ( x , y ) ) .

query x : key ; inj −event ( t ermS erver ( x))==>inj−event ( a c c e p t s C l i e n t ( x ) ) .

The subtle diﬀerence between the two correspondence assertions is due to the diﬀering authentication

properties expected by participants A and B. The ﬁrst correspondence is not injective because the

protocol does not allow the client to learn whether the messages she received are fresh: the message from

the server to the client may be replayed, leading to several client sessions for a single server session. The

revised ProVerif encoding with annotations and correspondence assertions is presented below and in the

ﬁle docs/ex

handshake annotated.pv (cryptographic declarations have been omitted for brevity):

1 free c : chan nel .

3.2. SECURITY PROPERTIES 21

3 free s : b i t s t r i n g [ private ] .

4 query attacker ( s ) .

6 event a c c e p t s C l i e n t ( key ) .

7 event a c c e p t s Se r v e r ( key , pkey ) .

8 event t er mCl ie nt ( key , pkey ) .

9 event ter mSer ver ( key ) .

11 query x : key , y : pkey ; event ( t er mC li ent ( x , y))==>event ( a c c e p t s S e r v er (x , y ) ) .

12 query x : key ; inj−event ( term Serv er ( x))==>inj−event ( ac c e p t s C l i e n t ( x ) ) .

14 le t c l i e nt A (pkA : pkey , skA : skey , pkB : spkey ) =

15 out ( c , pkA ) ;

16 in ( c , x : b i t s t r i n g ) ;

17 l e t y = adec ( x , skA ) in

18 l e t (=pkB , k : key ) = ch ec k s ig n ( y , pkB) in

19 event a c c e p t s C l i e n t ( k ) ;

20 out ( c , se nc ( s , k ) ) ;

21 event te rm Cl ien t ( k , pkA ) .

23 le t se rve rB (pkB : spkey , skB : sskey , pkA : pkey ) =

24 in ( c , pkX : pkey ) ;

25 new k : key ;

26 event a c c e p t s S e r v e r ( k , pkX ) ;

27 out ( c , aenc ( s i g n ( ( pkB , k ) , skB ) , pkX ) ) ;

28 in ( c , x : b i t s t r i n g ) ;

29 l e t z = s dec ( x , k ) in

30 i f pkX = pkA then event t er mSer ver ( k ) .

32 process

33 new skA : skey ;

34 new skB : s sk ey ;

35 l e t pkA = pk ( skA ) in out ( c , pkA ) ;

36 l e t pkB = spk ( skB ) in out ( c , pkB ) ;

37 ( ( ! c l i e n t A (pkA , skA , pkB ) ) | ( ! se rve rB ( pkB , skB , pkA) ) )

Figure 3.4 Messages and events for authentication

Client

Server

event acceptsServer

event termServer

event termClient

event acceptsClient

message n − 1

message n

There is generally some ﬂexibility in the placement of events in a process, but not all choices are correct.

For example, in order to prove authentication in our handshake protocol, we consider the property

query x : key ; inj−event ( term Serv er ( x))==>inj−event ( ac c e p t s C l i e n t ( x ) ) .

and the event termServer is placed when the server terminates (typically at the end of the protocol),

while acceptsClient is placed when the client accepts (typically before the client sends its last message).

Therefore, when the last message, message n, is from the client to the server, the placement of events

follows Figure 3.4: the last message sent by the client is message n, so acceptsClient is placed before the

client sends message n, and termServer is placed after the server receives message n. The last message

sent by the server is message n − 1, so acceptsServer is placed before the server sends message n − 1, and

22 CHAPTER 3. USING PROVERIF

termClient is placed after the client receives message n − 1 (any position after that reception is ﬁne).

More generally, the event that occurs before the arrow ==> can be placed at the end of the protocol, but

the event that occurs after the arrow ==> must be followed by at least one message output. Otherwise,

the whole protocol can be executed without executing the latter event, so the correspondence certainly

does not hold.

One can also note that moving an event that occurs before the arrow ==> towards the beginning of

the protocol strengthens the correspondence property, and moving an event that occurs after the arrow

==> towards the end of the protocol also strengthens the correspondence property. Adding arguments

to the events strengthens the correspondence property as well.

3.3 Understanding ProVerif output

The output produced by ProVerif is rather verbatim and can be overwhelming for new users. In essence

the output is in the following format:

[ Equati ons ]

Pr oc es s :

[ Pr oc es s ]

−− Query [ Query ]

Completing . . .

S t a r t i n g query [ Query ]

go a l [ un ] r e a ch a b l e : [ Goal ]

Ab br e vi at io ns :

. . .

[ Attack d e r i v a t i o n ]

A more d e t a i l e d output of the t r a c e s i s a v a i l a b l e with

set tr a c e D i s p l a y = lon g .

[ Attack t r a c e ]

RESULT [ Query ] [ r e s u l t ] .

−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−

V e r i f i c a t i o n summary :

[ Summary o f v e r i f i c a t i o n r e s u l t s ]

−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−

where [Equations] summarizes the internal representation of the equations given in the input ﬁle (if any)

and [Process] presents the input process with all macros expanded and distinct identiﬁers assigned to

unique names/variables; in addition, parts of the process are annotated with identiﬁers {n} where n ∈ N

∗

(New users may like to refer to this interpreted process to ensure they have deﬁned the scope of variables

in the correct manner and to ensure they haven’t inadvertently bound processes inside if-then-else/let-in-

else statements.) ProVerif then begins to evaluate the [Query] provided by the user. Internally, ProVerif

attempts to prove that a state in which a property is violated is unreachable; it follows that ProVerif

shows the (un)reachability of some [Goal]. If a property is violated then ProVerif attempts to reconstruct

an [Attack derivation] in English and an [Attack trace] in the applied pi calculus. ProVerif then reports

whether the query was satisﬁed. Finally, ProVerif displays a summary of the veriﬁcation results of all the

queries in the ﬁle. For convenience, Linux and cygwin users may make use of the following command:

./proverif ⟨ﬁlename⟩.pv | grep "RES"

which reduces the output to the results of the queries.

3.3. UNDERSTANDING PROVERIF OUTPUT 23

3.3.1 Results

In order to understand the results correctly, it is important to understand the diﬀerence between the

attack derivation and the attack trace. The attack derivation is an explanation of the actions that the

attacker has to make in order to break the security property, in the internal representation of ProVerif.

Because this internal representation uses abstractions, the derivation is not always executable in reality;

for instance, it may require the repetition of certain actions that can in fact never be repeated, for

instance because they are not under a replication. In contrast, the attack trace refers to the semantics

of the applied pi calculus, and always corresponds to an executable trace of the considered process.

ProVerif can display three kinds of results:

RESULT [Query] is true: The query is proved, there is no attack. In this case, ProVerif displays

no attack derivation and no attack trace.

RESULT [Query] is false: The query is false, ProVerif has discovered an attack against the desired

security property. The attack trace is displayed just before the result (and an attack derivation is

also displayed, but you should focus on the attack trace since it represents the real attack).

RESULT [Query] cannot be proved: This is a “don’t know” answer. ProVerif could not prove that

the query is true and also could not ﬁnd an attack that proves that the query is false. Since the

problem of verifying protocols for an unbounded number of sessions is undecidable, this situation

is unavoidable. Still, ProVerif gives some additional information that can be useful in order to

determine whether the query is true. In particular, ProVerif displays an attack derivation. By

manually inspecting the derivation, it is sometimes possible to reconstruct an attack. For observa-

tional equivalence properties, it may also display an attack trace, even if this trace does not prove

that the observational equivalence does not hold. We will come back to this point when we deal

with observational equivalence, in Section 4.3.2. Sources of incompleteness, which explain why

ProVerif sometimes fails to prove properties that hold, will be discussed in Section 6.7.5.

Interpreting results. Understanding the internal manner in which ProVerif operates is useful to

interpret the results output. Recall that ProVerif attempts to prove that a state in which a property

is violated is unreachable. It follows that when ProVerif is supplied with query attacker(M )., that

internally ProVerif attempts to show not attacker(M) and hence RESULT not attacker(M) is true.

means that the secrecy of M is preserved by the protocol.

Error and warning messages. In case of a syntax error, ProVerif indicates the character position of

the error (line and column numbers). Please use your text editor to ﬁnd the position of the error. (The er-

ror messages can be interpreted by emacs.) In addition, ProVerif may provide various warning messages.

The earlier grep command can be modiﬁed into ./proverif ⟨ﬁlename⟩.pv | egrep "RES|Err|War"

for more manageable output with notiﬁcation of error/warnings, although a more complex command

is required to read any associated messages. In this case, the command ./proverif ⟨ﬁlename⟩.pv |

less can be useful.

3.3.2 Example: ProVerif output for the handshake protocol

Executing the handshake protocol with ./proverif docs/ex handshake annotated.pv | grep "RES"

produces the following output:

RESULT not attacker ( s [ ] ) i s f a l s e .

RESULT event ( te rm Cl ie nt ( x 2 , y 1 ) ) ==> event ( ac c e p t s S e r v er ( x 2 , y 1 ) ) i s f a l s e .

RESULT inj −event ( term Serve r ( x 2 ) ) ==> inj−event ( ac c e p t s C l i e n t ( x 2 ) ) i s tr ue .

which informs us that authentication of A to B holds, but authentication of B to A and secrecy of s do

not hold.

24 CHAPTER 3. USING PROVERIF

Analyzing attack traces.

By inspecting the output more closely, we can reconstruct the attack. For example, let us consider the

query query attacker(s) which produces the following:

1 Pr oc ess 0 ( t hat i s , t he i n i t i a l process ) :

2 {1}new skA : skey ;

3 {2}new skB : s sk ey ;

4 {3} l e t pkA : pkey = pk ( skA ) in

5 {4} out ( c , pkA ) ;

6 {5} l e t pkB : spkey = spk ( skB ) in

7 {6} out ( c , pkB ) ;

8 (

9 {7 } !

10 {8}out ( c , pkA ) ;

11 {9} in ( c , x : b i t s t r i n g ) ;

12 {10} l et y : b i t s t r i n g = adec ( x , skA ) in

13 {11} l et (=pkB , k : key ) = ch ec k s ig n ( y , pkB) in

14 {12}event a c c e p t s C l i e n t ( k ) ;

15 {13}out ( c , sen c ( s , k ) ) ;

16 {14}event t er mC lie nt ( k , pkA)

17 ) | (

18 {1 5} !

19 {16} in ( c , pkX : pkey ) ;

20 {17}new k 1 : key ;

21 {18}event a c c e p t s S erv e r ( k 1 , pkX ) ;

22 {19}out ( c , aenc ( s i g n ( ( pkB , k 1 ) , skB ) , pkX ) ) ;

23 {20} in ( c , x 1 : b i t s t r i n g ) ;

24 {21} l et z : b i t s t r i n g = sd ec ( x 1 , k 1 ) in

25 {22} i f (pkX = pkA) then

26 {23}event te rmSer ver ( k 1 )

27 )

29 −− Query not attacker ( s [ ] ) in process 0 .

30 Completing . . .

31 S t a r t i n g query not attacker ( s [ ] )

32 go al r e a ch a b l e : attacker ( s [ ] )

34 De ri v at i on :

35 Ab b re vi at io n s :

36 k

2 = k 1 [ pkX = pk ( sk ) , ! 1 = @sid ]

38 1 . The attacker has some term sk .

39 attacker ( sk ) .

41 2 . By 1 , the attacker may know sk .

42 Using the f u n c t i o n pk the attacker may ob ta in pk ( sk ) .

43 attacker ( pk ( sk ) ) .

45 3 . The message pk ( sk ) th at the attacker may have by 2 may be r e c e i v e d at

46 i npu t { 16 }.

47 So th e message aenc ( s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) , pk ( sk ) ) may be s en t to the

48 attacker at output {19} .

49 attacker ( aenc ( s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) , pk ( sk ) ) ) .

51 4 . By 3 , the attacker may know aenc ( si g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) , pk ( sk ) ) .

3.3. UNDERSTANDING PROVERIF OUTPUT 25

52 By 1 , the attacker may know sk .

53 Using the f u n c t i o n adec the attacker may o bta in s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) .

54 attacker ( si g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) ) .

56 5 . By 4 , the attacker may know s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) .

57 Using the f u n c t i o n getmess th e attacker may o bt ai n ( spk ( skB [ ] ) , k 2 ) .

58 attacker ( ( spk ( skB [ ] ) , k 2 ) ) .

60 6 . By 5 , the attacker may know ( spk ( skB [ ] ) , k 2 ) .

61 Using the f u n c t i o n 2−p roj −2−t u pl e th e attacker may o bt ai n k 2 .

62 attacker ( k 2 ) .

64 7 . The message pk ( skA [ ] ) may be s en t t o th e attacker at output {4 }.

65 attacker ( pk ( skA [ ] ) ) .

67 8 . By 4 , the attacker may know s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) .

68 By 7 , the attacker may know pk ( skA [ ] ) .

69 Using the f u n c t i o n aenc the attacker may o bta in

70 aenc ( s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) , pk ( skA [ ] ) ) .

71 attacker ( aenc ( s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) , pk ( skA [ ] ) ) ) .

73 9 . The message aenc ( s i g n ( ( spk ( skB [ ] ) , k 2 ) , skB [ ] ) , pk ( skA [ ] ) ) th at t he attacker

74 may have by 8 may be r e c e i v e d at i npu t {9 }.

75 So th e message s enc ( s [ ] , k 2 ) may be s e nt to th e attacker at output {13 } .

76 attacker ( se nc ( s [ ] , k 2 ) ) .

78 10 . By 9 , the attacker may know s enc ( s [ ] , k 2 ) .

79 By 6 , the attacker may know k 2 .

80 Using the f u n c t i o n sd ec t he attacker may o bt ai n s [ ] .

81 attacker ( s [ ] ) .

83 11 . By 10 , attacker ( s [ ] ) .

84 The g o a l i s reached , r e p r e se n t e d in the f o l l o w i n g f a c t :

85 attacker ( s [ ] ) .

88 A more d e t a i l e d output of the t r a c e s i s a v a i l a b l e with

89 set t r a c e D i s p l ay = lo ng .

91 new skA : skey c r e a t i n g skA

1 at {1}

93 new skB : s sk ey c r e a t i n g skB 1 at {2}

95 out ( c , ˜M) with ˜M = pk ( skA 1 ) at {4}

97 out ( c , ˜M 1) with ˜M 1 = spk ( skB 1 ) at {6}

99 out ( c , ˜M 2) with ˜M 2 = pk ( skA 1 ) at {8} in copy a

100

101 in ( c , pk ( a 1 )) at {16} in copy a 2

102

103 new k 1 : key c r e a t i n g k 2 at {17} in copy a 2

104

105 event a c c e p t s Se r v e r ( k 2 , pk ( a 1 ) ) at {18} in copy a 2

106

26 CHAPTER 3. USING PROVERIF

107 out ( c , ˜M 3) with ˜M 3 = aenc ( s i g n ( ( spk ( skB 1 ) , k 2 ) , skB 1 ) , pk ( a 1 )) at {19}

108 in copy a 2

109

110 in ( c , aenc ( adec (˜ M 3 , a 1 ) , ˜M) ) with aenc ( adec (˜ M 3 , a 1 ) ,˜M) =

111 aenc ( s i g n ( ( spk ( skB 1 ) , k 2 ) , skB 1 ) , pk ( skA 1 ) ) at {9} in copy a

112

113 event a c c e p t s C l i e n t ( k 2 ) at {12} in copy a

114

115 out ( c , ˜M 4) with ˜M 4 = se nc ( s , k 2 ) at {13} in copy a

116

117 event t er mCl ie nt ( k 2 , pk ( skA 1 ) ) at {14} in copy a

118

119 The attacker has th e message

120 s dec (˜M 4,2− pro j −2−t u p l e ( getmess ( adec (˜M 3 , a 1 ) ) ) ) = s .

121 A tr a c e has been found .

122 RESULT not attacker ( s [ ] ) i s f a l s e .

123

ProVerif ﬁrst outputs its internal representation of the process under consideration. Then, it handles

each query in turn. The output regarding the query query attacker(s) can be split into three main

parts:

From “Abbreviations” to “A more detailed... ”, a description of the derivation that leads to the

fact attacker(s).

After “A more detailed... ” until “A trace has been found”, a description of the corresponding at-

tack trace.

Finally, the “RESULT” line concludes: the property is false, there is an attack in which the attacker

gets s.

Let us ﬁrst explain the derivation. It starts with a list of abbreviations: these abbreviations give names

to some subterms, in order to display them more brieﬂy; such abbreviations are used for the internal

representation of names (keys, nonces, . . . ), which can sometimes be large terms that represent simple

atomic data. Next, the description of the derivation itself starts. It is a numbered list of steps, here

from 1 to 10. Each step corresponds to one action of the process or of the attacker. After an English

description of the step, ProVerif displays the fact that is derived thanks to this step, here attacker(M )

for some term M , meaning that the attacker has M.

In step 1, the attacker chooses any value sk in its knowledge (which it is going to use as its secret

key).

In step 2, the attacker uses the knowledge of sk obtained at step 1 (“By 1”) to compute the

corresponding public key pk(sk) using function pk.

Step 3 is a step of the process. Input {16} (the numbers between braces refer to program

points also written between braces in the description of the process, so input {16} is the in-

put of Line 19) receives the message pk(sk) from the attacker, and output {19} (the one at

Line 22) replies with aenc(sign((spk(skB[]), k 2),skB []), pk(sk)). Note that k 2 is an abbreviation

for k 2 = k 1[pkX = pk(sk),!1 = @sid], as listed at the beginning of the derivation. It designates

the key k 2 generated by the new at Line 20, in session @sid (the number of the copy generated

by the replication at Line 18, designated by !1, that is, the ﬁrst replication), when the key pkX

received by the input at Line 19 is pk(sk). ProVerif displays skB[] instead of skB when skB is a

name without argument (that is, a free name or a name chosen under no replication and no input).

In other words, the attacker starts a session of the server B with its own public key and gets the

corresponding message aenc(sign((spk(skB[]), k 2),skB []), pk(sk)).

Steps 4 to 6 are again applications of functions by the attacker to perform its internal computations:

the attacker decrypts the message aenc(sign((spk(skB[]), k 2),skB []), pk(sk)) received at step 3 and

gets the signed message, so it obtains sign((spk(skB[]), k 2),skB[]) (step 4) and k 2 (step 6).

3.3. UNDERSTANDING PROVERIF OUTPUT 27

Step 7 uses a step of the process: by the output {4} (the one at Line 5), the attacker gets pk(skA[]).

At step 8, the attacker reencrypts sign((spk(skB []),k 2),skB[]) with pk(skA[]).

Step 9 is again a step of the process: the attacker sends aenc(sign((spk(skB[]), k 2),skB []), pk(skA[]))

(obtained at step 8) to input {9} (at Line 11) and gets the reply senc(s [], k 2). In other words, the

attacker has obtained a correct message 2 for a session between A and B. It sends this message to

A who replies with senc(s [], k 2) as if it was running a session with B.

In step 10, the attacker decrypts senc(s [], k 2) since it has k 2 (by step 6), so it obtains s [] .

Finally, step 11 indicates that the query goal has been reached, that is, attacker(s[]).

As one can notice, this derivation corresponds exactly to the attack against the protocol outlined in

Figure 3.1. The display of the derivation can be tuned by some settings: set abbreviateDerivation = false

prevents the use of abbreviations for names and set explainDerivation = false switches to a display of

the derivation by explicit references to the Horn clauses used internally by ProVerif instead of relating

the derivation to the process. (See also Section 6.6.2 for details on these settings.)

Next, ProVerif reconstructs a trace in the semantics of the pi calculus, corresponding to this deriva-

tion. This trace is presented as a sequence of inputs and outputs on public channels and of events. The

internal reductions of the process are not displayed for brevity. (As mentioned in the output, it is possible

to obtain a more detailed display with the state of the process and the knowledge of the attacker at each

step by adding set traceDisplay = long. in your input ﬁle.) Each input, output, or event is followed by

its location in the process “at {n}”, which refers to the program point between braces in the process

displayed at the beginning. When the process is under replication, several copies of the process may be

generated. Each of these copies is named (by a name like “a n”), and ProVerif indicates in which copy

of the process the input, output, or event is executed. The name itself is unimportant, just the fact that

the copy is the same or diﬀerent is important: the presence of diﬀerent names of copies for the same

replication shows that several sessions are used. Let us explain the trace in the case of the handshake

protocol:

The ﬁrst two new correspond to the creation of secret keys.

The ﬁrst two outputs correspond to the outputs of public keys, at outputs {4} (Line 5) and {6}

(Line 7). The attacker stores these public keys in fresh variables ˜M and ˜M 1 respectively, so that

it can reuse them later.

The third output is the output of pkA at output {8} (Line 10), in a session of the client A named

The next 4 steps correspond to a session of the server B (copy a 2) with the attacker: the attacker

sends its public key pk(a 1) at the input {16} (Line 19). A fresh shared key k 2 is then created. The

event acceptsServer is executed (Line 21), and the message aenc(sign((spk(skB 1), k 2), skB 1),

pk(a 1)) is sent at output {19} (Line 22) and stored in variable ˜M 3, a fresh variable that can be

used later by the attacker. These steps correspond to step 3 of the derivation above.

The last 4 steps correspond to the end of the execution of the session a of the

client A. The attacker computes aenc(adec(˜M 3,a 1),˜M)) and obtains the message

aenc(sign((spk(skB 1),k 2),skB 1),pk(skA 1)), which it sends to the input {9} (Line 11). The

event acceptsClient is executed (Line 14), the message senc(s ,k 2) is sent at output {13} (Line 15)

and stored in variable ˜M 4 and ﬁnally the event termClient is executed (Line 16). These steps

correspond to step 9 of the derivation above.

Finally, the attacker obtains s [] by computing sdec(˜M 4, 2−proj−2−tuple(getmess(adec(˜M 3,

a 1 )))).

This trace shows that there is an attack against the secrecy of s, it corresponds to the attack against the

protocol outlined in Figure 3.1.

Another way to represent an attack found by ProVerif is by a graph. For instance, the attack explained

previously is shown in Figure 3.5. To obtain such a graph, use the command-line option -graph or -html

28 CHAPTER 3. USING PROVERIF

Figure 3.5 Handshake protocol attack trace

described in Section 6.6.1. The detailed version is built when set traceDisplay = long. has been added

to the input .pv ﬁle. The graph starts always with two processes: the honest one, and the attacker. The

progress of the attack is represented vertically. Parallel processes are represented by several columns.

Replications of processes are denoted by nodes labeled by !, with a column for each created process.

Processes fork when a parallel composition is reduced. The termination of a process is represented by a

point. An output on a public channel is represented by a horizontal arrow from the process that makes

the output to the attacker. The edge is labeled with an equality X = M where M is the sent message

and X is a fresh variable (or tuple of variables) in which the adversary stores it. An input on a public

channel is represented by an arrow from the attacker to the receiving process, labeled with an equality

R = M, where R is the computation performed by the attacker to obtain the sent message M. The

message M is omitted when it is exactly equal to R, for instance when R is a constant. A communication

made on a private channel is represented by an arrow from the process that outputs the message to the

process that receives it; this arrow is labeled with the message. Creation of nonces and other steps are

represented in boxes. Information about the attack is written in red; the displayed information depends

on the security property that is broken by the attack. The text “a trace has been found” is written at

the top of the ﬁgure, possibly with assumptions necessary for the attack. When labels are too long to

3.3. UNDERSTANDING PROVERIF OUTPUT 29

ﬁt on arrows, a table of abbreviations appears at the top right of the ﬁgure.

Let us take a closer look at Figure 3.5. First, two new secret keys are created by the honest pro-

cess. Then the corresponding public keys are sent on a public channel; the attacker receives them

and stores them in ˜M and ˜M 1. Next, a parallel reduction is made. We obtain two processes

which replicate themselves once each. The ﬁrst process (clientA) sends its public key on a pub-

lic channel, and the attacker receives it. Then the attacker sends the message pk(a 1), containing

its own public key, to the second process serverB. This process then creates a new shared key k 2

and executes the event acceptsServer(k 2,pk(a 1)). It sends the message aenc(sign((spk(skB 1), k 2),

skB 1), pk(a 1)) on a public channel; the attacker receives it and stores it in ˜M 3. The attacker

computes aenc(adec(˜M 3,a 1),˜M)), that is, it decrypts and reencrypts the message, thus obtaining

aenc(sign((spk(skB 1),k 2),skB 1),pk(skA 1)). It sends that message to clientA. The process clientA

executes the event acceptsClients(k 2) and sends the message senc(s ,k 2). The attacker receives it and

stores it in ˜M 4. Finally, the attacker computes sdec(˜M 4,2−proj−2−tuple(getmess(adec(˜M 3,a 1)))),

and obtains the secret s. This point is mentioned in the red box at the bottom right of the page. The

process clientA executes the last event termClient, and terminates. This is the end of the attack. The

line numbers of each step appear in green in boxes. The keywords are written in blue, while the names

of processes are written in green.

For completeness, we present the complete formalization of the rectiﬁed protocol, which ProVerif can

successfully verify, below and in the ﬁle docs/ex handshake annotated fixed.pv.

1 ( Symmetric key en c r yp t io n )

3 type key .

4 fun s enc ( b i t s t r i n g , key ) : b i t s t r i n g .

5 reduc f o r a l l m: b i t s t r i n g , k : key ; sde c ( s enc (m, k ) , k ) = m.

8 ( Asymmetric key en c ry p t io n )

10 type skey .

11 type pkey .

13 fun pk ( ske y ) : pkey .

14 fun aenc ( b i t s t r i n g , pkey ) : b i t s t r i n g .

16 reduc f o r a l l m: b i t s t r i n g , sk : sk ey ; adec ( aenc (m, pk ( sk ) ) , sk ) = m.

19 ( D i g i t a l s i g n a t u r e s )

21 type ss ke y .

22 type spkey .

24 fun spk ( s sk ey ) : spkey .

25 fun s i g n ( b i t s t r i n g , s sk ey ) : b i t s t r i n g .

27 reduc f o r a l l m: b i t s t r i n g , s s k : s sk ey ; getmess ( s i g n (m, ss k ) ) = m.

28 reduc f o r a l l m: b i t s t r i n g , s s k : s sk ey ; ch e ck s ig n ( s i g n (m, ss k ) , spk ( s sk ) ) = m.

31 free c : chan nel .

33 free s : b i t s t r i n g [ private ] .

34 query attacker ( s ) .

36 event a c c e p t s C l i e n t ( key ) .

30 CHAPTER 3. USING PROVERIF

37 event a c c e p t s Se r v e r ( key , pkey ) .

38 event t er mCl ie nt ( key , pkey ) .

39 event ter mSer ver ( key ) .

41 query x : key , y : pkey ; event ( t er mC li ent ( x , y))==>event ( a c c e p t s S e r v er (x , y ) ) .

42 query x : key ; inj−event ( term Serv er ( x))==>inj−event ( ac c e p t s C l i e n t ( x ) ) .

44 le t c l i e nt A (pkA : pkey , skA : skey , pkB : spkey ) =

45 out ( c , pkA ) ;

46 in ( c , x : b i t s t r i n g ) ;

47 l e t y = adec ( x , skA ) in

48 l e t (=pkA,=pkB , k : key ) = ch ec k si g n ( y , pkB) in

49 event a c c e p t s C l i e n t ( k ) ;

50 out ( c , se nc ( s , k ) ) ;

51 event te rm Cl ien t ( k , pkA ) .

53 le t se rve rB (pkB : spkey , skB : sskey , pkA : pkey ) =

54 in ( c , pkX : pkey ) ;

55 new k : key ;

56 event a c c e p t s S e r v e r ( k , pkX ) ;

57 out ( c , aenc ( s i g n ( ( pkX , pkB , k ) , skB ) ,pkX ) ) ;

58 in ( c , x : b i t s t r i n g ) ;

59 l e t z = s dec ( x , k ) in

60 i f pkX = pkA then event t er mSer ver ( k ) .

62 process

63 new skA : skey ;

64 new skB : s sk ey ;

65 l e t pkA = pk ( skA ) in out ( c , pkA ) ;

66 l e t pkB = spk ( skB ) in out ( c , pkB ) ;

67 ( ( ! c l i e n t A (pkA , skA , pkB ) ) | ( ! se rve rB ( pkB , skB , pkA) ) )

3.4 Interactive mode

As indicated in Section 1.4, ProVerif comes with a program proverif interact which allows to simulate

the execution of a process run. There are two ways to launch this program. By typing the name of

the program. It then opens a ﬁle chooser dialog allowing to choose a .pv or .pcv ﬁle containing the

description of the protocol. (.pcv ﬁles are for CryptoVerif compatibility, see Section 6.8. To choose a

.pcv ﬁle, you ﬁrst need to change the ﬁlter at the bottom right of the ﬁle chooser dialog.) The other

way is by typing the name of the program, followed by the path of the .pv or .pcv ﬁle. In this case, the

simulator starts directly. When the input ﬁle is correctly loaded, a window appears, as in Figure 3.6,

where the loaded ﬁle is the model of the handshake protocol, available in docs/ex

handshake.pv.

3.4.1 Interface description

The simulator is made of a main window which allows to make reduction steps on running processes.

This window contains several columns representing the current state of the run. The ﬁrst column, titled

“Public”, contains all public elements of the current state. For example, after loading the ﬁle containing

the handshake protocol, the channel c appears in the public column as expected, since c is declared

public in the input ﬁle (see Figure 3.6). The last columns show processes that are currently running

in parallel. To make a reduction step on a speciﬁc process, you can click on the head of the column

representing the process to reduce. To allow the attacker to create a nonce, there is a button “New

nonce”, or an option in the “Reduction” menu, or a keyboard shortcut Ctrl+C. If the types are not

ignored (by including set ignoreTypes = false in your input ﬁle, see Section 6.6.2), a dialog box opens

3.4. INTERACTIVE MODE 31

Figure 3.6 Handshake protocol - Initial simulator window

and asks the type of the nonce. When a nonce is created, it is added to the public elements of the

current state. To go one step backward, there is a button “Backward”, or an option in the “Reduction”

menu, or a keyboard shortcut Ctrl+B. The button “Forward”, the option “Forward” of the “Reduction”

menu, or the keyboard shortcut Ctrl+F allow the user to re-execute a step that has been undone by the

“Backward” button. The button “Add a term to public ” is explained in Section 3.4.5. The interface

also allows to display a drawing of the current trace in a new window by clicking on “Display trace”

in the “Show” menu, or by hitting Ctrl+D. Each time a new reduction step is made, the drawing is

refreshed. The trace can be saved by selecting “Save File” in the “Save” menu, or hitting Ctrl+S.

One of these formats: .png, .pdf, .jpg or .eps, must be used to save the ﬁle, and the name of

the ﬁle with its extension must be given. Note that a more detailed version of the trace is available if

set traceDisplay = long. has been added to the input ﬁle. The main window and the menu also contains

two other options: “Next auto-step” and “All auto-steps”. We explain this functionality in the next

section.

3.4.2 Manual and auto-reduction

There are two kinds of processes. The ones on which the ﬁrst reduction can be done without the

intervention of the user (called auto-reducible processes), and the ones that require the intervention of

the user (called manually-reducible processes).

The processes 0, P | Q, new n : t; P , let x = M in P else Q, if M then P else Q, and

event e(M

, . . . , M

); P are all auto-reducible.

The process !P is manually reducible.

The process out(M, N ); P is auto-reducible if the channel M is public, or the evaluation of the

message N or of the channel M fails. Otherwise, it is a manually-reducible process.

The process in(M, x : T ); P is auto-reducible if the evaluation of the channel M fails. Otherwise,

it is a manually-reducible process.

When auto-reducible processes are running and you press the button “All auto-steps” (or if you select this

option on the menu), it reduces all auto-reducible processes that are running. When you press the button

“Next auto-step”, it makes one step of reduction on the ﬁrst auto-reducible process. Manually-reducible

processes can be reduced only by clicking on the head of their column.

32 CHAPTER 3. USING PROVERIF

3.4.3 Execution of 0, P | Q, !P , new, let, if, and event

The reduction of 0 just removes the process. The reduction of P | Q separates the process P | Q into

two processes P and Q (a column is added to the main window). The reduction of !P adds a copy of

P in a new column at the left of !P . The reduction of new n : t; P creates a fresh nonce local to the

process P . The reduction of let x = M in P else Q evaluates M. If this evaluation succeeds, then the

process becomes P with the result of M substituted for x. Otherwise, the process becomes Q. The

reduction of if M then P else Q evaluates M . If M evaluates to true, then the process becomes P .

If the evaluation of M succeeds and M evaluates to a value other than true, then the process becomes

Q. If the evaluation of M fails, then the process is removed. The reduction of event e(M

, . . . , M

); P

evaluates M

, . . . , M

. If these evaluations succeed, the process becomes P . Otherwise, the process is

removed. The user can display a column titled “Events”, showing the list of executed events by selecting

the item “Show/hide events” in the “Show” menu or using the keyboard shortcut Ctrl+E.

3.4.4 Execution of inputs and outputs

They are several possible kinds of inputs and outputs, depending on whether the process is auto-reducible

or not, and on whether the channel is public or not. Let us ﬁrst consider the case of out(M, N ); P .

If the process is auto-reducible because the evaluation of the channel M or of the message N fails,

then the process is removed.

If the evaluations of the message N and the channel M succeed and the channel M is public, then

the output is made as explained in Section 3.1.4. The message is added to the public elements of

the current state. It is displayed as follows ˜M i = N, where ˜M i is a new binder: this binder can

then be used to designate the term N in the computations that the adversary makes in the rest of

the execution. Such computations are called recipes. They are terms built from the binders ˜M i,

the nonces created by the adversary, the names that are initially public, and application of public

functions to recipes. In the general case, the public elements of the current state are represented

in the form binder = recipe = message, where the recipe is the computation that the adversary

makes to obtain the corresponding message, and the binder can be used to designate that message

in future recipes. To lighten the display, the binder is omitted when it is equal to the recipe, and

the recipe is omitted when it is equal to the message itself.

If the evaluations of the message N and the channel M succeed but the channel M is not known

to be public (this case is displayed “Output (private)” in the head of the column), then there are

two possibilities.

– Prove that the channel is in fact public, and make a public communication. To do so, a recipe

using public elements of the current state must be given. If this recipe is evaluated as equal

to the channel, a public output on this channel is made.

– Make a private communication on this channel between two processes. If this choice has been

made, the list of all the input processes on the same channel appears in the main window.

The user chooses the process that will receive the output message. If there is no such process,

the reduction is not possible and an error message appears.

Let us now consider the case of in(M ,x : T ); P .

If the evaluation of the channel M fails, then the process is removed.

If the evaluation of the channel M succeeds and the channel is public, then a pop-up window

opens, and the user gives the message to send on the channel. The message is given in the form

of a recipe, which can contain recipes of public elements of the current state, and applications of

public functions. In case the recipe is wrongly typed, if types are ignored (the default), then a

warning message box appears, allowing the user to choose to continue or go back. If types are not

ignored (the input ﬁle contains set ignoreTypes = false), an error message box appears, and a new

message must be given.

3.4. INTERACTIVE MODE 33

If the evaluation of the channel M succeeds and the channel is not known to be public (this case

is displayed “Input (private)” in the head of the column), then the program works similarly to

the case of a private output. There are again two possibilities: prove that the channel is public

by giving a recipe and make an input from the adversary, or choose an output process to make a

private communication between these processes as explained above.

In addition to the public functions explicitly deﬁned in the input ﬁle, recipes can also contain projec-

tion functions. The syntax for projections associated to tuples diﬀers depending on whether types

are ignored or not. If types are ignored (the default), then the i-th projection of a tuple of ar-

ity m is written i−proj−m−tuple. Otherwise, when the input ﬁle contains set ignoreTypes = false,

i−proj−<type

>− . . . −<type

>−tuple is the i-th projection of a tuple of arity m, when <type

> is the

type of the n-th argument of the tuple. For instance, 2−proj−channel−bitstring−tuple is the second pro-

jection of a pair with arguments of type channel and bitstring , so 2−proj−channel−bitstring−tuple((c,

m)) = m, where c is a channel and m is a bitstring. The i-th projection of a previously deﬁned data

constructor f (see Section 4.1.2) is written i−proj−f .

3.4.5 Button “Add a term to public”

Please recall that the elements in public are of the form binder = recipe = message (see Section 3.4.4 for

more information on public elements). Clicking the button “Add a term to public” allows the user to

add a public term to the current state computed by attacker. The user gives the recipe that the attacker

uses to compute this term. It is then evaluated. If the evaluation fails, an error message appears. If the

evaluation succeeds, an entry ˜M i = recipe = t is added to the column “Public”, where t is the result of

the evaluation of the recipe and ˜M i is a fresh binder associated to it. ˜M i can then be used in future

recipes in order to represent the term t.

3.4.6 Execution of insert and get

You can ignore this section if you do not use tables, deﬁned in Section 4.1.5. The constructs insert and

get respectively insert an element in a table and read a table.

The process insert d(M

, . . . , M

); P is auto-reducible if it is the only process or if the evaluation

of one of the M

fails. To insert an element, just click on the head of the column representing the

insert process to reduce. If the evaluation succeeds, the element is inserted and appears in the column

“Tables”. Otherwise, the process is removed. The user can display a column titled “Tables”, containing

all elements of tables obtained by insert steps, by selecting the item “Show/hide tables” in the “Show”

menu or using the keyboard shortcut Ctrl+T.

The process get d(T

, . . . , T

) suchthat M in P else Q is never auto-reducible. To get an element

from a table, click on the head of the column to reduce. Three cases are possible, depending on the set

of terms in the table d that match the patterns T

, . . . T

and satisfy the condition M . First, if there is

no such term, then the else branch of the get is executed. Second, if there is only one such term, then

this term is selected, and the in branch is executed with the variables of T

, . . . T

instantiated to match

this term, as explained in Section 4.1.5. Or third, if there are several such terms, then a window showing

all the possible terms is opened. To make the reduction, double-click on the chosen term.

3.4.7 Handshake run in interactive mode

Let us see how to execute a trace similar to the one represented in Figure 3.5 starting from Figure 3.6.

First, a click on the “All auto-steps” button will lead to the situation represented in Figure 3.7:

the honest process ﬁrst creates two secret keys, then output a ﬁrst public key after a let, and then

a second one after another let on channel c. The attacker stores these public keys in fresh variables

˜M 2 and ˜M 3. A parallel reduction is then made after that.

The ﬁrst process ClientA can now be replicated, by clicking “Replication” at the top of its column.

Three processes are obtained. The ﬁrst process can make an output by clicking on “Next auto-

step”.

34 CHAPTER 3. USING PROVERIF

Figure 3.7 Handshake protocol - Simulator window 1

Figure 3.8 Handshake protocol - Simulator window 2

The process ServerB is then replicated by clicking on the column representing the third process. A

click on “New nonce” allows the attacker to create his secret key n, which is added to the public

elements of the current state. The message pk(n) can then be input on channel c by clicking on

the same column and giving pk(n) as recipe. The result is shown in Figure 3.8.

A new click on the third process creates a fresh key k 2. Another click sends the message

aenc(sign(spk(skB 2), k 2), skB 2, pk(n)), and the attacker stores this message in a fresh vari-

able ˜M 4.

The message aenc(adec(˜M 4, n),˜M 2) can then be input on channel c, by clicking on the ﬁrst

process and giving aenc(adec(˜M 4, n),˜M 2) as recipe.

A click on the “All auto-steps” makes all possible reductions on the ﬁrst process, leading to the

output of the message senc(s, k 2) stored by the attacker in a variable ˜M 5. It leads to the

window represented in Figure 3.9, and to a trace similar to the one represented in Figure 3.5.

Finally, by clicking the button “Add a term to public” and giving the recipe sdec(˜M 5,

2−proj−2−tuple(getmess(adec(˜M 4,n)))), the attacker computes this recipe and obtains the se-

cret s. The secret s is then added to the set of public terms.

Figure 3.9 Handshake protocol - Simulator window 3

3.4. INTERACTIVE MODE 35

3.4.8 Advanced features

If the process representing by the input ﬁle contains subterms of the form choice[L,R] or diﬀ [L,R] (see

Section 4.3.2), a pop-up window will ask the user to choose either the ﬁrst or the second component of

choice, or the biprocess (process with choice[L,R]). If the user choses the ﬁrst or second component,

all instances of choice inside the process will then be replaced accordingly. Otherwise, the tool runs the

processes using the semantics of biprocesses. If the input ﬁle is made to test the equivalence between two

processes P

and P

(see Section 4.3.2), a pop-up window will ask the user to choose to emulate either

or P

The processes let ... suchthat ... (see Section 6.3) and sync (see Section 4.1.7) are not supported

yet. Passive adversaries (the setting set attacker = passive., see Section 6.6.2) and key compromise

(the setting set keyCompromise = approx. or set keyCompromise = strict., see Section 6.6.2) are not

supported either. The simulator always simulates an active adversary without key compromise, even if

diﬀerent settings are present.

The command line options -lib [filename] (see Section 6.6.1), and -commandGraph (used to deﬁne

the command for the creation of the graph trace from the dot ﬁle generated by the simulator) can be

used.

36 CHAPTER 3. USING PROVERIF

Chapter 4

Language features

In the previous chapter, the basic features of the language were introduced; we will now provide a more

complete coverage of the language features. These features will be used in Chapter 5 to study the

Needham-Schroeder public key protocol as a case study. More advanced features of the language will be

discussed in Chapter 6 and the complete input grammar is presented in Appendix A for reference; the

features presented in this chapter should be suﬃcient for most users.

4.1 Primitives and modeling features

In Section 3.1.1, we introduced the basic components of the declarations of the language and how to

model processes; this section will develop our earlier presentation.

4.1.1 Constants

A constant may be deﬁned as a function of arity 0, for example “fun c() : t.” ProVerif also provides a

speciﬁc construct for constants:

const c : t .

where c is the name of the constant and t is its type. Several constants of the same type t can be declared

const c

, . . . , c

: t .

4.1.2 Data constructors and type conversion

Constructors fun f(t

, . . . , t

) : t. may be declared as items of data by appending [data], that is,

fun f(t

, . . . , t

) : t [ data ] .

A constructor declared as data is similar to a tuple: the attacker can construct and decompose data

constructors. In other words, declaring a data constructor f as above implicitly declares n destructors

that map f(x

, . . . , x

) to x

, where i ∈ {1, . . . , n}. One can inverse a data constructor by pattern-

matching: the pattern f (T

, . . . , T

) is added as pattern in the grammar of Figure 3.3. The type of

, . . . , T

is the type of the arguments of f, so when T

is a variable, its type can be omitted. For

example, with the declarations

type key .

type h ost .

fun keyh os t ( key , ho st ) : b i t s t r i n g [ data ] .

we can write

l et keyh ost ( k , h) = x in . . .

38 CHAPTER 4. LANGUAGE FEATURES

Constructors declared data cannot be declared private.

One application of data constructors is type conversion. As discussed in Section 3.1.1, the type

system occasionally makes it diﬃcult to apply functions to arguments due to type mismatches. This can

be overcome with type conversion. A type converter is simply a special type of data constructor deﬁned

as follows:

fun tc(t) : t

′

[ typeConverter ] .

where the type converter tc takes input of type t and returns a result of type t

′

. Observe that, since the

constructor is a data constructor, the attacker may recover term M from the term tc(M). Intuitively,

the keyword typeConverter means that the function is the identity function, and so has no eﬀect

except changing the type. By default, types are used for typechecking the protocol but during protocol

veriﬁcation, ProVerif ignores types. The typeConverter functions are thus removed. (This behavior

allows ProVerif to detect type ﬂaw attacks, in which the attacker mixes data of diﬀerent types. This

behavior can be changed by the setting set ignoreTypes = ... as discussed in Section 6.6.2.)

The reverse type conversion, from t

′

to t, should be performed by pattern-matching:

l et tc(x) = M in . . .

where M is of type t

′

and x is of type t. This construct is allowed since type converters are data

constructors. When one deﬁnes a type converter tc(t) : t

′

from type t to t

′

, all elements of type t can be

converted to type t

′

, but the only elements of type t

′

that can be converted to type t are the elements

of the form tc(M). Hence, for instance, it is reasonable to deﬁne a type converter from a type key

representing 128-bit keys to type bitstring , but not in the other direction, since all 128-bit keys are

bitstrings but only some bitstrings are 128-bit keys.

4.1.3 Natural numbers

Natural numbers are natively supported and have the built-in type nat. Internally, ProVerif models

natural numbers following the Peano axioms, that is, it considers a constant 0 of type nat and a data

constructor for successor. As such, all natural numbers are terms and can be used with other user-deﬁned

functions. A term is said to be a natural number if it is the constant 0 or the application of the successor

to a natural number. The grammar of terms (Figure 3.2) is extended in Figure 4.1 to consider the

built-in inﬁx functions manipulating natural numbers.

Figure 4.1 Natural number grammar

M, N ::= terms

...

i natural number (i ∈ N)

M + i addition (i ∈ N)

i + M addition (i ∈ N)

M − i subtraction (i ∈ N)

M > N greater

M < N smaller

M >= N greater or equal

M <= N smaller or equal

Finally, ProVerif has a built-in boolean function is nat checking whether a term is a natural number

of not, that is, is nat(M ) returns true if and only if M is equal modulo the equational theory to a natural

number.

Note that addition between two arbitrary terms is not allowed. The order relations >, <, >=, <= are

internally represented by boolean destructor functions that compare the value of two natural numbers.

As such, M > N returns true (resp. false ) if M and N are both natural numbers and M is strictly

greater than (resp. smaller or equal to) N. Note that M > N fails if M or N is not a natural number.

Similarly, the subtraction is internally represented by a destructor function and for instance, M − i fails

if M is a natural number strictly smaller than i. It corresponds to the fact that negative numbers are

not allowed in ProVerif.

4.1. PRIMITIVES AND MODELING FEATURES 39

Restrictions. Since natural numbers are represented with a constant 0 and a data constructor succes-

sor, the attacker can generate all natural numbers. Therefore, ProVerif does not allow the declaration of

new names with the type nat, i.e., new k:nat, since it would allow a process to generate a term declared

as a natural number but that does not satisfy the Peano axioms. Similarly, user deﬁned constructors

cannot have nat as their return type. However, this restriction does not apply to destructors. Finally,

all functions can have nat as argument type. For example, the following declarations and process are

allowed.

1 type key .

3 f re e c : c hanne l .

5 f re e s : b i t s t r i n g [ private ] .

7 fun i e n c ( nat , key ) : b i t s t r i n g .

8 fun i d e c ( b i t s t r i n g , key ) : nat

9 reduc f o r a l l x : nat , y : key ; i d e c ( i e n c (x+1,y ) , y ) = x .

11 query attacker ( s ) .

13 process

14 new k : key ; (

15 out ( c , i e n c (2 , k ) )

16 | in ( c , x : nat ) ; in ( c , y : b i t s t r i n g ) ; i f x + 3 > i d e c (y , k ) then out ( c , s )

17 )

The function idec is allowed to have nat as return type as it is declared as a destructor. In this

example, the query is false since the attacker can obtain s by inputting any natural number for x. Note

that the test if x + 3 > idec(y,k) then . . . is not equivalent to if x > idec(y,k) − 3 then . . .. Indeed,

in the latter, ProVerif ﬁrst evaluates the terms x and idec(y,k) − 3 before comparing their values. In

our example, idec(y,k) − 3 will always fail since the only case where the evaluation of idec(y,k) would

not fail is when y is equal to ienc (2,k). In such a case, idec(y,k) would be evaluated to 1 but then the

evaluation of 1 − 3 would fail. Hence, the query attacker(s) is true for the following process:

1 process

2 new k : key ; (

3 out ( c , i e n c (2 , k ) )

4 | in ( c , x : nat ) ; in ( c , y : b i t s t r i n g ) ; i f x > i d e c (y , k ) − 3 then out( c , s )

5 )

4.1.4 Enriched terms

For greater ﬂexibility, we redeﬁne our grammar for terms (Figures 3.2 and 4.1) to include restrictions,

conditionals, and term evaluations as presented in Figure 4.2. The behavior of enriched terms will now

be discussed. Names, variables, tuples, and constructor/destructor application are deﬁned as standard.

The term new a : t; M constructs a new name a of type t and then evaluates the enriched term M .

The term if M then N else N

′

is deﬁned as N if the condition M is equal to true and N

′

when M

does not fail but is not equal to true. If M fails, or the else branch is omitted and M is not equal to

true, then the term if M then N else N

′

fails (like when no rewrite rule matches in the evaluation of

a destructor). Similarly, let T = M in N else N

′

is deﬁned as N if the pattern T is matched by M,

and the variables of T are bound by this pattern-matching. As before, if the pattern is not matched,

then the enriched term is deﬁned as N

′

; and when the else branch is omitted, the term fails. The term

event e(M

, . . . , M

); M executes the event e(M

, . . . , M

) and then evaluates the enriched term M .

The use of enriched terms will be demonstrated in the Needham-Schroeder case study in Section 5.3.

ProVerif’s internal encoding for enriched terms. Enriched terms are a convenient tool for the end

user; internally, ProVerif handles such constructs by encoding them: the conditional if M then N else N

′

40 CHAPTER 4. LANGUAGE FEATURES

Figure 4.2 Enriched terms grammar

M, N ::= enriched terms

a, b, c, k, m, n, s names

x, y, z variables

, . . . , M

) tuple

h(M

, . . . , M

) constructor/destructor application

i natural number (i ∈ N)

M + i addition (i ∈ N)

i + M addition (i ∈ N)

M − i subtraction (i ∈ N)

M > N greater

M < N smaller

M >= N greater or equal

M <= N smaller or equal

M = N term equality

M <> N term disequality

M && M conjunction

M || M disjunction

not(M) negation

new a : t; M name restriction

if M then N else N

′

conditional

let T = M in N else N

′

term evaluation

event e(M

, . . . , M

); M event

is encoded as a special destructor also displayed as if M then N else N

′

; the restriction new a : t; M

is expanded into a process; the term evaluation let T = M in N else N

′

is encoded as a mix of processes

and special destructors. As an example, let us consider the following process.

1 free c : chan nel .

3 free A: b i t s t r i n g .

4 free B: b i t s t r i n g .

6 process

7 in ( c , ( x : b i t s t r i n g , y : b i t s t r i n g ) ) ;

8 i f x = A | | x = B then

9 l e t z = ( i f y = A then new n : b i t s t r i n g ; ( x , n ) e ls e ( x , y ) ) in

10 out ( c , z )

The process takes as input a pair of bitstrings x,y and checks that either x=A or x=B. The term

evaluation let z = (if y = A then new n:bitstring; (x,n) else (x,y)) in is deﬁned using the enriched

term if y = A then new n:bitstring; (x,n) else (x,y) which evaluates to the tuple (x,n) where n is a

new name of type bitstring if y=A; or (x,y) otherwise. (Note that brackets have only been added for

readability.) Internally, ProVerif encodes the above main process as:

1 in ( c , ( x : b i t s t r i n g , y : b i t s t r i n g ) ) ;

2 i f ( ( x = A) | | ( x = B) ) then

3 new n : b i t s t r i n g ;

4 le t z : b i t s t r i n g = ( i f ( y = A) then (x , n ) el se (x , y ) ) in

5 out ( c , z )

This encoding sometimes has visible consequences on the behavior of ProVerif. Note that this process

was obtained by beautifying the output produced by ProVerif (see Section 3.3 for details on ProVerif

output).

4.1. PRIMITIVES AND MODELING FEATURES 41

4.1.5 Tables and key distribution

ProVerif provides tables (or databases) for persistent storage. Tables must be speciﬁed in the declarations

in the following form:

table d(t

, . . . , t

) .

where d is the name of the table which takes records of type t

, . . . , t

. Processes may populate and

access tables, but deletion is forbidden. Note that tables are not accessible by the attacker. Accordingly,

the grammar for processes is extended:

insert d(M

, . . . , M

); P insert record

get d(T

, . . . , T

) in P else Q read record

get d(T

, . . . , T

) suchthat M in P else Q read record

The process insert d(M

, . . . , M

); P inserts the record M

, . . . , M

into the table d and then executes

P ; when P is the 0 process, it may be omitted. The process get d(T

, . . . , T

) in P else Q attempts

to retrieve a record in accordance with patterns T

, . . . , T

. When several records can be matched,

one possibility is chosen (but ProVerif considers all possibilities when reasoning) and the process P

is evaluated with the free variables of T

, . . . , T

bound inside P . When no such record is found, the

process Q is executed. The else branch can be omitted; in this case, when no suitable record is found, the

process blocks. The get process also has a richer form get d(T

, . . . , T

) suchthat M in P else Q; in

this case, the retrieved record is required to satisfy the condition M in addition to matching the patterns

, . . . , T

. The grammar for enriched terms is extended similarly:

insert d(M

, . . . , M

); M insert record

get d(T

, . . . , T

) in N else N

′

read record

get d(T

, . . . , T

) suchthat M in N else N

′

read record

When the else branch of get is omitted in an enriched term, it equivalent to else fail .

The use of tables for key management will be demonstrated in the Needham-Schroeder public key

protocol case study (Chapter 5).

As a side remark, tables can be encoded using private channels. We provide a speciﬁc construct since

it is frequently used, it can be analyzed precisely by ProVerif (more precisely than some other uses of

private channels), and it is probably easier to understand for users that are not used to the pi calculus.

4.1.6 Phases

Many protocols can be broken into phases, and their security properties can be formulated in terms of

these phases. Typically, for instance, if a protocol discloses a session key after the conclusion of a session,

then the secrecy of the data exchanged during that session may be compromised but not its authenticity.

To enable modeling of protocols with several phases the syntax for processes is supplemented with a

phase preﬁx phase t; P, where t is a positive integer. Observe that all processes are under phase 0 by

default and hence the instruction phase 0 is not allowed. Intuitively, t represents a global clock, and the

process phase t; P is active only during phase t. A process with phases is executed as follows. First, all

instructions under phase 0 are executed, that is, all instructions not under phase i ≥ 1. Then, during

a stage transition from phase 0 to phase 1, all processes which have not yet reached phase i ≥ 1 are

discarded and the process may then execute instructions under phase 1, but not under phase i ≥ 2. More

generally, when changing from phase n to phase n + 1, all processes which have not reached a phase

i ≥ n + 1 are discarded and instructions under phase n + 1, but not for phase i ≥ n + 2, are executed.

It follows from our description that it is not necessary for all instructions of a particular phase to be

executed prior to phase transition. Moreover, processes may communicate only if they are under the

same phase.

Phases can be used, for example, to prove forward secrecy properties: the goal is to show that, even

if some participants get corrupted (so their secret keys are leaked to the attacker), the secrets exchanged

in sessions that took place before the corruption are preserved. Corruption can be modeled in ProVerif

by outputting the secret keys of the corrupted participants in phase 1; the secrets of the sessions run in

phase 0 should be preserved. This is done for the ﬁxed handshake protocol of the previous chapter in

the following example (ﬁle docs/ex handshake forward secrecy skB.pv):

42 CHAPTER 4. LANGUAGE FEATURES

1 free c : chan nel .

3 free s : b i t s t r i n g [ private ] .

4 query attacker ( s ) .

6 le t c l i e nt A (pkA : pkey , skA : skey , pkB : spkey ) =

7 out ( c , pkA ) ;

8 in ( c , x : b i t s t r i n g ) ;

9 l e t y = adec ( x , skA ) in

10 l e t (=pkA,=pkB , k : key ) = ch ec k si g n ( y , pkB) in

11 out ( c , se nc ( s , k ) ) .

13 le t se rve rB (pkB : spkey , skB : sskey , pkA : pkey ) =

14 in ( c , pkX : pkey ) ;

15 new k : key ;

16 out ( c , aenc ( s i g n ( ( pkX , pkB , k ) , skB ) ,pkX ) ) ;

17 in ( c , x : b i t s t r i n g ) ;

18 l e t z = s dec ( x , k ) .

20 process

21 new skA : skey ;

22 new skB : s sk ey ;

23 l e t pkA = pk ( skA ) in out ( c , pkA ) ;

24 l e t pkB = spk ( skB ) in out ( c , pkB ) ;

25 ( ( ! c l i e n t A (pkA , skA , pkB ) ) | ( ! se rve rB ( pkB , skB , pkA) ) |

26 phase 1 ; out( c , skB ) )

The secret key skB of the server B is leaked in phase 1 (last line). The secrecy of s is still preserved in

this example: the attacker can impersonate B in phase 1, but cannot decrypt messages of sessions run

in phase 0. (Note that one could hope for a stronger model: this model does not consider sessions that

are running precisely when the key is leaked. While the attacker can simulate B in phase 1, the model

above does not run A in phase 1; one could easily add a model of A in phase 1 if desired.) In contrast, if

the secret key of the client A is leaked, then the secrecy of s is not preserved: the attacker can decrypt

the messages of previous sessions by using skA, and thus obtain s.

4.1.7 Synchronization

The synchronization command sync t [tag] introduces a global synchronization [BS16], which has some

similarity with phases.

The synchronization level t must be a positive integer. Synchronizations sync t cannot occur under

replications. Synchronizations with the same level t and the same tag tag are considered as the “same

synchronization”, that is, synchronizations with the same level t and the same tag tag are allowed only

in diﬀerent branches of if , let, let . . . suchthat, get. Since only one of these branches will be executed

at runtime, at most one synchronization with a given level t and tag tag can be reached.

The global synchronizations must be executed in increasing order of level t. The process waits until

sync t commands with all existing tags at level t are reached before executing the synchronization t.

More precisely, assuming t is the smallest synchronization level that occurs in the initial process and has

not been executed yet, if the initial process contains commands sync t with tags tag

, . . . , tag

, then

the process waits until it reaches exactly commands sync t with tags tag

, . . . , tag

, then it executes

the synchronization t and continues after the sync t commands. So, in contrast to phases, processes are

never discarded by synchronization, but the process may block in case some synchronizations cannot be

reached or are discarded for instance by a test that fails above them.

The tags of synchronizations are determined as follows:

The user can specify the tag of the synchronization by writing sync t [tag]. When the user omits

the tag and just writes sync t, ProVerif gives it a fresh tag.

4.2. FURTHER CRYPTOGRAPHIC OPERATORS 43

When a synchronization occurs inside a process macro and the process macro is expanded, a tag

preﬁx is added to all synchronizations inside the process macro. The preﬁx p is speciﬁed by writing

[sync: tag preﬁx p] at the expansion of the process macro. For instance:

l et P( x : b i t s t r i n g )=

sync 1 [T ] ;

out ( c , x ) .

process

P( a ) [ sync : tag p r e f i x T1 ] | P( b) [ sync : tag p r e f i x T2 ]

yields the process

sync 1 [ T1 T ] ; out ( c , a ) | sync 1 [ T2 T ] ; out( c , b )

(The preﬁx is separated from the tag by an underscore.) When the indication [sync: tag preﬁx p]

is omitted, ProVerif chooses a fresh preﬁx. One can tell ProVerif not to add a preﬁx, that

is, leave the tags of synchronizations unchanged, by writing [sync: no tag preﬁx ] instead of

[sync: tag preﬁx p].

Therefore, when all tags of synchronizations and tag preﬁxes of process macros are omitted, all synchro-

nizations in the resulting process have distinct tags. This is suitable when these synchronizations occur

in parallel processes.

When synchronizations occur in branches of tests, one typically wants them to have the same tag

(because otherwise the synchronization would block). So one would write for instance

i f . . . then (. . . sync 1 [T ] ; . . .) el se (. . . sync 1 [T ] ; . . .)

i f . . . then (. . . P(. . .) [ sync : t ag p r e f i x T ] )

el se ( . . . P( . . . ) [ sync : t ag p r e f i x T ] )

Synchronizations cannot be used with phases. Synchronizations are implemented in ProVerif by

translating them into outputs and inputs; the translated process is displayed by ProVerif. Further

discussion of synchronization with an example can be found in Section 4.3.2, page 62.

4.2 Further cryptographic operators

In Section 3.1.1, we introduced how to model the relationships between cryptographic operations and

in Section 3.1.2 we considered the formalization of basic cryptographic primitives needed to model the

handshake protocol. This section will consider more advanced formalisms and provide a small library of

cryptographic primitives.

4.2.1 Extended destructors

We introduce an extended way to deﬁne the behaviour of destructors [CB13].

fun g(t

, . . . , t

) : t

reduc f o r a l l x

1,1

: t

1,1

, . . . , x

1,n

: t

1,n

; g(M

1,1

, . . . , M

1,k

) = M

1,0

otherwise . . .

otherwise f o r a l l x

m,1

: t

m,1

, . . . , x

m,n

: t

m,n

; g(M

m,1

, . . . , M

m,k

) = M

m,0

This declaration should be seen as a sequence of rewrite rules rather than as a set of rewrite rules.

Thus, when the term g(N

, . . . , N

) is encountered, ProVerif will try to apply the ﬁrst rewrite rule

of the sequence, forall x

1,1

: t

1,1

, . . . , x

1,n

: t

1,n

; g(M

1,1

, . . . , M

1,k

) = M

1,0

. If this rewrite rule is

applicable, then the term g(N

, . . . , N

) is reduced according to that rewrite rule. Otherwise, ProVerif

tries the second rewrite rule of the sequence and so on. If no rule can be applied, the destructor fails.

This deﬁnition of destructors allows one to deﬁne new destructors that could not be deﬁned with the

deﬁnition of Section 3.1.1.

44 CHAPTER 4. LANGUAGE FEATURES

1 fun eq ( b i t s t r i n g , b i t s t r i n g ) : bo ol

2 reduc f o r a l l x : b i t s t r i n g ; eq (x , x ) = tr ue

3 otherwise f o r a l l x : b i t s t r i n g , y : b i t s t r i n g ; eq ( x , y ) = f a l s e .

With this deﬁnition, eq(M, N ) can be reduced to false only if M and N are diﬀerent modulo the equational

theory.

As previously mentioned, when no rule can be applied, the destructor fails. However, this formalism

does not allow a destructor to succeed when one of its arguments fails. To lift this restriction, we allow

to represent the case of failure by the special value fail.

8 fun t e s t ( bool , b i t s t r i n g , b i t s t r i n g ) : b i t s t r i n g

9 reduc

10 f o r a l l x : b i t s t r i n g , y : b i t s t r i n g ; t e s t ( true , x , y ) = x

11 otherwise f o r a l l c : bool , x : b i t s t r i n g , y : b i t s t r i n g ; t e s t ( c , x , y ) = y

12 otherwise f o r a l l x : b i t s t r i n g , y : b i t s t r i n g ; t e s t ( f a i l , x , y ) = y .

In the previous example, the function test returns the third argument even when the ﬁrst argument fails.

A variable x of type t can be declared as a possible failure by the syntax: x:t or fail. It indicates that

x can be any message or even the special value fail . Relying on this new declaration of variables, the

destructor test could have been deﬁned as follows:

14 fun t e s t ( bool , b i t s t r i n g , b i t s t r i n g ) : b i t s t r i n g

15 reduc

16 f o r a l l x : b i t s t r i n g , y : b i t s t r i n g ; t e s t ( true , x , y ) = x

17 otherwise f o r a l l c : boo l or f a i l , x : b i t s t r i n g , y : b i t s t r i n g ;

18 t e s t ( c , x , y ) = y .

A variant of this test destructor is the following one:

20 fun t e s t ( bool , b i t s t r i n g , b i t s t r i n g ) : b i t s t r i n g

21 reduc

22 f o r a l l x : b i t s t r i n g or f a i l , y : b i t s t r i n g or f a i l ; t e s t ( true , x , y ) = x

23 otherwise f o r a l l c : bool , x : b i t s t r i n g or f a i l , y : b i t s t r i n g or f a i l ;

24 t e s t ( c , x , y ) = y .

This destructor returns its second argument when the ﬁrst argument c is true, its third argument when

the ﬁrst argument c does not fail but is not true, and fails otherwise. With this deﬁnition, when the

ﬁrst argument is true, test returns the second argument even when the third argument fails (which

models that the third argument does not need to be evaluated in this case). Symmetrically, when the

ﬁrst argument does not fail but is not true, test returns the third argument even when the second

argument fails. In contrast, the previous destructor test fails when its second or third arguments fail.

It is also possible to transform the special failure value fail into a non-failure value c0 by a destructor:

27 const c0 : b i t s t r i n g .

28 fun c a t c h f a i l ( b i t s t r i n g ) : b i t s t r i n g

29 reduc

30 f o r a l l x : b i t s t r i n g ; c a t c h f a i l ( x ) = x

31 otherwise c a t c h f a i l ( f a i l ) = c0 .

Such a destructor is used internally by ProVerif.

Let bindings. Similarly to the simple way of deﬁning destructors (see Section 3.1.1), it is possible to

use let bindings within the declaration of each rewrite rule.

4.2.2 Equations

Certain cryptographic primitives, such as the Diﬃe-Hellman key agreement, cannot be encoded as de-

structors, because they require algebraic relations between terms. Accordingly, ProVerif provides an

alternative model for cryptographic primitives, namely equations. The relationships between construc-

tors are captured using equations of the form

4.2. FURTHER CRYPTOGRAPHIC OPERATORS 45

equation f o r a l l x

: t

, . . . , x

: t

; M = N .

where M, N are terms built from the application of (deﬁned) constructor symbols to the variables

, . . . , x

of type t

, . . . , t

. Note that when no variables are required (that is, when terms M, N are

constants) forall x

: t

, . . . , x

: t

; may be omitted.

More generally, one can declare several equations at once, as follows:

equation f o r a l l x

1,1

: t

1,1

, . . . , x

1,n

: t

1,n

; M

= N

;

. . .

f o r a l l x

m,1

: t

m,1

, . . . , x

m,n

: t

m,n

; M

= N

option .

where option can either be empty, [convergent], or [ linear ]. When an option [convergent] or [ linear ]

is present, it means that the group of equations is convergent (the equations, oriented from left to right,

form a convergent rewrite system) or linear (each variable occurs at most once in the left-hand and

once in the right-hand side of each equation), respectively. In this case, this group of equations must

use function symbols that appear in no other equation. ProVerif checks that the convergent or linear

option is correct. However, in case ProVerif cannot prove termination of the rewrite system associated

to equations declared [convergent], it just displays a warning, and continues assuming that the rewrite

system terminates. Indeed, ProVerif’s algorithm for proving termination is obviously not complete,

so the rewrite system may terminate and ProVerif not be able to prove it. The main interest of the

[convergent] option is then to bypass the veriﬁcation of termination of the rewrite system.

Let bindings. Similarly to destructors, it is possible to use let bindings within the declaration of each

equation.

Performance. It should be noted that destructors are more eﬃcient than equations. The use of

destructors is therefore advocated where possible.

Limitations. ProVerif does not support all equations. It must be possible to split the set of equations

into two kinds of equations that do not share constructor symbols: convergent equations and linear

equations. Convergent equations are equations that, when oriented from left to right, form a convergent

(that is, terminating and conﬂuent) rewriting system. Linear equations are equations such that each

variable occurs at most once in the left-hand side and at most once in the right-hand side. When

ProVerif cannot split the equations into convergent equations and linear equations, an error message is

displayed.

Moreover, even when the equations can be split as above, it may happen that the pre-treatment of

equations by ProVerif does not terminate. Essentially, ProVerif computes rewrite rules that encode the

equations and it requires that, when M

, . . . , M

are in normal form, the normal form of f(M

, . . . , M

)

can be computed by a single rewrite step. For some equations, this constraint implies generating an

inﬁnite number of rewrite rules, so in this case ProVerif does not terminate. For instance, associativity

cannot be handled by ProVerif for this reason, which prevents the modeling of primitives such as XOR

(exclusive or) or groups. Another example that leads to non-termination for the same reason is the

equation f(g(x)) = g(f(x)). In the obtained rewrite rules, all variables that occur in the right-hand side

must also occur in the left-hand side.

It is also worth noting that, because ProVerif orients equations from left to right when it builds the

rewrite system, the orientation in which the equations are written may inﬂuence the success or failure of

ProVerif (even if the semantics of the equation obviously does not depend on the orientation). Informally,

the equations should be written with the most complex term on the left and the simplest one on the

right.

Even with these limitations, many practical primitives can be modeled by equations in ProVerif, as

illustrated below.

Diﬃe-Hellman key agreement. The Diﬃe-Hellman key agreement relies on modular exponentiation

in a cyclic group G of prime order q; let g be a generator of G. A principal A chooses a random exponent

a in Z

∗

, and sends g

to B. Similarly, B chooses a random exponent b, and sends g

to A. Then A

46 CHAPTER 4. LANGUAGE FEATURES

computes (g

)

and B computes (g

)

. These two keys are equal, since (g

)

= (g

)

, and cannot be

obtained by a passive attacker who has g

and g

but neither a nor b.

We model the Diﬃe-Hellman key agreement as follows:

1 type G.

2 type exponent .

4 const g : G [ data ] .

5 fun exp (G, exponent ) : G.

7 equation f o r a l l x : exponent , y : exponent ; exp ( exp ( g , x ) , y ) = exp ( exp ( g , y ) , x ) .

The elements of G have type G, the exponents have type exponent, g is the generator g, and exp models

modular exponentiation exp(x,y) = x

. The equation means that (g

)

= (g

)

This model of Diﬃe-Hellman key agreement is limited in that it just takes into account the equation

needed for the protocol to work, while there exist other equations, coming from the multiplicative group

∗

. A more complete model is out of scope of the current treatment of equations in ProVerif, because it

requires an associative function symbol, but extensions have been proposed to handle it [KT09].

Symmetric encryption. We model a symmetric encryption scheme for which one cannot distinguish

whether decryption succeeds or not. We consider the binary constructors senc and sdec, the arguments

of which are of types bitstring and key.

1 type key .

3 fun s enc ( b i t s t r i n g , key ) : b i t s t r i n g .

4 fun s dec ( b i t s t r i n g , key ) : b i t s t r i n g .

To model the properties of decryption, we introduce the equations:

5 equation f o r a l l m: b i t s t r i n g , k : key ; s dec ( s enc (m, k ) , k ) = m.

6 equation f o r a l l m: b i t s t r i n g , k : key ; s enc ( s dec (m, k ) , k ) = m.

where k represents the symmetric key and m represents the message. The ﬁrst equation is standard: it

expresses that, by decrypting the ciphertext with the correct key, one gets the cleartext. The second

equation might seem more surprising. It implies that encryption and decryption are two inverse bijections;

it is satisﬁed by block ciphers, for instance. One can also note that this equation is necessary to make

sure that one cannot distinguish whether decryption succeeds or not: without this equation, sdec(M,k)

succeeds if and only if senc(sdec(M,k),k) = M.

Trapdoor commitments. As a more involved example, let us consider trapdoor commitments [DDKS17].

Trapdoor commitments are commitments that can be opened to a diﬀerent value than the one initially

committed, using a trapdoor. We represent a trapdoor commitment of message m with randomness r

and trapdoor td by tdcommit(m, r, td). The normal opening of the commitment returns the message m,

so we have the equation

open(tdcommit(m, r, td), r) = m

To change the message, we use the equation:

tdcommit(m

, f(m

, r, td, m

), td) = tdcommit(m

, r, td)

These equations, oriented from left to right, are not convergent. We need to complete them to obtain a

convergent system, with the following equations:

open(tdcommit(m

, r, td), f (m

, r, td, m

)) = m

f(m

, f(m, r, td, m

), td, m

) = f(m, r, td, m

)

These equations are convergent, but ProVerif is unable to show termination, so it fails to handle the

equations if they are given separately. We can bypass the termination check by giving the equations

together and indicating that they are convergent, as follows:

4.2. FURTHER CRYPTOGRAPHIC OPERATORS 47

type t rap doo r .

type rand .

fun tdcommit ( b i t s t r i n g , rand , tr apd oor ) : b i t s t r i n g .

fun open ( b i t s t r i n g , rand ) : b i t s t r i n g .

fun f ( b i t s t r i n g , rand , trap door , b i t s t r i n g ) : rand .

equation f o r a l l m: b i t s t r i n g , r : rand , td : tr apd oor ;

open ( tdcommit (m, r , td ) , r ) = m;

f o r a l l m1: b i t s t r i n g , m2: b i t s t r i n g , r : rand , td : tr apd oo r ;

tdcommit (m2, f (m1, r , td , m2) , td ) = tdcommit (m1, r , td ) ;

f o r a l l m1: b i t s t r i n g , m2: b i t s t r i n g , r : rand , td : tr apd oo r ;

open ( tdcommit (m1, r , td ) , f (m1, r , td ,m2) ) = m2 ;

f o r a l l m: b i t s t r i n g , m1 : b i t s t r i n g , m2: b i t s t r i n g , r : rand , td : tr apd oor ;

f (m1, f (m, r , td , m1) , td , m2) = f (m, r , td , m2) [ c onv erg en t ] .

ProVerif still displays a warning because it cannot prove that the equations terminate:

Warning : the f o l l o w i n g e qu a t io n s

open ( tdcommit (m, r , td ) , r ) = m

tdcommit (m2, f (m1, r 7 , td 8 , m2) , td 8 ) = tdcommit (m1, r 7 , t d 8 )

open ( tdcommit ( m1 9 , r 11 , td 1 2 ) , f (m1 9 , r 11 , td 12 , m2 10 ) ) = m2 10

f (m1 14 , f (m 13 , r 16 , td 17 , m1 14 ) , td 17 , m2 15 ) = f ( m 13 , r 16 , td 17 , m2 15 )

ar e de c l a r e d con ver gen t . I cou ld not prove t e rm i na t io n .

I assume t ha t they r e a l l y t er mi na te .

Expect problems ( such as Pro Ve ri f go ing i n t o a l oo p ) i f they do not !

but it accepts the equations.

4.2.3 Function macros

Sometimes, terms that consist of more than just a constructor or destructor application are repeated

many times. ProVerif provides a macro mechanism in order to deﬁne a function symbol that represents

that term and avoid the repetition. Function macros are deﬁned by the following declaration:

letfun f(x

: t

[ or f a i l ] , . . . , x

: t

[ or f a i l ] ) = M .

where the macro f takes arguments x

, . . . , x

of types t

, . . . , t

and evaluates to the enriched term M

(see Figure 4.2). The type of the function macro f is inferred from the type of M . The optional or fail

after the type of each argument allows the user to control the behavior of the function macro in case

some of its arguments fail:

If or fail is absent and the argument fails, the function macro fails as well. For instance, with the

deﬁnitions

fun h ( ) : t

reduc h ( ) = f a i l .

letfun f (x : t ) =

l et y = x in c0 e lse c1 .

h() is fail and f(h()) returns fail and f never returns c1.

If or fail is present and the argument fails, the failure value is passed to the function macro, which

may for instance catch it and return some non-failure result. For instance, with the same deﬁnition

of h as above and the following deﬁnition of f

letfun f (x : t or f a i l ) =

l et y = x in c0 e lse c1 .

f(h()) returns c1.

48 CHAPTER 4. LANGUAGE FEATURES

Function macros can be used as constructors/destructors h in terms (see Figure 4.2). The applicability

of function macros will be demonstrated by the following example.

Probabilistic asymmetric encryption. Recall that asymmetric cryptography makes use of the

unary constructor pk, which takes an argument of type skey (private key) and returns a pkey (public

key). Since the constructors of ProVerif always represent deterministic functions, we model probabilistic

encryption by considering a constructor that takes the random coins used inside the encryption algorithm

as an additional argument, so probabilistic asymmetric encryption is modeled by a ternary constructor

internal aenc, which takes as arguments a message of type bitstring, a public key of type pkey, and ran-

dom coins of type coins. When encryption is used properly, the random coins must be freshly chosen at

each encryption, so that the encryption of x under y is modeled by new r: coins; internal aenc (x,y,r).

In order to avoid writing this code at each encryption, we can deﬁne a function macro aenc, which

expands to this code, as shown below. Decryption is deﬁned in the usual way.

type skey .

type pkey .

type c o i n s .

fun pk ( skey ) : pkey .

fun i n t e r n a l a e n c ( b i t s t r i n g , pkey , c o i n s ) : b i t s t r i n g .

reduc f o r a l l m: b i t s t r i n g , k : skey , r : c o i n s ;

adec ( i n t e r n a l a e n c (m, pk ( k ) , r ) , k ) = m.

letfun aenc ( x : b i t s t r i n g , y : pkey ) = new r : c o i n s ; i n t e r n a l a e n c (x , y , r ) .

Observe that the use of probabilistic cryptography increases the complexity of the model due to the

additional names introduced. This may slow down the analysis process.

4.2.4 Process macros with fail

Much like function macros above, process macros may also be declared with arguments of type t or fail:

l et p(x

: t

[ or f a i l ] , . . . , x

: t

[ or f a i l ] ) = P .

The optional or fail after the type of each argument allows the user to control the behavior of the

process in case some of its arguments fail:

If or fail is absent and the argument fails, the process blocks. For instance, with the deﬁnitions

fun h ( ) : t

reduc h ( ) = f a i l .

l et p ( x : t ) =

l et y = x in out ( c , c0 ) e ls e out ( c , c1 ) .

p(h()) does nothing and p never outputs c1.

If or fail is present and the argument fails, the failure value is passed to the process, which may

for instance catch it and continue to run. For instance, with the same deﬁnition of h as above and

the following deﬁnition of p

l et p ( x : t or f a i l ) =

l et y = x in out ( c , c0 ) e ls e out ( c , c1 ) .

p(h()) outputs c1 on channel c.

4.2. FURTHER CRYPTOGRAPHIC OPERATORS 49

4.2.5 Suitable formalizations of cryptographic primitives

In this section, we present various formalizations of basic cryptographic primitives, and relate them to

the assumptions on these primitives. We would like to stress that we make no computational soundness

claims: ProVerif relies on the symbolic, Dolev-Yao model of cryptography; its results do not apply to the

computational model, at least not directly. If you want to obtain proofs of protocols in the computational

model, you should use other tools, for instance CryptoVerif (http://cryptoverif.inria.fr). Still,

even in the symbolic model, some formalizations correspond better than others to certain assumptions

on primitives. The goal of this section is to help you ﬁnd the best formalization for your primitives.

Hash functions. A hash function is represented as a unary constructor h with no associated destructor

or equations. The constructor takes as input, and returns, a bitstring. Accordingly, we deﬁne:

fun h( b i t s t r i n g ) : b i t s t r i n g .

The absence of any associated destructor or equational theory captures pre-image resistance, second pre-

image resistance and collision resistance properties of cryptographic hash functions. In fact, far stronger

properties are ensured: this model of hash functions is close to the random oracle model.

Symmetric encryption. The most basic formalization of symmetric encryption is the one based on

decryption as a destructor, given in Section 3.1.2. However, formalizations that are closer to practical

cryptographic schemes are as follows:

1. For block ciphers, which are deterministic, bijective encryption schemes, a better formalization is

the one based on equations and given in Section 4.2.2.

2. Other symmetric encryption schemes are probabilistic. This can be formalized in a way similar to

what was presented for probabilistic public-key encryption in Section 4.2.3.

type key .

type c o i n s .

fun i n t e r n a l s e n c ( b i t s t r i n g , key , c o i n s ) : b i t s t r i n g .

reduc f o r a l l m: b i t s t r i n g , k : key , r : c o i n s ;

sde c ( i n t e r n a l s e n c (m, k , r ) , k ) = m.

letfun s enc ( x : b i t s t r i n g , y : key ) = new r : c o i n s ; i n t e r n a l s e n c ( x , y , r ) .

As shown in [CHW06], for protocols that do not test equality of ciphertexts, for secrecy and authen-

tication, one can use the simpler, deterministic model of Section 3.1.2. However, for observational

equivalence properties, or for protocols that test equality of ciphertexts, using the probabilistic

model does make a diﬀerence.

Note that these encryption schemes generally leak the length of the cleartext. (The length of

the ciphertext depends on the length of the cleartext.) This is not taken into account in this

formalization, and generally diﬃcult to take into account in formal protocol provers, because it

requires arithmetic manipulations. For some protocols, one can argue that this is not a problem,

for example when the length of the messages is ﬁxed in the protocol, so it is a priori known to the

attacker. Block ciphers are not concerned by this comment since they encrypt data of ﬁxed length.

Also note that, in this formalization, encryption is authenticated. In this respect, this formal-

ization is close to IND-CPA and INT-CTXT symmetric encryption. So it does not make sense

to add a MAC (message authentication code) to such an encryption, as one often does to obtain

authenticated encryption from unauthenticated encryption: the MAC is already included in the

encryption here. If desired, it is sometimes possible to model malleability properties of some en-

cryption schemes, by adding the appropriate equations. However, it is diﬃcult to model general

unauthenticated encryption (IND-CPA encryption) in formal protocol provers.

In this formalization, encryption hides the encryption key. If one wants to model an encryption

scheme that does not conceal the key, one can add the following destructor [ABCL09]:

50 CHAPTER 4. LANGUAGE FEATURES

reduc f o r a l l m: b i t s t r i n g , k : key , r : co in s , m : b i t s t r i n g , r : c o i n s ;

samekey ( i n t e r n a l s e n c (m, k , r ) , i n t e r n a l s e n c (m , k , r ) ) = t r ue .

This destructor allows the attacker to test whether two ciphertexts have been built with the same

key. The presence of such a destructor makes no diﬀerence for reachability properties (secrecy,

correspondences) since it does not enable the attacker to construct terms that it could not construct

otherwise. However, it does make a diﬀerence for observational equivalence properties. (Note that

it would obviously be a serious mistake to give out the encryption key to the attacker, in order to

model a scheme that does not conceal the key.)

Asymmetric encryption. A basic, deterministic model of asymmetric encryption has been given in

Section 3.1.2. However, cryptographically secure asymmetric encryption schemes must be probabilistic.

So a better model for asymmetric encryption is the probabilistic one given in Section 4.2.3. As shown

in [CHW06], for protocols that do not test equality of ciphertexts, for secrecy and authentication, one can

use the simpler, deterministic model of Section 3.1.2. However, for observational equivalence properties,

or for protocols that test equality of ciphertexts, using the probabilistic model does make a diﬀerence.

It is also possible to model that the encryption leaks the key. Since the encryption key is public, we

can do this simply by giving the key to the attacker:

reduc f o r a l l m: b i t s t r i n g , pk : pkey , r : c o i n s ; get key ( i n t e r n a l a e n c (m, pk , r ) ) = pk .

The previous models consider a unary constructor pk that computes the public key from the secret key.

An alternative (and equivalent) formalism for asymmetric encryption considers the unary constructors

pk , sk which take arguments of type seed , to capture the notion of constructing a key pair from some

seed.

type seed .

type pkey .

type skey .

fun pk ( seed ) : pkey .

fun sk ( seed ) : skey .

fun aenc ( b i t s t r i n g , pkey ) : b i t s t r i n g .

reduc f o r a l l m: b i t s t r i n g , k : seed ; adec ( aenc (m, pk ( k ) ) , sk ( k ) ) = m.

The addition of single quotes (’) is only for distinction between the diﬀerent formalizations. We have

given here the deterministic version, a probabilistic version is obviously also possible.

Digital signatures. The Handbook of Applied Cryptography deﬁnes four diﬀerent classes of digital

signature schemes [MvOV96, Figure 11.1], we explain how to model these four classes. Deterministic

signatures with message recovery were already modeled in Section 3.1.2. Probabilistic signatures with

message recovery can be modeled as follows, using the same ideas as for asymmetric encryption:

type s sk ey .

type spkey .

type s c o i n s .

fun spk ( s s ke y ) : spkey .

fun i n t e r n a l s i g n ( b i t s t r i n g , sskey , s c o i n s ) : b i t s t r i n g .

reduc f o r a l l m: b i t s t r i n g , k : sskey , r : s c o i n s ;

getmess ( i n t e r n a l s i g n (m, k , r ) ) = m.

reduc f o r a l l m: b i t s t r i n g , k : sskey , r : s c o i n s ;

c he c ks i gn ( i n t e r n a l s i g n (m, k , r ) , spk ( k ) ) = m.

letfun si g n (m: b i t s t r i n g , k : ss ke y ) = new r : s c o i n s ; i n t e r n a l s i g n (m, k , r ) .

There also exist signatures that do not allow message recovery, named digital signatures with appendix

in [MvOV96]. Here is a model of such signatures in the deterministic case:

4.3. FURTHER SECURITY PROPERTIES 51

type sskey .

type spkey .

fun spk ( sske y ) : spkey .

fun sig n ( b i t s t r i n g , sskey ) : b i t s t r i n g .

reduc f o r a l l m: b i t s t r i n g , k : sskey ; ch ec ksi gn ( si gn (m, k ) , spk ( k ) ,m) = t ru e .

For such signatures, the message must be given when verifying the signature, and signature veriﬁcation

just returns true when it succeeds. Note that these signatures hide the message as if it were encrypted;

this is often a stronger property than desired. If one wants to model that these signatures do not hide

the message, then one can reintroduce a destructor that leaks the message:

reduc f o r a l l m: b i t s t r i n g , k : sskey ; getmess ( sig n (m, k ) ) = m.

Only the adversary should use this destructor; it may be an overapproximation of the capabilities of the

adversary, since the message may not be fully recoverable from the signature. Probabilistic signatures

with appendix can also be modeled by combining the models given above.

It is also possible to model that the signature leaks the key. Obviously, we must not leak the secret

key, but we can leak the corresponding public key using the following destructor:

reduc f o r a l l m: b i t s t r i n g , k : sskey , r : s c o i n s ;

get key ( i n t e r n a l s i g n (m, k , r ) ) = spk ( k ) .

This model is for probabilistic signatures; it can be straightforwardly adapted to deterministic signatures.

Finally, as for asymmetric encryption, we can also consider unary constructors pk , sk which take

arguments of type seed , to capture the notion of constructing a key pair from some seed. We leave the

construction of these models to the reader.

Message authentication codes. Message authentication codes (MACs) can be formalized by a con-

structor with no associated destructor or equation, much like a keyed hash function:

type mkey .

fun mac ( b i t s t r i n g , mkey ) : b i t s t r i n g .

This model is strong: it considers the MAC essentially as a random oracle. It is probably the best

possible model if the MAC is assumed to be a pseudo-random function (PRF). If the MAC is assumed

to be unforgeable (UF-CMA), then one can add a destructor that leaks the MACed message:

reduc f o r a l l m: b i t s t r i n g , k : mkey ; g et m es sa ge (mac (m, k ) ) = m.

Only the adversary should use this destructor; it may be an overapproximation of the capabilities of the

adversary, since the message may not be fully recoverable from the MAC. We also remind the reader

that using MACs in conjunction with symmetric encryption is generally useless in ProVerif since the

basic encryption is already authenticated.

Other primitives. A simple model of Diﬃe-Hellman key agreements is given in Section 4.2.2, bit-

commitment and blind signatures are formalized in [KR05, DKR09], trapdoor commitments are formal-

ized in Section 4.2.2, and non-interactive zero-knowledge proofs are formalized in [BMU08]. Since deﬁning

correct models for cryptographic primitives is diﬃcult, we recommend reusing existing deﬁnitions, such

as the ones given in this manual.

4.3 Further security properties

In Section 3.2, the basic security properties that ProVerif is able to prove were introduced. In this

section, we generalize our earlier presentation and introduce further security properties.

52 CHAPTER 4. LANGUAGE FEATURES

ProVerif is sound, but not complete. ProVerif’s ability to reason with reachability, correspon-

dences, and observational equivalence is sound (sometimes called correct); that is, when ProVerif says

that a property is satisﬁed, then the model really does guarantee that property. However, ProVerif

is not complete; that is, ProVerif may not be capable of proving a property that holds. Sources of

incompleteness are detailed in Section 6.7.5.

4.3.1 Complex correspondence assertions, secrecy, and events

In Section 3.2.2, we demonstrated how to model correspondence assertions of the form: “if an event e

has been executed, then event e

′

has been previously executed.” We will now generalize these assertions

considerably. The syntax for correspondence assertions is revised as follows:

query x

: t

, . . . , x

: t

; q .

where the query q is constructed by the grammar presented in Figure 4.3, such that all terms appearing

in q are built by the application of constructors to the variables x

, . . . , x

of types t

, . . . , t

and all

events appearing in q have been declared with the appropriate type. Equalities as well as disequalities

and inequalities that involve time variables are not allowed before an arrow ==> or alone as single

fact in the query. If q or a subquery of q is of the form F ==> H and H contains an injective event,

then F must be an injective event. If F is a non-injective event, it is automatically transformed into an

injective event by ProVerif. The indication public vars y

, . . . , y

, when present, means that y

, . . . , y

are public, that is, the adversary has read access to them. The identiﬁers y

, . . . , y

must correspond

to bound variables or names inside the considered process. (Variables or names bound inside enriched

terms are not allowed because the expansion of terms may modify the conditions under which they are

deﬁned.) ProVerif then outputs them on public channels as soon as they are deﬁned, to give their value

to the adversary. This is mainly useful for compatibility with CryptoVerif. We will explain the meaning

of these queries through many examples.

Reachability

This corresponds to the case in which the query q is just a fact F. Such a query is in fact an abbreviation

for F ==> false, that is, not F . In other words, ProVerif tests whether F holds, but returns the following

results:

“RESULT not F is true.” when F never holds.

“RESULT not F is false.” when there exists a trace in which F holds, and ProVerif displays such

a trace.

“RESULT not F cannot be proved.” when ProVerif cannot decide either way.

For instance, we have seen query attacker(M) before: this query tests the secrecy of the term M and

ProVerif returns “RESULT not attacker(M) is true.” when M is secret, that is, the attacker cannot

reconstruct M. When phases (see Section 4.1.6) are used, this query returns “RESULT not attacker(M)

is true.” when M is secret in all phases, or equivalently in the last phase. When M contains variables,

they must be declared with their type at the beginning of the query, and ProVerif returns “RESULT

not attacker(M) is true.” when all instances of M are secret.

We can test secrecy in a speciﬁc phase n by query attacker(M) phase n. which returns “RESULT

not attacker(M) phase n is true.” when M is secret in phase n, that is, the attacker cannot reconstruct

M in phase n.

We can also test whether the protocol sends a term M on a channel N (during the last phase if

phases are used) by query mess(N, M ). This query returns “RESULT not mess(N,M ) is true.” when

the message M is never sent on channel N. We can also specify which phase should be considered by

query mess(N, M ) phase n. This query is intended for use when the channel N is private (the attacker

does not have it). When the attacker has the channel N, this query is equivalent to query attacker(M).

Similarly, we can test whether the element (M

, . . . , M

) is present in table d by query table(d(M

. . . , M

)).

ProVerif can also evaluate the reachability of events within a model using the following query:

4.3. FURTHER SECURITY PROPERTIES 53

Figure 4.3 Grammar for correspondence assertions

q ::= query

cq pv reachability or correspondence

secret x pv [options] secrecy

pv ::= public variables

empty

public vars y

, . . . , y

public variables

cq ::= reachability or correspondence query

&& . . . && F

reachability

&& . . . && F

==> H correspondence

H ::= hypothesis

F fact

H && H conjunction

H || H disjunction

false constant false

(F ==> H) nested correspondence

F ::= fact

M op N constraint with op ∈ {<, <=, >, >=, =; <>}

is nat (M) natural number

AF action fact

AF @t action fact executed at time t

AF ::= action fact

attacker(M) the attacker has M (in any phase)

attacker(M) phase n the attacker has M in phase n

mess(N, M) M is sent on channel N (in the last phase)

mess(N, M) phase n M is sent on channel N in phase n

table(d(M

, . . . , M

)) the element M

, . . . , M

is in table d (in any phase)

table(d(M

, . . . , M

)) phase n the element M

, . . . , M

is in table d in phase n

event(e(M

, . . . , M

)) non-injective event

inj−event(e(M

, . . . , M

)) injective event

query x

: t

, . . . , x

: t

; event (e(M

, . . . , M

) ) .

This query returns “RESULT not event(e(M

, . . . , M

)) is true.” when the event is not reachable.

Such queries are useful for debugging purposes, for example, to detect unreachable branches of a model.

With reference to the “Hello World” script (docs/hello ext.pv) in Chapter 2, one could examine as to

whether the else branch is reachable.

More generally, such a query can be F

&& . . . && F

, which is in fact an abbreviation for F

. . . && F

==> false , that is, not (F

&& . . . && F

): ProVerif tries to prove that F

, . . . , F

are not

simultaneously reachable. The similar query with inj−event instead of event is useless: it has the same

meaning as the one with event. Injective events are useful only for correspondences described below.

Equalities, disequalities, and inequalities are not allowed in reachability queries as mentioned above.

Basic correspondences

Basic correspondences are queries q = F

&& . . . && F

==> H where H does not contain nested

correspondences. They mean that, if F

, . . . , F

hold, then H also holds. We have seen such corre-

spondences in Section 3.2.2. We can extend them to conjunctions and disjunctions of events in H. For

54 CHAPTER 4. LANGUAGE FEATURES

instance,

query event (e

) ==> event (e

) && event ( e

) .

means that, if e

has been executed, then e

and e

have been executed. Similarly,

query event (e

) ==> event (e

) | | event (e

) .

means that, if e

has been executed, then e

or e

has been executed. If the correspondence F ==> H

holds, F is an event, and H contains events, then the events in H must be executed before the event F

(or at the same time as F in case an event in H may be equal to F ). This property is proved by stopping

the execution of the process just after the event F .

Conjunctions and disjunctions can be combined:

query event (e

) ==> event (e

) | | ( event (e

) && event ( e

) ) .

means that, if e

has been executed, then either e

has been executed, or e

and e

have been executed.

The conjunction has higher priority than the disjunction, but one should use parentheses to disambiguate

the expressions. The events can of course have arguments, and can also be injective events. For instance,

query inj−event ( e

) ==> event (e

) | | ( inj −event ( e

) && event ( e

) ) .

means that each execution of e

corresponds to either an execution of e

(perhaps the same execution of

for diﬀerent executions of e

), or to a distinct execution of e

and an execution of e

. Note that using

inj−event or event before the arrow ==> does not change anything, since event is automatically

changed into inj−event before ==> when there is inj−event after the arrow ==>.

Conjunctions are also allowed before the arrow ==>. For instance,

event (e

) ) && . . . && event (e

) ) ==> H

means that, if events e

), . . . , e

) are executed, then H holds. When there are several injective

events before the arrow ==>, the query means that for each tuple of executed injective events before

the arrow, there are distinct injective events after the arrow. For instance, the query

inj −event ( e

) && inj−event ( e

) ==> inj−event ( e

)

requires that if event e

is executed n

times and event e

is executed n

times, then event e

is executed

at least n

× n

times.

Correspondences may also involve the knowledge of the attacker or the messages sent on channels.

For instance,

query attacker ( M ) ==> event (e

) .

means that, when the attacker knows M, the event e

has been executed. Conversely,

query event (e

) ==> attacker (M ) .

means that, when event e

has been executed, the attacker knows M . (In practice, ProVerif may have

more diﬃculties proving the latter correspondence. Technically, ProVerif needs to conclude attacker(M)

from facts that occur in the hypothesis of a clause that concludes event(e

); this hypothesis may get

simpliﬁed during the resolution process in a way that makes the desired facts disappear.)

One may also use equalities, disequalities, and inequalities after the arrow ==>. For instance,

assuming a free name a,

query x : t ; event ( e ( x ) ) ==> x = a .

means that the event e(x) can be executed only when x is a. Similarly,

query x : t , y : t ; event ( e ( x ) ) ==> event ( e ( y ) ) && x = f ( y )

means that, when the event e(x) is executed, the event event(e (y)) has been executed and x = f(y).

Using disequalities,

query x : t ; event ( e ( x ) ) ==> x <> a .

means that the event e(x) can be executed only when x is diﬀerent from a.

4.3. FURTHER SECURITY PROPERTIES 55

Nested correspondences

The grammar permits the construction of nested correspondences, that is, correspondences F

. . . && F

==> H in which some of the events H are replaced with correspondences. Such cor-

respondences allow us to order events. More precisely, in order to explain a nested correspondence

&& . . . && F

==> H, let us deﬁne a hypothesis H

by replacing all arrows ==> of H with con-

junctions &&. The nested correspondence F

&& . . . && F

==> H holds if and only if the basic

correspondence F

&& . . . && F

==> H

holds and additionally, for each F

′

==> H

′

that occurs in

&& . . . && F

==> H, if F

′

is an event, then the events of H

′

have been executed before F

′

(or at

the same time as F

′

in case events in H

′

may be equal to F

′

For example,

event (e

) ==> (event (e

) ==> event (e

) ) )

is true when, if the event e

has been executed, then events e

, e

have been previously executed in

that order and before e

. In contrast, the correspondence

event (e

) ==> (event (e

) ==> event (e

) ) && ( event (e

) ==> event (e

) )

holds when, if the event e

has been executed, then e

has been executed before e

and e

before e

, and

those occurrences of e

and e

have been executed before e

Even if the grammar of correspondences does not explicitly require that facts F that occur before

arrows in nested correspondences are events (or injective events), in practice they are because the only

goal of nested correspondences is to order such events.

Our study of the JFK protocol, which can be found in the subdirectory examples/pitype/jfk (if you

installed by OPAM in the switch ⟨switch⟩, the directory

/.opam/⟨switch⟩/doc/proverif/examples/

pitype/jfk), provides several interesting examples of nested correspondence assertions used to prove

the correct ordering of messages of the protocol.

ProVerif proves nested correspondences essentially by proving several correspondences. For instance,

in order to prove

event (e

) ==> (event (e

) ==> event (e

) )

where the events e

, e

may have arguments, ProVerif proves that each execution of e

is preceded

by the execution of an instance of e

, and that, when e

is executed, each execution of that instance of

is preceded by the execution of an instance of e

A typical usage of nested correspondences is to order all messages in a protocol. One would like to

prove a correspondence in the style:

inj −event ( e

end

) ==>

( inj −event ( e

) ==> . . . ==> ( inj −event (e

) ==> inj−event ( e

) ) )

where e

means that the ﬁrst message of the protocol has been sent, e

(i > 0) means that the i-th

message of the protocol has been received and the (i + 1)-th has been sent, and ﬁnally e

end

means that

the last message of the protocol has been received. (These events have at least as arguments the messages

of the protocol.)

Temporal correspondences

Correspondences and nested correspondences allow one to verify the order in which facts occur in exe-

cution traces. The grammar also permits to reason on the order of facts through time variables. In a

query, each fact F can be associated with a variable t of type time with the construct F @t, meaning

that the fact F is executed at time t. When several facts are associated with time variables t, t

′

, . . ., we

can reason on the order in which these facts are executed using equalities, inequalities, and disequalities

Although the meaning of a basic correspondence such as event(e

) ==> event(e

) is similar to a logical implication,

the meaning of a nested correspondence such as event(e

) ==> (event(e

) ==> event(e

)) is very diﬀerent from the log-

ical formula event(e

)⇒(event(e

)⇒event(e

)) in classical logic, which would mean (event(e

)∧event(e

))⇒event(e

The nested correspondence event(e

) ==> (event(e

) ==> event(e

)) rather means that, if e

is executed, then some

instance of e

is executed (before e

), and if that instance of e

is executed, then an instance of e

is executed (before e

So the nested correspondence is similar to an abbreviation for the two correspondences event(e

) ==> event(σe

) and

event(σe

) ==> event(σe

) for some substitution σ.

56 CHAPTER 4. LANGUAGE FEATURES

between the time variables, e.g. t < t

′

, t = t

′

, . . . For example, in our study of the Yubikey proto-

col, which can be found in the ﬁle examples/pitype/lemma/yubikey-less-axioms-time.pv (if you

installed by OPAM in the switch ⟨switch⟩, the ﬁle

/.opam/⟨switch⟩/doc/proverif/examples/pitype/

lemma/yubikey-less-axioms-time.pv), a server executes the event Login(pid,k, i ,x) every time it ac-

cepts a connection from a Yubikey with identity pid and key k. The value i is the value of the server’s

counter and x is the value of the Yubikey’s counter sent to the server. The following query ensures that

the server never executes two login events at diﬀerent times with the same value for the identity, the key,

and Yubikey’s counter.

query t : time , t : time , pid : b i t s t r i n g , k : b i t s t r i n g , i : nat , i : nat ,

x : nat , x : nat ;

event ( Login ( pid , k , i , x ) ) @t && event ( Login ( pid , k , i , x ) ) @t ==> t = t .

Formally, the query is true when, if two Login events are executed with the same key, identity, and

Yubikey’s counter value, then the two events are executed at the same time. Since the semantics of

ProVerif’s calculus only allows events to be executed one at a time, it also implies that the two events

are equal, i.e., i = i

′

Using temporal correspondences allows one to be more precise than basic correspondences. For

example, the following query is not equivalent to the previous one.

query p id : b i t s t r i n g , k : b i t s t r i n g , i : nat , i : nat , x : nat , x : nat ;

event ( Login ( pid , k , i , x ) ) && event ( Login ( pid , k , i , x ) ) ==> i = i .

Indeed, an execution trace where the Login event is executed twice with the same arguments (at diﬀerent

times) would satisfy this query but not the former one.

Temporal variables can be used to compare facts from both the premise and the conclusion of the

query. For example,

event (e

) && event ( e

)@t

==> event (e

)@t

&& t

< t

is true when if two events e

and e

are executed then the event e

must have been executed strictly

before the event e

Note that temporal variables can be used in combination with injective events and nested correspon-

dences, although they overlap in some cases. For example, the query

event (e

) ==> (event (e

) ==> event (e

) )

is equivalent to the query

event (e

) ==> event (e

)@t

&& event (e

)@t

&& t

<= t

The grammar of correspondences also allows attacker, message, and table facts to be associated with

time variables. However, when an inequality i > j or i >= j occurs in the conclusion of a query, with

i and j time variables associated to facts F and G respectively, the following two conditions must hold:

1) F @i occurs in the premise or F is an event; 2) G@j occurs in the conclusion or G is an event. More

generally, in practice, ProVerif is more successful in proving correspondence queries containing mainly

events. Note that the type time can only be used in queries and cannot be used in declarations of

processes, function symbols, names, . . .

Secrecy

The query query secret x provides an alternative way to test secrecy to query attacker(M ). The

latter query is meant to test whether the attacker can compute the term M, built from free names.

The query query secret x can test the secrecy of the bound name or variable x. The identiﬁer x must

correspond to a bound variable or name inside the considered process. (Variables or names bound inside

enriched terms are not allowed because the expansion of terms may modify the conditions under which

they are deﬁned.) This query comes in two ﬂavors:

query secret x, query secret x [reachability], or query secret x [pv reachability] tests whether

the attacker can compute a value stored in the variable x or equal to the bound name x.

4.3. FURTHER SECURITY PROPERTIES 57

query secret x [real or random] or query secret x [pv real or random] tests whether the attacker

can distinguish each value of x from a fresh name (representing a fresh random value). This query

is in fact encoded as an observational query between processes that diﬀer only by terms. Such

queries are explained in the next section.

This query is designed for compatibility with CryptoVerif: the options that start with pv apply only

to ProVerif; those that start with cv apply only to CryptoVerif and are ignored by ProVerif; the others

apply to both tools. The various options make it possible to test, in each tool, whether the attacker can

compute the value of x or whether it can distinguish it from a fresh random value. (The former is the

default in ProVerif while the latter is the default in CryptoVerif.)

4.3.2 Observational equivalence

The notion of indistinguishability is a powerful concept which allows us to reason about complex proper-

ties that cannot be expressed as reachability or correspondence properties. The notion of indistinguisha-

bility is generally named observational equivalence in the formal model. Intuitively, two processes P and

Q are observationally equivalent, written P ≈ Q, when an active attacker cannot distinguish P from Q.

Formal deﬁnitions can be found in [AF01, BAF08]. Using this notion, one can for instance specify that

a process P follows its speciﬁcation Q by saying that P ≈ Q. ProVerif can prove some observational

equivalences, but not all of them because their proof is complex. In this section, we present the queries

that enable us to prove observational equivalences using ProVerif.

Strong secrecy

A ﬁrst class of equivalences that ProVerif can prove is strong secrecy. Strong secrecy means that the

attacker is unable to distinguish when the secret changes. In other words, the value of the secret should

not aﬀect the observable behavior of the protocol. Such a notion is useful to capture the attacker’s ability

to learn partial information about the secret: when the attacker learns the ﬁrst component of a pair, for

instance, the whole pair is secret in the sense of reachability (the attacker cannot reconstruct the whole

pair because it does not have the second component), but it is not secret in the sense of strong secrecy

(the attacker can notice changes in the value of the pair, since it has its ﬁrst component). The concept

is particularly important when the secret consists of known values. Consider for instance a process P

that uses a boolean b. The variable b can take two values, true or false, which are both known to the

attacker, so it is not secret in the sense of reachability. However, one may express that b is strongly

secret by saying that P {true/b} ≈ P {false/b}: the attacker cannot determine whether b is true or false.

({true/b} denotes the substitution that replaces b with true.)

The strong secrecy of values x

, . . . , x

is denoted by

noninterf x

, . . . , x

When the process under consideration is P , this query is true if and only if

P {M

, . . . , M

} ≈ P {M

′

, . . . , M

′

}

for all terms M

, . . . , M

, M

′

, . . . , M

′

. ({M

, . . . , M

} denotes the substitution that replaces x

with M

, . . . , x

with M

.) In other words, the attacker cannot distinguish changes in the values of

, . . . , x

. The values x

, . . . , x

must be free names of P , declared by free x

: t

[private]. This point

is particularly important: if x

, . . . , x

do not occur in P or occur as bound names or variables in P , the

query noninterf x

, . . . , x

holds trivially, because P {M

, . . . , M

} = P {M

′

, . . . , M

′

To express secrecy of bound names or variables, one can use choice, described below. In the equivalence

above, the attacker is permitted to replace the values x

, . . . , x

with any term M

, . . . , M

, M

′

, . . . , M

′

it can build, that is, any term that can be built from public free names, public constructors, and fresh

names created by the attacker. These terms cannot contain bound names (or private free names).

For instance, this strong secrecy query can be used to show the secrecy of a payload sent encrypted

under a session key. Here is a trivial example of a such situation, in which we use a previously shared

long-term key k as session key (ﬁle docs/ex noninterf1.pv).

58 CHAPTER 4. LANGUAGE FEATURES

1 free c : c hann el .

3 ( Shared key en cr y p ti o n )

4 type key .

5 fun s enc ( b i t s t r i n g , key ) : b i t s t r i n g .

6 reduc f o r a l l x : b i t s t r i n g , y : key ; s dec ( s enc ( x , y ) , y ) = x .

8 ( The s hare d key )

9 free k : key [ private ] .

11 ( Query )

12 free s ec r e t ms g : b i t s t r i n g [ private ] .

13 noninterf s e c re t m sg .

15 process ( ! out ( c , se nc ( se cr et ms g , k ) ) ) |

16 ( ! in ( c , x : b i t s t r i n g ) ; l e t s = sde c ( x , k ) in 0 )

One can also specify the set of terms in which M

, . . . , M

, M

′

, . . . , M

′

are taken, using a variant of

the noninterf query:

noninterf x

among (M

1,1

, . . . , M

1,k

) ,

. . . ,

among (M

n,1

, . . . , M

n,k

) .

This query is true if and only if

P {M

, . . . , M

} ≈ P {M

′

, . . . , M

′

}

for all terms M

, M

′

∈ {M

1,1

, . . . , M

1,k

}, . . . , M

, M

′

∈ {M

n,1

, . . . , M

n,k

}. Obviously, the terms

j,1

, . . . , M

j,k

must have the same type as x

. For instance, the secrecy of a boolean b could be

expressed by noninterf b among (true, false).

Consider the following example (docs/ex noninterf2.pv) in which the attacker is asked to distin-

guish between sessions which output x ∈ {n, h(n)}, where n is a private name.

1 free c : c hann el .

3 fun h ( b i t s t r i n g ) : b i t s t r i n g .

5 free x , n : b i t s t r i n g [ private ] .

6 noninterf x among (n , h ( n ) ) .

8 process out ( c , x )

Note that free x,n: bitstring [private]. is a convenient shorthand for

free x : b i t s t r i n g [ private ] .

free n : b i t s t r i n g [ private ] .

More complex examples can be found in subdirectory examples/pitype/noninterf (if you installed

by OPAM in the switch ⟨switch⟩, the directory

/.opam/⟨switch⟩/doc/proverif/examples/pitype/

noninterf).

Oﬀ-line guessing attacks

Protocols may rely upon weak secrets, that is, values with low entropy, such as human-memorable

passwords. Protocols which rely upon weak secrets are often subject to oﬀ-line guessing attacks, whereby

an attacker passively observes, or actively participates in, an execution of the protocol and then has the

ability to verify if a guessed value is indeed the weak secret without further interaction with the protocol.

This makes it possible for the attacker to enumerate a dictionary of passwords, verify each of them, and

ﬁnd the correct one. The absence of oﬀ-line guessing attacks against a name n can be tested by the

query:

4.3. FURTHER SECURITY PROPERTIES 59

weaksecret n .

where n is declared as a private free name: free n : t [private]. ProVerif then tries to prove that the

attacker cannot distinguish a correct guess of the secret from an incorrect guess. This can be written

formally as an observational equivalence

P | phase 1 ; out ( c , n) ≈ P | phase 1 ; new n

′

: t ; out ( c , n

′

)

where P is the process under consideration and t is the type of n. In phase 0, the attacker interacts with

the protocol P . In phase 1, the attacker can no longer interact with P , but it receives either the correct

password n or a fresh (incorrect) password n

′

, and it should not be able to distinguish between these

two situations.

As an example, we will consider the na¨ıve voting protocol introduced by Delaune & Jacquemard [DJ04].

The protocol proceeds as follows. The voter V constructs her ballot by encrypting her vote v with the

public key of the administrator. The ballot is then sent to the administrator whom is able to decrypt

the message and record the voter’s vote, as modeled in the ﬁle docs/ex weaksecret.pv shown below:

1 free c : c hann el .

3 type skey .

4 type pkey .

6 fun pk ( ske y ) : pkey .

7 fun aenc ( b i t s t r i n g , pkey ) : b i t s t r i n g .

9 reduc f o r a l l m: b i t s t r i n g , k : skey ; adec ( aenc (m, pk ( k ) ) , k ) = m.

11 free v : b i t s t r i n g [ private ] .

12 weaksecret v .

14 le t V(pkA : pkey ) = out ( c , aenc (v , pkA ) ) .

16 le t A( skA : skey ) = in ( c , x : b i t s t r i n g ) ; le t v = adec ( x , skA ) in 0.

18 process

19 new skA : skey ;

20 l e t pkA = pk ( skA ) in

21 out ( c , pkA ) ;

22 ! (V(pkA) | A( skA ) )

The voter’s vote is syntactically secret; however, if the attacker is assumed to know a small set of possible

votes, then v can be deduced from the ballot. The oﬀ-line guessing attack can be thwarted by the use of

a probabilistic public-key encryption scheme.

More examples regarding guessing attacks can be found in subdirectory examples/pitype/weaksecr

(if you installed by OPAM in the switch ⟨switch⟩, the directory

/.opam/⟨switch⟩/doc/proverif/

examples/pitype/weaksecr).

Observational equivalence between processes that diﬀer only by terms

The most general class of equivalences that ProVerif can prove are equivalences P ≈ Q where the

processes P and Q have the same structure and diﬀer only in the choice of terms. These equivalences

are written in ProVerif by a single “biprocess” that encodes both P and Q. Such a biprocess uses the

construct choice[M ,M

′

] to represent the terms that diﬀer between P and Q: P uses the ﬁrst component

of the choice, M , while Q uses the second one, M

′

. (The keyword diﬀ is also allowed as a synonym

for choice; diﬀ is used in research papers.) For example, the secret ballot (privacy) property of an

electronic voting protocol can be expressed as:

P (sk

, v

) | P (sk

, v

) ≈ P (sk

, v

) | P (sk

, v

) (4.1)

60 CHAPTER 4. LANGUAGE FEATURES

where P is the voter process, sk

(respectively sk

) is the voter’s secret key and v

(respectively v

) is

the candidate for whom the voter wishes to vote for: one cannot distinguish the situation in which A

votes for v

and B votes from v

from the situation in which A votes for v

and B votes for v

. (The

simpler equivalence P (sk

, v

) ≈ P (sk

, v

) typically does not hold because, if A is the only voter, one

can know for whom she voted from the ﬁnal result of the election.) The pair of processes (4.1) can be

expressed as a single biprocess as follows:

P (sk

, choice[v

, v

]) | P (sk

, choice[v

, v

])

Accordingly, we extend our grammar for terms to include choice[M,N ].

Unlike the previous security properties we have studied, there is no need to explicitly tell ProVerif that

a script aims at verifying an observational equivalence, since this can be inferred from the occurrence of

choice[M,N ]. It should be noted that the analysis of observational equivalence is incompatible with other

security properties, that is, scripts in which choice[M ,N] appears cannot contain query, noninterf,

nor weaksecret. (For this reason, you may have to write several distinct input ﬁles in order to prove

several properties of the same protocol. You can use a preprocessor such as m4 or cpp to generate all

these ﬁles from a single master ﬁle.)

Example: Decisional Diﬃe-Hellman assumption The decisional Diﬃe-Hellman (DDH) assump-

tion states that, given a cyclic group G of prime order q with generator g, (g

, g

) is computationally

indistinguishable from (g

, g

), where a, b, c are random elements from Z

∗

. A formal counterpart of

this property can be expressed as an equivalence using the ProVerif script below (ﬁle docs/dh-fs.pv).

1 free d : cha nnel .

3 type G.

4 type exponent .

6 const g : G [ data ] .

7 fun exp (G, exponent ) : G.

9 equation f o r a l l x : exponent , y : exponent ; exp ( exp ( g , x ) , y ) = exp ( exp ( g , y ) , x ) .

11 process

12 new a : exponent ; new b : exponent ; new c : exponent ;

13 out (d , ( exp ( g , a ) , exp ( g , b ) , choice [ exp ( exp ( g , a ) , b ) , exp ( g , c ) ] ) )

ProVerif succeeds in proving this equivalence. Intuitively, this result shows that our model of the Diﬃe-

Hellman key agreement is stronger than the Decisional Diﬃe-Hellman assumption.

Observe that the biprocess out(d,(exp(g,a),exp(g,b),choice[exp(exp(g,a),b),exp(g,c )])) is equivalent

out ( choice [ d , d ] , ( choice [ exp ( g , a ) , exp ( g , a ) ] , choice [ exp ( g , b ) , exp ( g , b ) ] ,

choice [ exp ( exp ( g , a ) , b ) , exp ( g , c ) ] ) ) .

That is, choice[M,M] may be abbreviated as M; it follows immediately that the choice operator is only

needed to model the terms that are diﬀerent in the pair of processes.

Real-or-random secrecy In the computational model, one generally expresses the secrecy of a value

x by saying that x is indistinguishable from a fresh random value. One can express a similar idea in

the formal model using observational equivalence. For instance, this notion can be used for proving

secrecy of a session key k, as in the following variant of the ﬁxed handshake protocol of Chapter 3 (ﬁle

docs/ex handshake RoR.pv).

1 free c : chan nel .

3 le t c l i e nt A (pkA : pkey , skA : skey , pkB : spkey ) =

4 out ( c , pkA ) ;

4.3. FURTHER SECURITY PROPERTIES 61

5 in ( c , x : b i t s t r i n g ) ;

6 l e t y = adec ( x , skA ) in

7 l e t (=pkA,=pkB , k : key ) = ch ec k si g n ( y , pkB) in

8 new random : key ;

9 out ( c , choice [ k , random ] ) .

11 le t se rve rB (pkB : spkey , skB : sskey , pkA : pkey ) =

12 in ( c , pkX : pkey ) ;

13 new k : key ;

14 out ( c , aenc ( s i g n ( ( pkX , pkB , k ) , skB ) ,pkX ) ) .

16 process

17 new skA : skey ;

18 new skB : s sk ey ;

19 l e t pkA = pk ( skA ) in out ( c , pkA ) ;

20 l e t pkB = spk ( skB ) in out ( c , pkB ) ;

21 ( ( ! c l i e n t A (pkA , skA , pkB ) ) | ( ! se rve rB ( pkB , skB , pkA) ) )

In Line 9, one outputs to the attacker either the real key (k) or a random key (random), and the

equivalence holds when the attacker cannot distinguish these two situations. As ProVerif ﬁnds, the

equivalence does not hold in this example, because of a replay attack: the attacker can replay the

message from the server B to the client A, which leads several sessions of the client to have the same

key k. The attacker can distinguish this situation from a situation in which the key is a fresh random

number (random) generated at each session of the client. Another example can be found in Section 5.4.2.

When the observational equivalence proof fails on the biprocess given by the user, ProVerif tries

to simplify that biprocess by transforming as far as possible tests that occur in subprocesses into

tests done inside terms, which increases the chances of success of the proof. The proof is then re-

tried on the simpliﬁed process(es). This simpliﬁcation of biprocesses can be turned oﬀ by the setting

set simplifyProcess = false . (See Section 6.6.2 for details on this setting.) More complex examples using

choice can be found in subdirectory examples/pitype/choice (if you installed by OPAM in the switch

⟨switch⟩, the directory

/.opam/⟨switch⟩/doc/proverif/examples/pitype/choice).

Remarks The absence of oﬀ-line guessing attacks can also be expressed using choice:

P | phase 1 ; new n

′

: t ; out ( c , choice [ n ,n

′

] )

This is how ProVerif handles guessing attacks internally, but using weaksecret is generally more con-

venient in practice. (For instance, one can query for the secrecy of several weak secrets in the same

ProVerif script.)

Strong secrecy noninterf x

, . . . , x

can also be formalized using choice, by inputting two messages

′

, x

′′

for each i ≤ n and deﬁning x

by let x

= choice[x

′

, x

′′

] before starting the protocol itself

(possibly in an earlier phase than the protocol). However, the query noninterf is typically much more

eﬃcient than choice. On the other hand, in the presence of equations that can be applied to the secrets,

noninterf commonly leads to false attacks. So we recommend trying with noninterf for properties

that can be expressed with it, especially when there is no equation, and using choice in the presence of

equations or for properties that cannot be expressed using noninterf.

Strong secrecy with among can also be encoded using choice. That may require many equiva-

lences when the sets are large, even if some examples are very easy to encode. For instance, the query

noninterf b among (true, false) can also be encoded as let b = choice[true, false ] in P where P is

the protocol under consideration.

Static equivalence [AF01] is an equivalence between frames, that is, substitutions with hidden names

ϕ = new n

: t

; . . . new n

: t

; {M

, . . . , M

}

′

= new n

′

: t

′

; . . . new n

′

: t

′

; {M

′

, . . . , M

′

}

Static equivalence corresponds to the case in which the attacker receives either the messages M

, . . . , M

or M

′

, . . . , M

′

, and should not be able to distinguish between these two situations; static equivalence

can be expressed by the observational equivalence

62 CHAPTER 4. LANGUAGE FEATURES

new n

: t

; . . . new n

: t

; out ( c , ( M

, . . . , M

) )

≈

new n

′

: t

′

; . . . new n

′

: t

′

; out ( c , ( M

′

, . . . , M

′

) )

which can always be written using choice:

new n

: t

; . . . new n

: t

; new n

′

: t

′

; . . . new n

′

: t

′

;

out ( c , ( choice [ M

, M

′

] , . . . , choice [ M

, M

′

] ) )

The Diﬃe-Hellman example above is an example of static equivalence.

Internally, ProVerif proves a property much stronger than observational equivalence of P and Q.

In fact, it shows that for each reachable test, the same branch of the test is taken both in P and

in Q; for each reachable destructor application, the destructor application either succeeds both in P

and Q or fails both in P and Q; for each reachable conﬁguration with an input and an output on

private channels, the channels are equal in P and in Q, or diﬀerent in P and in Q. In other words,

it shows that each reduction step is executed in the same way in P and Q. Because this property is

stronger than observational equivalence, we may have “false attacks” in which this property is wrong

but observational equivalence in fact holds. When ProVerif does not manage to prove observational

equivalence, it tries to reconstruct an attack against the stronger property, that is, it provides a trace of

P and Q that arrives at a point at which P and Q reduce in a diﬀerent way. This trace explains why

the proof fails, and may also enable the user to understand if observational equivalence really does not

hold, but it does not provide a proof that observational equivalence does not hold. That is why ProVerif

never concludes “RESULT [Query] is false” for observational equivalences; when the proof fails, it just

concludes “RESULT [Query] cannot be proved”.

Observational equivalence with synchronizations Synchronizations (see Section 4.1.7) can help

proving equivalences with choice, because they allow swapping data between processes at the synchro-

nization points [BS16]. The following toy example illustrates this point:

1 free c : c hann el .

2 free m, n : b i t s t r i n g .

4 process

5 (

6 out ( c ,m) ;

7 sync 1 ;

8 out ( c , choice [m, n ] )

9 ) | (

10 sync 1 ;

11 out ( c , choice [ n ,m] )

12 )

The two processes represented by this biprocess are observationally equivalent, and this property is

proved by swapping m and n in the second component of choice at the synchronization point. By

default, ProVerif tries all possible swapping strategies in order to prove the equivalence. It is also

possible to choose the swapping strategy in the input ﬁle by set swapping = ”swapping stragegy”., or

to choose it interactively by adding set interactiveSwapping = true. to the input ﬁle. In the latter case,

ProVerif displays a description of the possible swappings and asks the user which swapping strategy to

choose.

A swapping strategy is described as follows. The swapping strategies are permutations of the synchro-

nizations, represented by their tag (given by the user or chosen automatically by ProVerif as explained

in Section 4.1.7; for stability of the tags, when a swapping strategy is given, it is recommend that the

user speciﬁes the tags). They are denoted as follows:

tag

1,1

−> . . .−> tag

1,n

;. . .;tag

k,1

−> . . .−> tag

k,n

which means that tag

i,j

has image tag

i,j+1

when j < n

and tag

i,n

has image tag

i,1

by the permutation.

(In other words, we give the cycles of the permutation.) When the tag of a synchronization does not

4.3. FURTHER SECURITY PROPERTIES 63

appear in the swapping strategy, data is not swapped at that synchronization. For instance, the previous

example may the rewritten:

1 free c : c hann el .

2 free m, n : b i t s t r i n g .

4 set swapping = ” tag1 −> tag2 ” .

6 process

7 (

8 out ( c ,m) ;

9 sync 1 [ tag1 ] ;

10 out ( c , choice [m, n ] )

11 ) | (

12 sync 1 [ tag2 ] ;

13 out ( c , choice [ n ,m] )

14 )

with additional tags, and the swapping strategy is tag1 −> tag2.

When a synchronization is tagged with a tag that contains the string noswap, data is not swapped

at that synchronization.

Swapping data at synchronizations point can help for instance proving ballot secrecy in e-voting

protocols: as mentioned above, this property is proved by showing that the two processes represented

by the biprocess

P (sk

, choice[v

, v

]) | P (sk

, choice[v

, v

])

are observationally equivalent, and proving this property often requires swapping the votes v

and v

This technique is illustrated on the FOO e-voting protocol in the ﬁle examples/pitype/sync/foo.pv

of the documentation package proverifdoc2.05.tar.gz. Other examples appear in the directory

examples/pitype/sync/ in that package.

Observational equivalence between two processes

ProVerif can also prove equivalence P ≈ Q between two processes P and Q presented separately, using

the following command (instead of process P )

equivalence P Q

where P and Q are processes that do not contain choice. ProVerif will in fact try to merge the processes

P and Q into a biprocess and then prove equivalence of this biprocess. Note that ProVerif is not always

capable of merging two processes into a biprocess: the structure of the two processes must be fairly

similar. Here is a toy example:

1 type key .

2 type macs .

4 fun mac( b i t s t r i n g , key ) : macs .

6 free c : c hann el .

8 equivalence

9 ! new k : key ; ! new a : b i t s t r i n g ; out ( c , mac( a , k ) )

10 ! new k : key ; new a : b i t s t r i n g ; out ( c , mac( a , k ) )

The diﬀerence between the two processes is that the ﬁrst process can use the same key k for sending

several MACs, while the second one sends one MAC for each key k. Even though the structure of the two

processes is slightly diﬀerent (there is an additional replication in the ﬁrst process), ProVerif manages

to merge these two processes into a single biprocess:

64 CHAPTER 4. LANGUAGE FEATURES

1 !

2 new k 39 : key ;

3 !

4 new a 4 0 : b i t s t r i n g ;

5 new k 41 : key ;

6 new a 4 2 : b i t s t r i n g ;

7 out ( c , choice [ mac( a 40 , k 39 ) , mac( a 42 , k 41 ) ] )

and to prove that the two processes are observationally equivalent.

When proving an equivalence by equivalence P Q, the processes P and Q must not contain syn-

chronizations sync n (see Section 4.1.7).

Chapter 5

Needham-Schroeder public key

protocol: Case Study

The Needham-Schroeder public key protocol [NS78] is intended to provide mutual authentication of two

principals Alice A and Bob B. Although it is not stated in the original description, the protocol may

also provide a secret session key shared between the participants. In addition to the two participants,

we assume the existence of a trusted key server S.

The protocol proceeds as follows. Alice contacts the key server S and requests Bob’s public key. The

key server responds with the key pk(skB) paired with Bob’s identity, signed using his private signing key

for the purposes of authentication. Alice proceeds by generating a nonce Na, pairs it with her identity A,

and sends the message encrypted with Bob’s public key. On receipt, Bob decrypts the message to recover

Na and the identity of his interlocutor A. Bob then establishes Alice’s public key pk(skA) by requesting

it to the key server S. Bob then generates his nonce Nb and sends the message (Na,Nb) encrypted for

Alice. Finally, Alice replies with the message aenc(Nb, pk(skB)). The rationale behind the protocol is

that, since only Bob can recover Na, only he can send message 6; and hence authentication of Bob should

hold. Similarly, only Alice should be able to recover Nb; and hence authentication of Alice is expected

on receipt of message 7. Moreover, it follows that Alice and Bob have established the shared secrets

Na and Nb which can subsequently be used as session keys. The protocol can be summarized by the

following narration:

(1) A → S : (A, B)

(2) S → A : sign((B, pk(skB)), skS)

(3) A → B : aenc((Na, A), pk(skB))

(4) B → S : (B, A)

(5) S → B : sign((A, pk(skA)), skS)

(6) B → A : aenc((Na, Nb), pk(skA))

(7) A → B : aenc(Nb, pk(skB))

Informally, the protocol is expected to satisfy the following properties:

1. Authentication of A to B: if B reaches the end of the protocol and he believes he has done so with

A, then A has engaged in a session with B.

2. Authentication of B to A: similarly to the above.

3. Secrecy for A: if A reaches the end of the protocol with B, then the nonces Na and Nb that A has

are secret; in particular, they are suitable for use as session keys for preserving the secrecy of an

arbitrary term M in the symmetric encryption senc(M,K) where K ∈ {Na, Nb}.

4. Secrecy for B: similarly.

However, nearly two decades after the protocol’s inception, Gavin Lowe discovered a man-in-the-middle

attack [Low96]. An attacker I engages Alice in a legitimate session of the protocol; and in parallel, the

attacker is able to impersonate Alice in a session with Bob. In practice, one may like to consider the

66 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

attacker to be a malicious retailer I whom Alice is willing to communicate with (presumably without

the knowledge that the retailer is corrupt), and Bob is an honest institution (for example, a bank) whom

Alice conducts legitimate business with. In this scenario, the honest bank B is duped by the malicious

retailer I who is pertaining to be Alice. The protocol narration below describes the attack (with the

omission of key distribution).

A → I : aenc((Na,A), pk(skI))

I → B : aenc((Na,A), pk(skB))

B → A : aenc((Na,Nb), pk(skA))

A → I : aenc(Nb, pk(skI))

I → B : aenc(Nb, pk(skB))

Lowe ﬁxes the protocol by the inclusion of Bob’s identity in message 6; that is,

′

) B → A : aenc((Na,Nb,B), pk(skA))

This correction allows Alice to verify whom she is running the protocol with and prevents the attack. In

the remainder of this chapter, we demonstrate how the Needham-Schroeder public key protocol can be

analyzed using ProVerif with various degrees of complexity.

5.1 Simpliﬁed Needham-Schroeder protocol

We begin our study with the investigation of a simplistic variant which allows us to concentrate on the

modeling process rather than the complexities of the protocol itself. Accordingly, we consider the essence

of the protocol which is speciﬁed as follows:

A → B : aenc((Na,pk(skA)), pk(skB))

B → A : aenc((Na,Nb), pk(skA))

A → B : aenc(Nb, pk(skB))

In this formalization, the role of the trusted key server is omitted and hence we assume that participants

Alice and Bob are in possession of the necessary public keys prior to the execution of the protocol. In

addition, Alice’s identity is modeled using her public key.

5.1.1 Basic encoding

The declarations are standard, they specify a public channel c and constructors/destructors required to

capture cryptographic primitives in the now familiar fashion:

1 free c : c hann el .

3 ( P u bl i c key e n c ry p ti o n )

4 type pkey .

5 type skey .

7 fun pk ( ske y ) : pkey .

8 fun aenc ( b i t s t r i n g , pkey ) : b i t s t r i n g .

9 reduc f o r a l l x : b i t s t r i n g , y : skey ; adec ( aenc (x , pk ( y ) ) , y ) = x .

11 ( S i g na t u r e s )

12 type spkey .

13 type ss ke y .

15 fun spk ( s sk ey ) : spkey .

16 fun s i g n ( b i t s t r i n g , s sk ey ) : b i t s t r i n g .

17 reduc f o r a l l x : b i t s t r i n g , y : s sk ey ; getmess ( si g n ( x , y ) ) = x .

18 reduc f o r a l l x : b i t s t r i n g , y : s sk ey ; c h e ck s ig n ( s i g n ( x , y ) , spk ( y ) ) = x .

5.1. SIMPLIFIED NEEDHAM-SCHROEDER PROTOCOL 67

20 ( Shared key en cr y p ti o n )

21 fun s enc ( b i t s t r i n g , b i t s t r i n g ) : b i t s t r i n g .

22 reduc f o r a l l x : b i t s t r i n g , y : b i t s t r i n g ; sd ec ( se nc ( x , y ) , y ) = x .

Process macros for A and B can now be declared and the main process can also be speciﬁed:

l et pro ce ssA (pkB : pkey , skA : skey ) =

in ( c , pkX : pkey ) ;

new Na : b i t s t r i n g ;

out ( c , aenc ( ( Na, pk ( skA ) ) , pkX ) ) ;

in ( c , m: b i t s t r i n g ) ;

l et (=Na , NX: b i t s t r i n g ) = adec (m, skA) in

out ( c , aenc (NX, pkX ) ) .

l et proc essB (pkA : pkey , skB : skey ) =

in ( c , m: b i t s t r i n g ) ;

l et (NY: b i t s t r i n g , pkY : pkey ) = adec (m, skB ) in

new Nb: b i t s t r i n g ;

out ( c , aenc ( (NY, Nb) , pkY ) ) ;

in ( c , m3: b i t s t r i n g ) ;

i f Nb = adec (m3, skB ) then 0 .

process

new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ;

new skB : s key ; l et pkB = pk ( skB ) in out ( c , pkB ) ;

( ( ! proc ess A (pkB , skA ) ) | ( ! proc essB (pkA , skB ) ) )

The main process begins by constructing the private keys skA and skB for principals A and B respectively.

The public parts pk(skA) and pk(skB) are then output on the public communication channel c, ensuring

they are available to the attacker. (Observe that this is done using the handles pkA and pkB for

convenience.) An unbounded number of instances of processA and processB are then instantiated (with

the relevant parameters), representing A and B’s willingness to participate in arbitrarily many sessions

of the protocol.

We assume that Alice is willing to run the protocol with any other principal; the choice of her inter-

locutor will be made by the environment. This is captured by modeling the ﬁrst input in(c, pkX: pkey)

to processA as the interlocutor’s public key pkX. The actual protocol then commences with Alice select-

ing her nonce Na, which she pairs with her identity pkA = pk(skA) and outputs the message encrypted

with her interlocutor’s public key pkX. Meanwhile, Bob awaits an input from his initiator; on receipt,

Bob decrypts the message to recover his initiator’s nonce NY and identity pkY. Bob then generates

his nonce Nb and sends the message (NY,Nb) encrypted for the initiator using the key pkY. Next, if

Alice believes she is talking to her interlocutor, that is, if the ciphertext she receives contains her nonce

Na, then she replies with aenc(Nb, pk(skB)). (Recall that only the interlocutor who has the secret key

corresponding to the public key part pkX should have been able to recover Na and hence if the ciphertext

contains her nonce, then she believes authentication of her interlocutor holds.) Finally, if the ciphertext

received by Bob contains his nonce Nb, then he believes that he has successfully completed the protocol

with his initiator.

5.1.2 Security properties

Recall that the primary objective of the protocol is mutual authentication of the principals Alice and

Bob. Accordingly, when A reaches the end of the protocol with the belief that she has done so with B,

then B has indeed engaged in a session with A; and vice-versa for B. We declare the events:

event beginAparam(pkey), which is used by Bob to record the belief that the initiator whose public

key is supplied as a parameter has commenced a run of the protocol with him.

68 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

event endAparam(pkey), which means that Alice believes she has successfully completed the pro-

tocol with Bob. This event is executed only when Alice believes she runs the protocol with Bob,

that is, when pkX = pkB. Alice supplies her public key pk(skA) as the parameter.

event beginBparam(pkey), which denotes Alice’s intention to initiate the protocol with an inter-

locutor whose public key is supplied as a parameter.

event endBparam(pkey), which records Bob’s belief that he has completed the protocol with Alice.

He supplies his public key pk(skB) as the parameter.

Intuitively, if Alice believes she has completed the protocol with Bob and hence executes the event

endAparam(pk(skA)), then there should have been an earlier occurrence of the event beginAparam(pk(

skA)), indicating that Bob started a session with Alice; moreover, the relationship should be injective.

A similar property should hold for Bob.

In addition, we wish to test if, at the end of the protocol, the nonces Na and Nb are secret. These

nonces are names created by new or variables such as NX and NY, while the standard secrecy queries

of ProVerif deal with the secrecy of private free names. To solve this problem, we can use the following

general technique: instead of directly testing the secrecy of the nonces, we use them as session keys in

order to encrypt some free name and test the secrecy of that free name. For instance, in the process

for Alice, we output senc(secretANa,Na) and we test the secrecy of secretANa: secretANa is secret if

and only if the nonce Na that Alice has is secret. Similarly, we output senc(secretANb,NX) and we

test the secrecy of secretANb: secretANb is secret if and only if NX (that is, the nonce Nb that Alice

has) is secret. We proceed symmetrically for Bob using secretBNa and secretBNb. (Alternatively, we

could also deﬁne a variable NaA to store the nonce Na that Alice has at the end of the protocol, and

test its secrecy using the query query secret NaA. We can proceed similarly using NbA to store the

nonce Nb on Alice’s side, and NaB and NbB to store the nonces on Bob’s side. This is done in the ﬁle

docs/NeedhamSchroederPK-var5.pv.)

Observe that the use of four names secretANa, secretANb, secretBNa, secretBNb for secrecy queries

allows us to analyze the precise point of failure; that is, we can study secrecy for Alice and secrecy for

Bob. Moreover, we can analyze both nonces Na and Nb independently for each of Alice and Bob.

The corresponding ProVerif code annotated with events and additional code to model secrecy, along

with the relevant queries, is presented as follows (ﬁle docs/NeedhamSchroederPK-var1.pv):

23 ( A u th e n t ic a t i on q u e r i e s )

24 event beginBparam ( pkey ) .

25 event endBparam ( pkey ) .

26 event beginAparam ( pkey ) .

27 event endAparam( pkey ) .

29 query x : pkey ; inj−event ( endBparam ( x ) ) ==> inj −event ( beginBparam ( x ) ) .

30 query x : pkey ; inj−event ( endAparam ( x ) ) ==> inj−event ( beginAparam ( x ) ) .

32 ( S ecr ecy q u e r i e s )

33 free secretANa , secretANb , secretBNa , secretBNb : b i t s t r i n g [ private ] .

35 query attacker ( secretANa ) ;

36 attacker ( secretANb ) ;

37 attacker ( secretBNa ) ;

38 attacker ( secretBNb ) .

40 ( A l ic e )

41 le t p roc ess A (pkB : pkey , skA : skey ) =

42 in ( c , pkX : pkey ) ;

43 event beginBparam (pkX ) ;

44 new Na : b i t s t r i n g ;

45 out ( c , aenc ( ( Na, pk ( skA ) ) , pkX ) ) ;

46 in ( c , m: b i t s t r i n g ) ;

5.1. SIMPLIFIED NEEDHAM-SCHROEDER PROTOCOL 69

47 l e t (=Na , NX: b i t s t r i n g ) = adec (m, skA) in

48 out ( c , aenc (NX, pkX ) ) ;

49 i f pkX = pkB then

50 event endAparam ( pk ( skA ) ) ;

51 out ( c , sen c ( secretANa , Na ) ) ;

52 out ( c , sen c ( secretANb , NX) ) .

54 ( Bob )

55 le t pro cessB (pkA : pkey , skB : skey ) =

56 in ( c , m: b i t s t r i n g ) ;

57 l e t (NY: b i t s t r i n g , pkY : pkey ) = adec (m, skB ) in

58 event beginAparam (pkY ) ;

59 new Nb: b i t s t r i n g ;

60 out ( c , aenc ( (NY, Nb) , pkY ) ) ;

61 in ( c , m3: b i t s t r i n g ) ;

62 i f Nb = adec (m3, skB ) then

63 i f pkY = pkA then

64 event endBparam ( pk ( skB ) ) ;

65 out ( c , sen c ( secretBNa , NY) ) ;

66 out ( c , sen c ( secretBNb , Nb ) ) .

68 ( Main )

69 process

70 new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ;

71 new skB : s key ; l et pkB = pk ( skB ) in out ( c , pkB ) ;

72 ( ( ! processA ( pkB , skA ) ) | ( ! p roces sB (pkA , skB ) ) )

Analyzing the simpliﬁed Needham-Schroeder protocol. Executing the Needham-Schroeder pro-

tocol with the command ./proverif docs/NeedhamSchroederPK-var1.pv | grep "RES" produces the

output:

RESULT not attacker ( secretANa [ ] ) i s tr u e .

RESULT not attacker ( secretANb [ ] ) i s t r ue .

RESULT not attacker ( secretBNa [ ] ) i s f a l s e .

RESULT not attacker ( secretBNb [ ] ) i s f a l s e .

RESULT i n j −event ( endAparam ( x

56 9 ) ) ==> i nj −event ( beginAparam ( x 56 9 ) ) i s t r ue .

RESULT i n j −event ( endBparam ( x 999 ) ) ==> in j −event ( beginBparam ( x 9 99 ) ) i s f a l s e .

RESULT ( even event ( endBparam ( x 1486 ) ) ==> event ( beginBparam ( x 1486 ) ) i s f a l s e . )

As we would expect, this means that the authentication of B to A and secrecy for A hold; whereas

authentication of A to B and secrecy for B are violated. Notice how the use of four independent queries

for secrecy makes the task of evaluating the output easier. In addition, we learn

RESULT ( even event ( endBparam ( x 1486 ) ) ==> event ( beginBparam ( x 1486 ) ) i s f a l s e . )

which means that even the non-injective authentication of A to B is false; that is, Bob may end the

protocol thinking he talks to Alice while Alice has never run the protocol with Bob. For the query

attacker(secretBNa[]), ProVerif returns the following trace of an attack:

1 new skA c r e a t i n g skA 411 at {1}

2 out ( c , pk ( skA 411 ) ) at {3}

3 new skB c r e a t i n g skB 412 at {4}

4 out ( c , pk ( skB 412 ) ) at {6}

5 in ( c , pk ( a ) ) at {8} in copy a 408

6 event ( beginBparam ( pk ( a ) ) ) at {9} in copy a

408

7 new Na c r e a t i n g Na 410 at {10} in copy a 4 08

8 out ( c , aenc ( ( Na 410 , pk ( skA 411 ) ) , pk ( a ) ) ) at {11} in copy a 408

9 in ( c , aenc ( ( Na 410 , pk ( skA 411 ) ) , pk ( skB 412 ) ) ) at {20} in copy a 409

70 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

10 event ( beginAparam ( pk ( skA 411 ) ) ) at {22} in copy a 409

11 new Nb c r e a t i n g Nb 413 at {23} in copy a 4 09

12 out ( c , aenc ( ( Na 410 , Nb 413 ) , pk ( skA 411 ) ) ) at {24} in copy a 409

13 in ( c , aenc ( ( Na 410 , Nb 413 ) , pk ( skA 411 ) ) ) at {12} in copy a 408

14 out ( c , aenc ( Nb 413 , pk ( a ) ) ) at {14} in copy a 408

15 in ( c , aenc ( Nb 413 , pk ( skB 412 ) ) ) at {25} in copy a 4 09

16 event ( endBparam ( pk ( skB 412 ) ) ) at {28} in copy a 409

17 out ( c , s enc ( secretBNa , Na 410 ) ) at {29} in copy a 4 09

18 out ( c , s enc ( secretBNb , Nb 413 ) ) at {30} in copy a 4 09

19 The attacker has th e message secretBNa .

This trace corresponds to Lowe’s attack. The ﬁrst two new and outputs correspond to the creation of

the secret keys and outputs of the public keys of A and B in the main process. Next, processA starts,

inputting the public key pk(a) of its interlocutor: a has been generated by the attacker, so this inter-

locutor is dishonest. A then sends the ﬁrst message of the protocol aenc((Na 410,pk(skA 411)),pk(a))

(Line 8 of the trace). This message is received by B after having been decrypted and reencrypted

under pkB by the attacker. It looks like a message for a session between A and B, B replies with

aenc((Na 410,Nb 413),pk(skA 411)) which is then received by A. A replies with aenc(Nb 413,pk(a)).

This message is again received by B after having been decrypted and reencrypted under pkB by the

attacker. B has then apparently concluded a session with A, so it sends senc(secretBNa,Na 410). The

attacker has obtained Na 410 by decrypting the message aenc((Na 410,pk(skA 411)),pk(a)) (sent at

Line 8 of the trace), so it can compute secretBNa, thus breaking secrecy. The traces found for the other

queries are similar.

5.2 Full Needham-Schroeder protocol

In this section, we will present a model of the full protocol and will demonstrate the use of some ProVerif

features. (A more generic model is presented in Section 5.3.) In this formalization, we preserve the

types of the Needham-Schroeder protocol more closely. In particular, we model the type nonce (rather

than bitstring) and we introduce the type host for participants identities. Accordingly, we make use

of type conversion where necessary. Since the modeling process should now be familiar, we present the

complete encoding, which can be found in the ﬁle docs/NeedhamSchroederPK-var2.pv, and then discuss

particular aspects.

1 free c : c hann el .

3 ( P u bl i c key e n c ry p ti o n )

4 type pkey .

5 type skey .

7 fun pk ( ske y ) : pkey .

8 fun aenc ( b i t s t r i n g , pkey ) : b i t s t r i n g .

9 reduc f o r a l l x : b i t s t r i n g , y : skey ; adec ( aenc (x , pk ( y ) ) , y ) = x .

11 ( S i g na t u r e s )

12 type spkey .

13 type ss ke y .

15 fun spk ( s sk ey ) : spkey .

16 fun s i g n ( b i t s t r i n g , s sk ey ) : b i t s t r i n g .

17 reduc f o r a l l x : b i t s t r i n g , y : s sk ey ; getmess ( si g n ( x , y ) ) = x .

18 reduc f o r a l l x : b i t s t r i n g , y : s sk ey ; c h e ck s ig n ( s i g n ( x , y ) , spk ( y ) ) = x .

20 ( Shared key en cr y p ti o n )

21 type nonce .

5.2. FULL NEEDHAM-SCHROEDER PROTOCOL 71

23 fun s enc ( b i t s t r i n g , nonce ) : b i t s t r i n g .

24 reduc f o r a l l x : b i t s t r i n g , y : nonce ; sd ec ( se nc ( x , y ) , y ) = x .

26 ( Type c o n v e r te r )

27 fun n o n c e t o b i t s t r i n g ( nonce ) : b i t s t r i n g [ data , typeConverter ] .

29 ( Two h on est h o st names A and B )

30 type h os t .

31 free A, B: ho st .

33 ( Key t a b l e )

34 table key s ( host , pkey ) .

36 ( A u th e n t ic a t i on q u e r i e s )

37 event beginBparam ( ho st ) .

38 event endBparam ( ho st ) .

39 event beginAparam ( h ost ) .

40 event endAparam( ho st ) .

42 query x : ho st ; inj−event ( endBparam ( x ) ) ==> inj −event ( beginBparam ( x ) ) .

43 query x : ho st ; inj−event ( endAparam ( x ) ) ==> inj −event ( beginAparam ( x ) ) .

45 ( S ecr ecy q u e r i e s )

46 free secretANa , secretANb , secretBNa , secretBNb : b i t s t r i n g [ private ] .

48 query attacker ( secretANa ) ;

49 attacker ( secretANb ) ;

50 attacker ( secretBNa ) ;

51 attacker ( secretBNb ) .

53 ( A l ic e )

54 le t p roc ess A ( pkS : spkey , skA : skey ) =

55 in ( c , hostX : h ost ) ;

56 event beginBparam ( hostX ) ;

57 out ( c , (A, hostX ) ) ; ( msg 1 )

58 in ( c , ms : b i t s t r i n g ) ; ( msg 2 )

59 l e t (pkX : pkey , =hostX ) = ch e c ks i gn (ms , pkS ) in

60 new Na : nonce ;

61 out ( c , aenc ( ( Na, A) , pkX ) ) ; ( msg 3 )

62 in ( c , m: b i t s t r i n g ) ; ( msg 6 )

63 l e t (=Na , NX: nonce ) = adec (m, skA) in

64 out ( c , aenc ( n o n c e t o b i t s t r i n g (NX) , pkX ) ) ; ( msg 7 )

65 i f hostX = B then

66 event endAparam (A) ;

67 out ( c , sen c ( secretANa , Na ) ) ;

68 out ( c , sen c ( secretANb , NX) ) .

70 ( Bob )

71 le t pro cessB ( pkS : spkey , skB : skey ) =

72 in ( c , m: b i t s t r i n g ) ; ( msg 3 )

73 l e t (NY: nonce , hostY : h ost ) = adec (m, skB ) in

74 event beginAparam ( hostY ) ;

75 out ( c , (B, hostY ) ) ; ( msg 4 )

76 in ( c , ms : b i t s t r i n g ) ; ( msg 5 )

77 l e t (pkY : pkey ,= hostY ) = c he c k si g n (ms , pkS ) in

72 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

78 new Nb: nonce ;

79 out ( c , aenc ( (NY, Nb) , pkY ) ) ; ( msg 6 )

80 in ( c , m3: b i t s t r i n g ) ; ( msg 7 )

81 i f n o n c e t o b i t s t r i n g (Nb) = adec (m3, skB ) then

82 i f hostY = A then

83 event endBparam (B ) ;

84 out ( c , sen c ( secretBNa , NY) ) ;

85 out ( c , sen c ( secretBNb , Nb ) ) .

87 ( Trusted key s e r v e r )

88 le t pr o ce s sS ( skS : ss ke y ) =

89 in ( c , ( a : host , b : ho st ) ) ;

90 get keys(=b , sb ) in

91 out ( c , s i g n ( ( sb , b ) , skS ) ) .

93 ( Key r e g i s t r a t i o n )

94 le t processK =

95 in ( c , ( h : host , k : pkey ) ) ;

96 i f h <> A && h <> B then inse rt keys (h , k ) .

98 ( Main )

99 process

100 new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ; i nsert keys (A, pkA ) ;

101 new skB : s key ; l e t pkB = pk ( skB ) in out ( c , pkB ) ; inse rt keys (B, pkB ) ;

102 new skS : s sk ey ; l e t pkS = spk ( skS ) in out ( c , pkS ) ;

103 ( ( ! processA ( pkS , skA ) ) | ( ! p roces sB ( pkS , skB ) ) |

104 ( ! p r oc e ss S ( skS ) ) | ( ! processK ) )

This process uses a key table in order to relate host names and their public keys. The key table is

declared by table keys(host, pkey). Keys are inserted in the key table in the main process (for the

honest hosts A and B, by insert keys(A, pkA) and insert keys(B, pkB)) and in a key registration

process processK for dishonest hosts. The key server processS looks up the key corresponding to host

b by get keys(=b, sb) in order to build the corresponding certiﬁcate. Since Alice is willing to run the

protocol with any other participant and she will request her interlocutor’s public key from the key server,

we must permit the attacker to register keys with the trusted key server (that is, insert keys into the key

table). This behavior is captured by the key registration process processK. Observe that the conditional

if h <> A && h <> B then prevents the attacker from changing the keys belonging to Alice and Bob.

(Recall that when several records are matched by a get query, then one possibility is chosen, but ProVerif

considers all possibilities when reasoning; without the conditional, the attacker can therefore eﬀectively

change the keys belonging to Alice and Bob.)

Evaluating security properties of the Needham-Schroeder protocol. Once again ProVerif is

able to conclude that authentication of B to A and secrecy for A hold, whereas authentication of A to

B and secrecy for B are violated. We omit analyzing the output produced by ProVerif and leave this as

an exercise for the reader.

5.3 Generalized Needham-Schroeder protocol

In the previous section, we considered an undesirable restriction on the participants; namely that the

initiator was played by Alice using the public key pk(skA) and the responder played by Bob using the

public key pk(skB). In this section, we generalize our encoding. Additionally, we also model authentica-

tion as full agreement, that is, agreement on all protocol parameters. The reader will also notice that we

use encrypt and decrypt instead of aenc and adec, and sencrypt and sdecrypt instead of senc and sdec.

The following script can be found in the ﬁle docs/NeedhamSchroederPK-var3.pv.

5.3. GENERALIZED NEEDHAM-SCHROEDER PROTOCOL 73

1 ( Loops i f t y pe s are i g no re d )

2 set ign oreTy pes = f a l s e .

4 free c : c hann el .

6 type h os t .

7 type nonce .

8 type pkey .

9 type skey .

10 type spkey .

11 type ss ke y .

13 fun n o n c e t o b i t s t r i n g ( nonce ) : b i t s t r i n g [ data , typeConverter ] .

15 ( P u bl i c key e n c ry p ti o n )

16 fun pk ( ske y ) : pkey .

17 fun e ncr ypt ( b i t s t r i n g , pkey ) : b i t s t r i n g .

18 reduc f o r a l l x : b i t s t r i n g , y : skey ; de cry pt ( e ncr ypt ( x , pk ( y ) ) , y ) = x .

20 ( S i g na t u r e s )

21 fun spk ( s sk ey ) : spkey .

22 fun s i g n ( b i t s t r i n g , s sk ey ) : b i t s t r i n g .

23 reduc f o r a l l m: b i t s t r i n g , k : ss ke y ; getmess ( si g n (m, k ) ) = m.

24 reduc f o r a l l m: b i t s t r i n g , k : ss ke y ; c he c k si g n ( si g n (m, k ) , spk ( k ) ) = m.

26 ( Shared key en cr y p ti o n )

27 fun s en cr y pt ( b i t s t r i n g , nonce ) : b i t s t r i n g .

28 reduc f o r a l l x : b i t s t r i n g , y : nonce ; s d ec ry p t ( s en cr y pt ( x , y ) , y ) = x .

30 ( S ecr ecy ass umpt io ns )

31 not attacker (new skA ) .

32 not attacker (new skB ) .

33 not attacker (new skS ) .

35 ( 2 ho ne st h os t names A and B )

36 free A, B: ho st .

38 table key s ( host , pkey ) .

40 ( Qu er ie s )

41 free secretANa , secretANb , secretBNa , secretBNb : b i t s t r i n g [ private ] .

42 query attacker ( secretANa ) ;

43 attacker ( secretANb ) ;

44 attacker ( secretBNa ) ;

45 attacker ( secretBNb ) .

47 event beginBparam ( host , hos t ) .

48 event endBparam ( host , ho st ) .

49 event beginAparam ( host , ho st ) .

50 event endAparam( host , ho st ) .

51 event b e g i n B f u l l ( host , host , pkey , pkey , nonce , nonce ) .

52 event e n dB f u ll ( host , host , pkey , pkey , nonce , nonce ) .

53 event b e g i n A f u l l ( host , host , pkey , pkey , nonce , nonce ) .

54 event e nd A fu ll ( host , host , pkey , pkey , nonce , nonce ) .

74 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

56 query x : host , y : h ost ;

57 inj−event ( endBparam ( x , y ) ) ==> inj−event ( beginBparam ( x , y ) ) .

59 query x1 : host , x2 : host , x3 : pkey , x4 : pkey , x5 : nonce , x6 : nonce ;

60 inj−event ( e n dB f ul l ( x1 , x2 , x3 , x4 , x5 , x6 ) )

61 ==> inj−event ( be g i n B f u l l ( x1 , x2 , x3 , x4 , x5 , x6 ) ) .

63 query x : host , y : h ost ;

64 inj−event ( endAparam ( x , y ) ) ==> inj−event ( beginAparam ( x , y ) ) .

66 query x1 : host , x2 : host , x3 : pkey , x4 : pkey , x5 : nonce , x6 : nonce ;

67 inj−event ( e nd Af ul l ( x1 , x2 , x3 , x4 , x5 , x6 ) )

68 ==> inj−event ( b e g i n A f u l l ( x1 , x2 , x3 , x4 , x5 , x6 ) ) .

70 ( Role of t he i n i t i a t o r w it h i d e n t i t y xA and s e c r e t key skxA )

71 le t p r o c e s s I n i t i a t o r ( pkS : spkey , skA : skey , skB : skey ) =

72 ( The a t t a c k e r s t a r t s th e i n i t i a t o r by c ho os in g i d e n t i t y xA ,

73 and i t s i n t e r l o c u t o r xB0 .

74 We che ck t h a t xA i s ho ne st ( i . e . i s A or B)

75 and g e t i t s co rr e sp on di n g key . )

76 in ( c , (xA : host , hostX : ho st ) ) ;

77 i f xA = A | | xA = B then

78 l e t skxA = i f xA = A then skA el se skB in

79 l e t pkxA = pk ( skxA ) in

80 ( Real s t a r t o f t he r o l e )

81 event beginBparam (xA , hostX ) ;

82 ( Message 1 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

83 out ( c , (xA, hostX ) ) ;

84 ( Message 2 )

85 in ( c , ms : b i t s t r i n g ) ;

86 l e t (pkX : pkey , =hostX ) = ch e c ks i gn (ms , pkS ) in

87 ( Message 3 )

88 new Na : nonce ;

89 out ( c , enc ryp t ( ( Na , xA) , pkX ) ) ;

90 ( Message 6 )

91 in ( c , m: b i t s t r i n g ) ;

92 l e t (=Na , NX2: nonce ) = de cry pt (m, skxA ) in

93 event b e g i n B f u l l (xA, hostX , pkX , pkxA , Na, NX2 ) ;

94 ( Message 7 )

95 out ( c , enc ryp t ( n o n c e

t o b i t s t r i n g (NX2) , pkX ) ) ;

96 ( OK )

97 i f hostX = B | | hostX = A then

98 event endAparam (xA, hostX ) ;

99 event e nd Af ul l (xA, hostX , pkX , pkxA , Na, NX2 ) ;

100 out ( c , s en cr y pt ( secretANa , Na ) ) ;

101 out ( c , s en cr y pt ( secretANb , NX2 ) ) .

102

103 ( Role of t he re spo nde r wi th i d e n t i t y xB and s e c r e t key skxB )

104 le t pr oce ssR esp ond er ( pkS : spkey , skA : skey , skB : skey ) =

105 ( The a t t a c k e r s t a r t s th e resp on der by c ho osi ng i d e n t i t y xB .

106 We che ck t h a t xB i s h on es t ( i . e . i s A or B) . )

107 in ( c , xB : h ost ) ;

108 i f xB = A | | xB = B then

109 l e t skxB = i f xB = A then skA el se skB in

110 l e t pkxB = pk ( skxB ) in

5.3. GENERALIZED NEEDHAM-SCHROEDER PROTOCOL 75

111 ( Real s t a r t o f t he r o l e )

112 ( Message 3 )

113 in ( c , m: b i t s t r i n g ) ;

114 l e t (NY: nonce , hostY : h ost ) = d ecr ypt (m, skxB ) in

115 event beginAparam ( hostY , xB ) ;

116 ( Message 4 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

117 out ( c , (xB , hostY ) ) ;

118 ( Message 5 )

119 in ( c , ms : b i t s t r i n g ) ;

120 l e t (pkY : pkey ,= hostY ) = c he c k si g n (ms , pkS ) in

121 ( Message 6 )

122 new Nb: nonce ;

123 event b e g i n A f u l l ( hostY , xB , pkxB , pkY , NY, Nb ) ;

124 out ( c , enc ryp t ( (NY, Nb) , pkY ) ) ;

125 ( Message 7 )

126 in ( c , m3: b i t s t r i n g ) ;

127 i f n o n c e t o b i t s t r i n g (Nb) = de cry pt (m3, skB ) then

128 ( OK )

129 i f hostY = A | | hostY = B then

130 event endBparam ( hostY , xB ) ;

131 event e nd B fu l l ( hostY , xB , pkxB , pkY , NY, Nb ) ;

132 out ( c , s en cr y pt ( secretBNa , NY) ) ;

133 out ( c , s en cr y pt ( secretBNb , Nb ) ) .

134

135 ( Se rve r )

136 le t pr o ce s sS ( skS : ss ke y ) =

137 in ( c , ( a : host , b : ho st ) ) ;

138 get keys(=b , sb ) in

139 out ( c , s i g n ( ( sb , b ) , skS ) ) .

140

141 ( Key r e g i s t r a t i o n )

142 le t processK =

143 in ( c , ( h : host , k : pkey ) ) ;

144 i f h <> A && h <> B then inse rt keys (h , k ) .

145

146 ( Main )

147 process

148 new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ; i nsert keys (A, pkA ) ;

149 new skB : s key ; l e t pkB = pk ( skB ) in out ( c , pkB ) ; inse rt keys (B, pkB ) ;

150 new skS : s sk ey ; l e t pkS = spk ( skS ) in out ( c , pkS ) ;

151 (

152 ( Launch an unbounded number o f s e s s i o n s o f th e i n i t i a t o r )

153 ( ! p r o c e s s I n i t i a t o r ( pkS , skA , skB ) ) |

154 ( Launch an unbounded number o f s e s s i o n s o f th e re sp ond er )

155 ( ! pr oce ssR esp ond er ( pkS , skA , skB ) ) |

156 ( Launch an unbounded number o f s e s s i o n s o f th e s e r v e r )

157 ( ! p r oc e ss S ( skS ) ) |

158 ( Key r e g i s t r a t i o n pr o c es s )

159 ( ! processK )

160 )

The main novelty of this script is that it allows Alice and Bob to play both roles of the initiator and

responder. To achieve this, we could simply duplicate the code, but it is possible to have more elegant

encodings. Above, we consider processes processInitiator and processResponder that take as argument

both skA and skB (since they can be played by Alice and Bob). Looking for instance at the initiator

(Lines 71–79), the attacker ﬁrst starts the initiator by sending the identity xA of the principal playing

76 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

the role of the initiator and hostX of its interlocutor. Then, we verify that the initiator is honest, and

compute its secret key skxA (skA for A, skB for B) and its corresponding public key pkxA = pk(skxA).

We can then run the role as expected. We proceed similarly for the responder.

Other encodings are also possible. For instance, we could deﬁne a destructor choosekey by

fun choose key ( host , host , host , skey , ske y ) : skey

reduc f o r a l l x1 : host , x2 : host , sk1 : skey , sk2 : ske y ;

choose key ( x1 , x1 , x2 , sk1 , sk2 ) = sk1

otherwise f o r a l l x1 : host , x2 : host , sk1 : skey , sk2 : skey ;

choose key ( x2 , x1 , x2 , sk1 , sk2 ) = sk2 .

and let skxA be choosekey(xA, A, B, skA, skB) (if xA = A, it returns skA; if xA = B, it returns skB;

otherwise, it fails). The latter encoding is perhaps less intuitive, but it avoids internal code duplication

when ProVerif expands tests that appear in terms.

Three other points are worth noting:

We use secrecy assumptions (Lines 30–33) to speed up the resolution process of ProVerif. These

lines inform ProVerif that the attacker cannot have the secret keys skA, skB, skS. This information

is checked by ProVerif, so that erroneous proofs cannot be obtained even with secrecy assumptions.

(See also Section 6.7.2.) Lines 30–33 can be removed without changing the results, ProVerif will

just be slightly slower.

We set ignoreTypes to false (Lines 1–2). By default, ProVerif ignore all types during analysis.

However, for this script, it does not terminate with this default setting. By setting ignoreTypes =

false , the semantics of processes is changed to check the types. This setting makes it possible

to obtain termination. The known attack against this protocol is detected, but it might happen

that some type ﬂaw attacks are undetected, when they appear when the types are not checked in

processes. More details on the ignoreTypes setting can be found in Section 6.6.2.

There are other ways of obtaining termination in this example, in particular by using a diﬀerent

method for relating identities and keys with two function symbols, one that maps the key to the

identity, and one that maps the identity to the key. However, this method also has limitations: it

does not allow the attacker to create two principals with the same key. More information on this

method can be found in Section 6.7.3.

We use two diﬀerent levels of authentication: the events that end with “full” serve in proving

Lowe’s full agreement [Low97], that is, agreement on all parameters of the protocol (here, host

names, keys, and nonces). The events that end with “param” prove agreement on the host names

only.

As expected, ProVerif is able to prove the authentication of the responder and secrecy for the initiator;

whereas authentication of the initiator and secrecy for the responder fail. The reader is invited to modify

the protocol according to Lowe’s ﬁx and examine the results produced by ProVerif. (A script for the

corrected protocol can be found in examples/pitype/secr-auth/NeedhamSchroederPK-corr.pv. If

you installed by OPAM in the switch ⟨switch⟩, it is in

/.opam/⟨switch⟩/doc/proverif/examples/

pitype/secr-auth/NeedhamSchroederPK-corr.pv. Note that the ﬁxed protocol can be proved correct

by ProVerif even when types are ignored.)

5.4 Variants of these security properties

In this section, we consider several security properties of Lowe’s corrected version of the Needham-

Schroeder public key protocol.

5.4.1 A variant of mutual authentication

In the previous deﬁnitions of authentication that we have considered, we require that internal parameters

of the protocol (such as nonces) are the same for the initiator and for the responder. However, in the

computational model, one generally uses a session identiﬁer that is publicly computable (such as the

5.4. VARIANTS OF THESE SECURITY PROPERTIES 77

tuple of the messages of the protocol) as argument of events. One can also do that in ProVerif, as in the

following example (ﬁle docs/NeedhamSchroederPK-corr-mutual-auth.pv).

1 ( Qu er ie s )

2 fun messtermI ( host , ho st ) : b i t s t r i n g [ data ] .

3 fun messtermR ( host , h ost ) : b i t s t r i n g [ data ] .

5 event termI ( host , host , b i t s t r i n g ) .

6 event a c c e p t s I ( host , host , b i t s t r i n g ) .

7 event acceptsR ( host , host , b i t s t r i n g ) .

8 event termR ( host , host , b i t s t r i n g ) .

10 query x : host , m: b i t s t r i n g ;

11 inj−event ( termI ( x , B,m) ) ==> inj−event ( acceptsR ( x , B,m) ) .

12 query x : host , m: b i t s t r i n g ;

13 inj−event ( termR (A, x ,m) ) ==> inj−event ( a c c e p t s I (A, x ,m) ) .

15 ( Role of t he i n i t i a t o r w it h i d e n t i t y xA and s e c r e t key skxA )

16 le t p r o c e s s I n i t i a t o r ( pkS : spkey , skA : skey , skB : skey ) =

17 ( The a t t a c k e r s t a r t s th e i n i t i a t o r by c ho os in g i d e n t i t y xA ,

18 and i t s i n t e r l o c u t o r xB0 .

19 We che ck t h a t xA i s ho ne st ( i . e . i s A or B)

20 and g e t i t s co rr e sp on di n g key .

21 )

22 in ( c , (xA : host , hostX : ho st ) ) ;

23 i f xA = A | | xA = B then

24 l e t skxA = i f xA = A then skA el se skB in

25 l e t pkxA = pk ( skxA ) in

26 ( Real s t a r t o f t he r o l e )

27 ( Message 1 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

28 out ( c , (xA, hostX ) ) ;

29 ( Message 2 )

30 in ( c , ms : b i t s t r i n g ) ;

31 l e t (pkX : pkey , =hostX ) = ch e c ks i gn (ms , pkS ) in

32 ( Message 3 )

33 new Na : nonce ;

34 l e t m3 = enc ryp t ( ( Na , xA) , pkX) in

35 out ( c , m3 ) ;

36 ( Message 6 )

37 in ( c , m: b i t s t r i n g ) ;

38 l e t (=Na , NX2: nonce , =hostX ) = de cry pt (m, skA ) in

39 l e t m7 = enc ryp t ( n o n c e t o b i t s t r i n g (NX2) , pkX) in

40 event termI (xA , hostX , (m3, m) ) ;

41 event a c c e p t s I (xA, hostX , (m3, m, m7) ) ;

42 ( Message 7 )

43 out ( c , (m7, messtermI (xA, hostX ) ) ) .

45 ( Role of t he re spo nde r wi th i d e n t i t y xB and s e c r e t key skxB )

46 le t pr oce ssR esp ond er ( pkS : spkey , skA : skey , skB : skey ) =

47 ( The a t t a c k e r s t a r t s th e resp on der by c ho osi ng i d e n t i t y xB .

48 We che ck t h a t xB i s h on es t ( i . e . i s A or B) . )

49 in ( c , xB : h ost ) ;

50 i f xB = A | | xB = B then

51 l e t skxB = i f xB = A then skA el se skB in

52 l e t pkxB = pk ( skxB ) in

53 ( Real s t a r t o f t he r o l e )

78 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

54 ( Message 3 )

55 in ( c , m: b i t s t r i n g ) ;

56 l e t (NY: nonce , hostY : h ost ) = d ecr ypt (m, skxB ) in

57 ( Message 4 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

58 out ( c , (xB , hostY ) ) ;

59 ( Message 5 )

60 in ( c , ms : b i t s t r i n g ) ;

61 l e t (pkY : pkey ,= hostY ) = c he c k si g n (ms , pkS ) in

62 ( Message 6 )

63 new Nb: nonce ;

64 l e t m6 = enc ryp t ( (NY, Nb , xB) , pkY) in

65 event acceptsR ( hostY , xB , (m, m6 ) ) ;

66 out ( c , m6 ) ;

67 ( Message 7 )

68 in ( c , m3: b i t s t r i n g ) ;

69 i f n o n c e t o b i t s t r i n g (Nb) = de cry pt (m3, skB ) then

70 event termR ( hostY , xB , (m, m6, m3 ) ) ;

71 out ( c , messtermR ( hostY , xB ) ) .

73 ( Se rve r )

74 le t pr o ce s sS ( skS : ss ke y ) =

75 in ( c , ( a : host , b : ho st ) ) ;

76 get keys(=b , sb ) in

77 out ( c , s i g n ( ( sb , b ) , skS ) ) .

79 ( Key r e g i s t r a t i o n )

80 le t processK =

81 in ( c , ( h : host , k : pkey ) ) ;

82 i f h <> A && h <> B then inse rt keys (h , k ) .

84 ( S t a r t p r oc e s s )

85 process

86 new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ; i nsert keys (A, pkA ) ;

87 new skB : s key ; l et pkB = pk ( skB ) in out ( c , pkB ) ; inse rt keys (B, pkB ) ;

88 new skS : s sk ey ; l e t pkS = spk ( skS ) in out ( c , pkS ) ;

89 (

90 ( Launch an unbounded number o f s e s s i o n s o f th e i n i t i a t o r )

91 ( ! p r o c e s s I n i t i a t o r ( pkS , skA , skB ) ) |

92 ( Launch an unbounded number o f s e s s i o n s o f th e re sp ond er )

93 ( ! pr oce ssR esp ond er ( pkS , skA , skB ) ) |

94 ( Launch an unbounded number o f s e s s i o n s o f th e s e r v e r )

95 ( ! p r oc e ss S ( skS ) ) |

96 ( Key r e g i s t r a t i o n pr o c es s )

97 ( ! processK )

98 )

The query

10 query x : host , m: b i t s t r i n g ;

11 inj−event ( termI ( x , B,m) ) ==> inj−event ( acceptsR ( x , B,m) ) .

corresponds to the authentication of the responder B to the initiator x: when the initiator x terminates a

session apparently with B (event termI(x,B,m), executed at Line 40, when the initiator terminates, after

receiving its last message, message 6), the responder B has accepted with x (event acceptsR(x,B,m),

executed at Line 65, when the responder accepts, just before sending message 6). We use a ﬁxed value B

for the name of the responder, and not a variable, because if a variable were used, the initiator might run

a session with a dishonest participant included in the attacker, and in this case, it is perfectly ok that

5.4. VARIANTS OF THESE SECURITY PROPERTIES 79

the event acceptsR is not executed. Since the initiator is executed with identities A and B, x is either A

or B, so the query above proves correct authentication of the responder B to the initiator x when x is A

and when it is B. The same property for the responder A holds by symmetry, swapping A and B.

Similarly, the query

12 query x : host , m: b i t s t r i n g ;

13 inj−event ( termR (A, x ,m) ) ==> inj−event ( a c c e p t s I (A, x ,m) ) .

corresponds to the authentication of the initiator A to the responder x: when the responder x terminates

a session apparently with A (event termR(A,x,m), executed at Line 70, when the responder terminates,

after receiving its last message, message 7), the initiator A has accepted with x (event acceptsI(A,x,m),

executed at Line 41, when the initiator accepts, just before sending message 7).

The position of events follows Figure 3.4. The events termR and acceptsI take as arguments the host

names of the initiator and the responder, and the tuples of messages exchanged between the initiator

and the responder. (Messages sent to or received from the server to obtain the certiﬁcates are ignored.)

Because the last message is from the initiator to the responder, that message is not known to the

responder when it accepts, so that message is omitted from the arguments of the events acceptsR and

termI.

5.4.2 Authenticated key exchange

In the computational model, the security of an authenticated key exchange protocol is typically shown

by proving, in addition to mutual authentication, that the exchanged key is indistinguishable from a

random key. More precisely, in the real-or-random model [AFP06], one allows the attacker to perform

several test queries, which either return the real key or a fresh random key, and these two cases must

be indistinguishable. When the test query is performed on a session between a honest and a dishonest

participant, the returned key is always the real one. When the test query is performed several times on

the same session, or on the partner session (that is, the session of the interlocutor that has the same

session identiﬁer), it returns the same key (whether real or random). Taking into account partnering in

the deﬁnition of test queries is rather tricky, so we have developed an alternative characterization that

does not require partnering [Bla07].

We use events similar to those for mutual authentication, except that termR and acceptsI take the

exchanged key as an additional argument. We prove the following properties:

query x : host , m: b i t s t r i n g ;

inj −event ( termI ( x , B,m) ) ==> inj−event ( acceptsR ( x , B,m) ) .

query x : host , k : nonce , m: b i t s t r i n g ;

inj −event ( termR(A, x , k ,m) ) ==> inj −event ( a c c e p t s I (A, x , k ,m) ) .

query x : host , k : nonce , k : nonce , m: b i t s t r i n g ;

event ( termR (A, x , k ,m) ) && event ( ac c e p t s I (A, x , k ,m) ) ==> k = k .

When the initiator or the responder execute a session with a dishonest participant, they output

the exchanged key. (This key is also output by the test queries in this case.) We show the secrecy

of the keys established by the initiator when it runs sessions with a honest responder, in the sense

that these keys are indistinguishable from independent random numbers.

The ﬁrst two correspondences imply mutual authentication. The real-or-random indistinguishability of

the key is obtained by combining the last two correspondences with the secrecy of the initiator’s key.

Intuitively, the correspondences allow us to show that each responder’s key in a session with a honest

initiator is in fact also an initiator’s key (which we can ﬁnd by looking for the same session identiﬁer), so

showing that the initiator’s key cannot be distinguished from independent random numbers is suﬃcient

to show the secrecy of the key.

Outputting the exchanged key in a session with a dishonest interlocutor allows to detect Unknown

Key Share (UKS) attacks [DvOW92], in which an initiator A believes he shares a key with a responder

B, but B believes he shares that key with a dishonest C. This key is then output to the attacker, so the

secrecy of the initiator’s key is broken. However, bilateral UKS attacks [CT08], in which A shares a key

80 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

with a dishonest C and B shares the same key with a dishonest D, may remain undetected under this

deﬁnition of key exchange. These attacks can be detected by testing the following correspondence:

query x : host , y : host , x : host , y : host , k : nonce , k : nonce ,

m: b i t s t r i n g , m : b i t s t r i n g ;

event ( termR ( x , y , k ,m) ) && event ( a c c e p t s I (x , y , k ,m ) ) ==> x = x && y = y .

to verify that, if two sessions terminate with the same key, then they are between the same hosts (and

we could additionally verify m = m to make sure that these sessions have the same session identiﬁers).

The following script aims at verifying this notion of authenticated key exchange, assuming that the

exchanged key is Na (ﬁle docs/NeedhamSchroederPK-corr-ake.pv).

1 ( Qu er ie s )

2 free se cr et A : b i t s t r i n g [ private ] .

3 query attacker ( se cr et A ) .

5 fun messtermI ( host , ho st ) : b i t s t r i n g [ data ] .

6 fun messtermR ( host , h ost ) : b i t s t r i n g [ data ] .

8 event termI ( host , host , b i t s t r i n g ) .

9 event a c c e p t s I ( host , host , nonce , b i t s t r i n g ) .

10 event acceptsR ( host , host , b i t s t r i n g ) .

11 event termR ( host , host , nonce , b i t s t r i n g ) .

13 query x : host , m: b i t s t r i n g ;

14 inj−event ( termI ( x , B,m) ) ==> inj−event ( acceptsR ( x , B,m) ) .

15 query x : host , k : nonce , m: b i t s t r i n g ;

16 inj−event ( termR (A, x , k ,m) ) ==> inj−event ( ac c e p t s I (A, x , k ,m) ) .

18 query x : host , k : nonce , k : nonce , m: b i t s t r i n g ;

19 event ( termR (A, x , k ,m) ) && event ( ac c e p t s I (A, x , k ,m) ) ==> k = k .

21 ( Query f o r d e t e c t i n g b i l a t e r a l UKS a t t a c k s )

22 query x : host , y : host , x : host , y : host , k : nonce , k : nonce ,

23 m: b i t s t r i n g , m : b i t s t r i n g ;

24 event ( termR ( x , y , k ,m) ) && event ( ac c e p t s I (x , y , k ,m ) ) ==> x = x && y = y .

26 ( Role of t he i n i t i a t o r w it h i d e n t i t y xA and s e c r e t key skxA )

27 le t p r o c e s s I n i t i a t o r ( pkS : spkey , skA : skey , skB : skey ) =

28 ( The a t t a c k e r s t a r t s th e i n i t i a t o r by c ho os in g i d e n t i t y xA ,

29 and i t s i n t e r l o c u t o r xB0 .

30 We che ck t h a t xA i s ho ne st ( i . e . i s A or B)

31 and g e t i t s co rr e sp on di n g key .

32 )

33 in ( c , (xA : host , hostX : ho st ) ) ;

34 i f xA = A | | xA = B then

35 l e t skxA = i f xA = A then skA el se skB in

36 l e t pkxA = pk ( skxA ) in

37 ( Real s t a r t o f t he r o l e )

38 ( Message 1 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

39 out ( c , (xA, hostX ) ) ;

40 ( Message 2 )

41 in ( c , ms : b i t s t r i n g ) ;

42 l e t (pkX : pkey , =hostX ) = ch e c ks i gn (ms , pkS ) in

43 ( Message 3 )

44 new Na : nonce ;

45 l e t m3 = enc ryp t ( ( Na , xA) , pkX) in

5.4. VARIANTS OF THESE SECURITY PROPERTIES 81

46 out ( c , m3 ) ;

47 ( Message 6 )

48 in ( c , m: b i t s t r i n g ) ;

49 l e t (=Na , NX2: nonce , =hostX ) = de cry pt (m, skA ) in

50 l e t m7 = enc ryp t ( n o n c e t o b i t s t r i n g (NX2) , pkX) in

51 event termI (xA , hostX , (m3, m) ) ;

52 event a c c e p t s I (xA, hostX , Na , (m3, m, m7 ) ) ;

53 ( Message 7 )

54 i f hostX = A | | hostX = B then

55 (

56 out ( c , s en c ry p t ( se cr et A , Na ) ) ;

57 out ( c , (m7, messtermI (xA, hostX ) ) )

58 )

59 el se

60 (

61 out ( c , Na ) ;

62 out ( c , (m7, messtermI (xA, hostX ) ) )

63 ) .

65 ( Role of t he re spo nde r wi th i d e n t i t y xB and s e c r e t key skxB )

66 le t pr oce ssR esp ond er ( pkS : spkey , skA : skey , skB : skey ) =

67 ( The a t t a c k e r s t a r t s th e resp on der by c ho osi ng i d e n t i t y xB .

68 We che ck t h a t xB i s h on es t ( i . e . i s A or B) . )

69 in ( c , xB : h ost ) ;

70 i f xB = A | | xB = B then

71 l e t skxB = i f xB = A then skA el se skB in

72 l e t pkxB = pk ( skxB ) in

73 ( Real s t a r t o f t he r o l e )

74 ( Message 3 )

75 in ( c , m: b i t s t r i n g ) ;

76 l e t (NY: nonce , hostY : h ost ) = d ecr ypt (m, skxB ) in

77 ( Message 4 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

78 out ( c , (xB , hostY ) ) ;

79 ( Message 5 )

80 in ( c , ms : b i t s t r i n g ) ;

81 l e t (pkY : pkey ,= hostY ) = c he c k si g n (ms , pkS ) in

82 ( Message 6 )

83 new Nb: nonce ;

84 l e t m6 = enc ryp t ( (NY, Nb , xB) , pkY) in

85 event acceptsR ( hostY , xB , (m, m6 ) ) ;

86 out ( c , m6 ) ;

87 ( Message 7 )

88 in ( c , m3: b i t s t r i n g ) ;

89 i f n o n c e t o b i t s t r i n g (Nb) = de cry pt (m3, skB ) then

90 event termR ( hostY , xB , NY, (m, m6, m3 ) ) ;

91 i f hostY = A | | hostY = B then

92 out ( c , messtermR ( hostY , xB) )

93 el se

94 (

95 out ( c , NY) ;

96 out ( c , messtermR ( hostY , xB) )

97 ) .

99 ( Se rve r )

100 le t pr o ce s sS ( skS : ss ke y ) =

82 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

101 in ( c , ( a : host , b : ho st ) ) ;

102 get keys(=b , sb ) in

103 out ( c , s i g n ( ( sb , b ) , skS ) ) .

104

105 ( Key r e g i s t r a t i o n )

106 le t processK =

107 in ( c , ( h : host , k : pkey ) ) ;

108 i f h <> A && h <> B then inse rt keys (h , k ) .

109

110 ( S t a r t p r oc e s s )

111 process

112 new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ; i nsert keys (A, pkA ) ;

113 new skB : s key ; l e t pkB = pk ( skB ) in out ( c , pkB ) ; inse rt keys (B, pkB ) ;

114 new skS : s sk ey ; l e t pkS = spk ( skS ) in out ( c , pkS ) ;

115 (

116 ( Launch an unbounded number o f s e s s i o n s o f th e i n i t i a t o r )

117 ( ! p r o c e s s I n i t i a t o r ( pkS , skA , skB ) ) |

118 ( Launch an unbounded number o f s e s s i o n s o f th e re sp ond er )

119 ( ! pr oce ssR esp ond er ( pkS , skA , skB ) ) |

120 ( Launch an unbounded number o f s e s s i o n s o f th e s e r v e r )

121 ( ! p r oc e ss S ( skS ) ) |

122 ( Key r e g i s t r a t i o n pr o c es s )

123 ( ! processK )

124 )

ProVerif ﬁnds a bilateral UKS attack: if C as responder runs a session with A, it gets Na, then D as

initiator can use the same nonce Na in a session with responder B, thus obtaining two sessions, between

A and C and between D and B, that share the same key Na. (Such an attack appears more generally

when the key is determined by a single participant of the protocol.) The other properties are proved by

ProVerif.

The above script veriﬁes syntactic secrecy of the initiator’s key Na. To be even closer to the compu-

tational deﬁnition, we can verify its secrecy using the real-or-random secrecy notion (page 60), as in the

following script (ﬁle docs/NeedhamSchroederPK-corr-ake-RoR.pv):

1 ( Termination messages )

2 fun messtermI ( host , ho st ) : b i t s t r i n g [ data ] .

3 fun messtermR ( host , h ost ) : b i t s t r i n g [ data ] .

5 set ign oreTy pes = f a l s e .

7 ( Role of t he i n i t i a t o r w it h i d e n t i t y xA and s e c r e t key skxA )

8 le t p r o c e s s I n i t i a t o r ( pkS : spkey , skA : skey , skB : skey ) =

9 ( The a t t a c k e r s t a r t s th e i n i t i a t o r by c ho os in g i d e n t i t y xA ,

10 and i t s i n t e r l o c u t o r xB0 .

11 We che ck t h a t xA i s ho ne st ( i . e . i s A or B)

12 and g e t i t s co rr e sp on di n g key .

13 )

14 in ( c , (xA : host , hostX : ho st ) ) ;

15 i f xA = A | | xA = B then

16 l e t skxA = i f xA = A then skA el se skB in

17 l e t pkxA = pk ( skxA ) in

18 ( Real s t a r t o f t he r o l e )

19 ( Message 1 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

20 out ( c , (xA, hostX ) ) ;

21 ( Message 2 )

22 in ( c , ms : b i t s t r i n g ) ;

5.4. VARIANTS OF THESE SECURITY PROPERTIES 83

23 l e t (pkX : pkey , =hostX ) = ch e c ks i gn (ms , pkS ) in

24 ( Message 3 )

25 new Na : nonce ;

26 l e t m3 = enc ryp t ( ( Na , xA) , pkX) in

27 out ( c , m3 ) ;

28 ( Message 6 )

29 in ( c , m: b i t s t r i n g ) ;

30 l e t (=Na , NX2: nonce , =hostX ) = de cry pt (m, skA ) in

31 l e t m7 = enc ryp t ( n o n c e t o b i t s t r i n g (NX2) , pkX) in

32 ( Message 7 )

33 i f hostX = A | | hostX = B then

34 (

35 new random : nonce ;

36 out ( c , choice [ Na , random ] ) ;

37 out ( c , (m7, messtermI (xA, hostX ) ) )

38 )

39 el se

40 (

41 out ( c , Na ) ;

42 out ( c , (m7, messtermI (xA, hostX ) ) )

43 ) .

45 ( Role of t he re spo nde r wi th i d e n t i t y xB and s e c r e t key skxB )

46 le t pr oce ssR esp ond er ( pkS : spkey , skA : skey , skB : skey ) =

47 ( The a t t a c k e r s t a r t s th e resp on der by c ho osi ng i d e n t i t y xB .

48 We che ck t h a t xB i s h on es t ( i . e . i s A or B) . )

49 in ( c , xB : h ost ) ;

50 i f xB = A | | xB = B then

51 l e t skxB = i f xB = A then skA el se skB in

52 l e t pkxB = pk ( skxB ) in

53 ( Real s t a r t o f t he r o l e )

54 ( Message 3 )

55 in ( c , m: b i t s t r i n g ) ;

56 l e t (NY: nonce , hostY : h ost ) = d ecr ypt (m, skxB ) in

57 ( Message 4 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

58 out ( c , (xB , hostY ) ) ;

59 ( Message 5 )

60 in ( c , ms : b i t s t r i n g ) ;

61 l e t (pkY : pkey ,= hostY ) = c he c k si g n (ms , pkS ) in

62 ( Message 6 )

63 new Nb: nonce ;

64 l e t m6 = enc ryp t ( (NY, Nb , xB) , pkY) in

65 out ( c , m6 ) ;

66 ( Message 7 )

67 in ( c , m3: b i t s t r i n g ) ;

68 i f n o n c e

t o b i t s t r i n g (Nb) = de cry pt (m3, skB ) then

69 i f hostY = A | | hostY = B then

70 out ( c , messtermR ( hostY , xB) )

71 el se

72 (

73 out ( c , NY) ;

74 out ( c , messtermR ( hostY , xB) )

75 ) .

77 ( Se rve r )

84 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

78 le t pr o ce s sS ( skS : ss ke y ) =

79 in ( c , ( a : host , b : ho st ) ) ;

80 get keys(=b , sb ) in

81 out ( c , s i g n ( ( sb , b ) , skS ) ) .

83 ( Key r e g i s t r a t i o n )

84 le t processK =

85 in ( c , ( h : host , k : pkey ) ) ;

86 i f h <> A && h <> B then inse rt keys (h , k ) .

88 ( S t a r t p r oc e s s )

89 process

90 new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ; i nsert keys (A, pkA ) ;

91 new skB : s key ; l et pkB = pk ( skB ) in out ( c , pkB ) ; inse rt keys (B, pkB ) ;

92 new skS : s sk ey ; l e t pkS = spk ( skS ) in out ( c , pkS ) ;

93 (

94 ( Launch an unbounded number o f s e s s i o n s o f th e i n i t i a t o r )

95 ( ! p r o c e s s I n i t i a t o r ( pkS , skA , skB ) ) |

96 ( Launch an unbounded number o f s e s s i o n s o f th e re sp ond er )

97 ( ! pr oce ssR esp ond er ( pkS , skA , skB ) ) |

98 ( Launch an unbounded number o f s e s s i o n s o f th e s e r v e r )

99 ( ! p r oc e ss S ( skS ) ) |

100 ( Key r e g i s t r a t i o n pr o c es s )

101 ( ! processK )

102 )

Line 36 outputs either the real key Na or a fresh random key, and the goal is to prove that the attacker

cannot distinguish these two situations. In order to obtain termination, we require that all code including

the attacker be well-typed (Line 5). This prevents in particular the generation of certiﬁcates in which

the host names are themselves nested signatures of unbounded depth. Unfortunately, ProVerif ﬁnds

a false attack in which the output key is used to build message 3 (either encrypt((Na, A), pkB) or

encrypt((random, A), pkB)), send it to the responder, which replies with message 6 (that is, encrypt((Na,

Nb, A), pkA) or encrypt((random, Nb, A), pkA)), which is accepted by the initiator if and only if the

key is the real key Na.

A similar veriﬁcation can be done with other possible keys (for instance, Nb, h(Na), h(Nb), h(Na,Nb)

where h is a hash function). We leave these veriﬁcations to the reader and just note that the false attack

above disappears for the key h(Na) (but we still have to restrict ourselves to a well-typed attacker).

In order to obtain this result, a trick is necessary: if random is generated at the end of the protocol,

ProVerif represents it internally as a function of the previously received messages, including message 6.

This leads to a false attack in which two diﬀerent values of random (generated after receiving diﬀerent

messages 6) are associated to the same Na. This false attack can be eliminated by moving the generation

of random just after the generation of Na.

5.4.3 Full ordering of the messages

We can also show that, if a responder terminates the protocol with a honest initiator, then all mes-

sages of the protocol between the initiator and the responder have been exchanged in the right order.

(We ignore messages sent to or received from the server.) This is shown in the following script (ﬁle

docs/NeedhamSchroederPK-corr-all-messages.pv).

1 ( Qu er ie s )

2 event endB( host , host , pkey , pkey , nonce , nonce ) .

3 event e3 ( host , host , pkey , pkey , nonce , nonce ) .

4 event e2 ( host , host , pkey , pkey , nonce , nonce ) .

5 event e1 ( host , host , pkey , pkey , nonce ) .

5.4. VARIANTS OF THESE SECURITY PROPERTIES 85

7 query y : host , pkx : pkey , pky : pkey , nx : nonce , ny : nonce ;

8 inj−event ( endB (A, y , pkx , pky , nx , ny ) ) ==>

9 ( inj−event ( e3 (A, y , pkx , pky , nx , ny ) ) ==>

10 ( inj−event ( e2 (A, y , pkx , pky , nx , ny ) ) ==>

11 inj −event ( e1 (A, y , pkx , pky , nx ) ) ) ) .

13 ( Role of t he i n i t i a t o r w it h i d e n t i t y xA and s e c r e t key skxA )

14 le t p r o c e s s I n i t i a t o r ( pkS : spkey , skA : skey , skB : skey ) =

15 ( The a t t a c k e r s t a r t s th e i n i t i a t o r by c ho os in g i d e n t i t y xA ,

16 and i t s i n t e r l o c u t o r xB0 .

17 We che ck t h a t xA i s ho ne st ( i . e . i s A or B)

18 and g e t i t s co rr e sp on di n g key .

19 )

20 in ( c , (xA : host , hostX : ho st ) ) ;

21 i f xA = A | | xA = B then

22 l e t skxA = i f xA = A then skA el se skB in

23 l e t pkxA = pk ( skxA ) in

24 ( Real s t a r t o f t he r o l e )

25 ( Message 1 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

26 out ( c , (xA, hostX ) ) ;

27 ( Message 2 )

28 in ( c , ms : b i t s t r i n g ) ;

29 l e t (pkX : pkey , =hostX ) = ch e c ks i gn (ms , pkS ) in

30 ( Message 3 )

31 new Na : nonce ;

32 event e1 (xA, hostX , pkxA , pkX , Na ) ;

33 out ( c , enc ryp t ( ( Na , xA) , pkX ) ) ;

34 ( Message 6 )

35 in ( c , m: b i t s t r i n g ) ;

36 l e t (=Na , NX2: nonce , =hostX ) = de cry pt (m, skA ) in

37 l e t m7 = enc ryp t ( n o n c e t o b i t s t r i n g (NX2) , pkX) in

38 event e3 (xA, hostX , pkxA , pkX , Na , NX2 ) ;

39 ( Message 7 )

40 out ( c , m7 ) .

42 ( Role of t he re spo nde r wi th i d e n t i t y xB and s e c r e t key skxB )

43 le t pr oce ssR esp ond er ( pkS : spkey , skA : skey , skB : skey ) =

44 ( The a t t a c k e r s t a r t s th e resp on der by c ho osi ng i d e n t i t y xB .

45 We che ck t h a t xB i s h on es t ( i . e . i s A or B) . )

46 in ( c , xB : h ost ) ;

47 i f xB = A | | xB = B then

48 l e t skxB = i f xB = A then skA el se skB in

49 l e t pkxB = pk ( skxB ) in

50 ( Real s t a r t o f t he r o l e )

51 ( Message 3 )

52 in ( c , m: b i t s t r i n g ) ;

53 l e t (NY: nonce , hostY : h ost ) = d ecr ypt (m, skxB ) in

54 ( Message 4 : Get t he p u b l i c key c e r t i f i c a t e f o r t h e ot h er h o s t )

55 out ( c , (xB , hostY ) ) ;

56 ( Message 5 )

57 in ( c , ms : b i t s t r i n g ) ;

58 l e t (pkY : pkey ,= hostY ) = c he c k si g n (ms , pkS ) in

59 ( Message 6 )

60 new Nb: nonce ;

61 event e2 ( hostY , xB , pkY , pkxB , NY, Nb ) ;

86 CHAPTER 5. NEEDHAM-SCHROEDER: CASE STUDY

62 out ( c , enc ryp t ( (NY, Nb , xB) , pkY ) ) ;

63 ( Message 7 )

64 in ( c , m3: b i t s t r i n g ) ;

65 i f n o n c e t o b i t s t r i n g (Nb) = de cry pt (m3, skB ) then

66 event endB ( hostY , xB , pkY , pkxB , NY, Nb ) .

68 ( Se rve r )

69 le t pr o ce s sS ( skS : ss ke y ) =

70 in ( c , ( a : host , b : ho st ) ) ;

71 get keys(=b , sb ) in

72 out ( c , s i g n ( ( sb , b ) , skS ) ) .

74 ( Key r e g i s t r a t i o n )

75 le t processK =

76 in ( c , ( h : host , k : pkey ) ) ;

77 i f h <> A && h <> B then inse rt keys (h , k ) .

79 ( S t a r t p r oc e s s )

80 process

81 new skA : skey ; l e t pkA = pk ( skA) in out ( c , pkA ) ; i nsert keys (A, pkA ) ;

82 new skB : s key ; l et pkB = pk ( skB ) in out ( c , pkB ) ; inse rt keys (B, pkB ) ;

83 new skS : s sk ey ; l e t pkS = spk ( skS ) in out ( c , pkS ) ;

84 (

85 ( Launch an unbounded number o f s e s s i o n s o f th e i n i t i a t o r )

86 ( ! p r o c e s s I n i t i a t o r ( pkS , skA , skB ) ) |

87 ( Launch an unbounded number o f s e s s i o n s o f th e re sp ond er )

88 ( ! pr oce ssR esp ond er ( pkS , skA , skB ) ) |

89 ( Launch an unbounded number o f s e s s i o n s o f th e s e r v e r )

90 ( ! p r oc e ss S ( skS ) ) |

91 ( Key r e g i s t r a t i o n pr o c es s )

92 ( ! processK )

93 )

The event endB (Line 66) means that the responder has completed the protocol, e3 (Line 38) that the

initiator received message 6 and sent message 7, e2 (Line 61) that the responder received message 3

and sent message 6, e1 (Line 32) that the initiator sent message 3. These events take as arguments all

parameters of the protocol: the host names, their public keys, and the nonces, except e1 which cannot

take Nb as argument since it has not been chosen yet when e1 is executed. We prove the correspondence

inj −event ( endB (A, y , pkx , pky , nx , ny ) ) ==>

( inj −event ( e3 (A, y , pkx , pky , nx , ny ) ) ==>

( inj −event ( e2 (A, y , pkx , pky , nx , ny ) ) ==>

inj −event ( e1 (A, y , pkx , pky , nx ) ) ) ) .

Chapter 6

Advanced reference

This chapter introduces ProVerif’s advanced capabilities. We provide the complete grammar in Ap-

pendix A.

6.1 Proving correspondence queries by induction

6.1.1 Single query

Consider a correspondence query F ==> F

′

and a process P . As mentioned in Sections 3.2.2 and 4.3.1,

to prove that P satisﬁes the query F ==> F

′

, ProVerif needs to show that, for all traces of P , if F

was executed in the trace, then F

′

was also executed in the trace before F . Intuitively, proving the query

F ==> F

′

by induction consists of proving the above property by induction on the length of the traces

of P .

To simplify the explanation, let us introduce some informal notations. We consider that a trace of

P is a sequence of actions tr = a

. . . a

representing the actions that have been executed in P similarly

to the attack traces (see Section 3.3.2). The length of the trace, denoted |tr|, corresponds to its number

of actions, that is, n. Finally, we say that a fact is executed at step k, denoted F, k ⊢ tr when F is the

action a

in tr. The induction hypothesis P(n) can then be expressed as:

for all traces tr of P , if |tr| ≤ n, then for all k, if F, k ⊢ tr then F

′

, k

′

⊢ tr for some k

′

≤ k.

For ProVerif to prove this property by induction, we only need to prove that P(n) implies P(n + 1) for

all n ∈ N. (Note that P(0) is trivially always true.)

By considering a trace tr = a

. . . a

n+1

and assuming that P(n) holds, we directly obtain that the

sub-trace tr

′

= a

. . . a

satisﬁes P(n). This yields two interesting properties:

We can consider that k = n + 1, otherwise the result would directly hold thanks to tr

′

In the solving procedure, when building the derivations of σF , if we can detect that another

instance of F , say σ

′

F , in the derivation necessarily occurred stricly before σF , then we know by

the induction hypothesis P(n) that σ

′

has been executed before σ

′

F and so before σF .

These two properties are the building blocks of the inductive veriﬁcation of queries in ProVerif:

When generating reachable goals, ProVerif builds Horn clauses with instances of F as a conclusion.

Upon generating a clause of the form H ∧ σ

′

F → σF , ProVerif already knows that this clause represents

an execution of σ

′

F before an execution of σF . ProVerif uses order constraints to infer that σ

′

F was

executed strictly before σF . In this case, the veriﬁcation procedure will add σ

′

to the hypotheses of

the clause, i.e., it replaces the clause H ∧ σ

′

F → σF with the clause H ∧ σ

′

∧ σ

′

F → σF .

Let us illustrate this concept on the small example, available in docs/ex

induction.pv, that is a

simpliﬁed version of the Yubikey protocol [Yub10].

1 free c : chan nel .

2 free k : b i t s t r i n g [ private ] .

3 free d P : ch anne l [ private ] .

88 CHAPTER 6. ADVANCED REFERENCE

4 free d Q : ch anne l [ private ] .

6 fun s enc ( nat , b i t s t r i n g ) : b i t s t r i n g .

7 reduc f o r a l l K: b i t s t r i n g ,M: nat ; sd ec ( se nc (M,K) ,K) = M.

9 event CheckNat ( nat ) .

11 query i : nat ; event ( CheckNat ( i ) ) ==> i s n a t ( i ) .

13 le t P =

14 in ( c , x : b i t s t r i n g ) ;

15 in ( d P , ( i : nat , j : nat ) ) ;

16 l e t j : nat = sd ec ( x , k ) in

17 event CheckNat ( i ) ;

18 event CheckNat ( j ) ;

19 i f j > j

20 then out ( d P , ( i +1, j ) )

21 el se out ( d P , ( i , j ) ) .

23 le t Q =

24 in (d Q , i : nat ) ;

25 out ( c , se nc ( i , k ) ) ;

26 out ( d Q , i +1).

28 process

29 out ( d P , ( 0 , 0 ) ) | out ( d Q , 0 ) | ! P | ! Q

In this protocol, the processes P and Q share a private key k and they both have a memory cell

respectively represented by the private channels d P and d Q. Every time the process Q increments the

value stored in its memory cell, it also outputs the previous value encrypted with the shared key k, i.e.

out(c,senc(i ,k)). On the other hand, the process P stores in its memory cell two values : the number of

time it received a fresh encryption from Q, represented by i :nat in in(d P,(i :nat, j :nat)) and the last

value it received from Q, represented by j :nat.

We aim to prove that the values of the memory cell of P are always natural numbers, which is

represented by the query:

query i : nat ; event ( CheckNat ( i ) ) ==> i s n a t ( i ) .

However, verifying this protocol with ./proverif docs/ex induction.pv | grep "RES" produces

the following output:

RESULT event ( CheckNat ( i 2 ) ) ==> i s n a t ( i 2 ) cannot be proved .

If we look more closely at the output, we can observe that ProVerif considers the following reachable

goal

i s n o t n a t ( i 2 + 1) && j 1 ≥ j 2 + 1 && mess( d P [ ] , ( i 2 , j 2 ) ) &&

mess( d Q [ ] , j 1 ) && mess( d Q [ ] , j 1 ) −> end ( CheckNat ( i 2 + 1 ) )

To ensure termination, ProVerif avoids resolving upon facts that would lead to trivial inﬁnite loops. This

is the case for the facts representing the memory cells, which are mess(d P[],(i 2, j 2 )), mess(d Q[],j 1),

and mess(d Q[],j 1), so resolution stops with the clause above. Since the clause contradicts the query,

ProVerif concludes that it cannot prove the query.

By adding the option induction after the query as follows

query i : nat ; event ( CheckNat ( i ) ) ==> i s n a t ( i ) [ induction ] .

ProVerif would initially generate the following reachable goal:

j 1 ≥ j 2 + 1 && be gin ( CheckNat ( j 2 ) ) && be gin ( CheckNat ( i 2 ) ) &&

mess( d P [ ] , ( i 2 , j 2 ) ) && mess ( d Q [ ] , j 1 ) −> end ( CheckNat ( i 2 + 1 ) )

6.1. PROVING CORRESPONDENCE QUERIES BY INDUCTION 89

Furthermore, ProVerif understands that the event CheckNat(i 2) occurs strictly before CheckNat(i 2 + 1).

By applying the induction hypothesis on CheckNat(i 2), it adds is nat ( i 2 ) in the hypotheses of the

clause, yielding

i s n a t ( i 2 ) && j 1 ≥ j 2 + 1 && be gin ( CheckNat ( j 2 ) ) && be gin ( CheckNat ( i 2 ) ) &&

mess( d P [ ] , ( i 2 , j 2 ) ) && mess ( d Q [ ] , j 1 ) −> end ( CheckNat ( i 2 + 1 ) )

Since this clause does not contradict the query, ProVerif is able to prove the query: Verifying this protocol

with ./proverif docs/ex induction proof.pv | grep "RES" produces the output

RESULT event ( CheckNat ( i 2 ) ) ==> i s n a t ( i 2 ) i s tr ue .

Remark. When the setting inductionQueries is set to true, all queries are proved by induction. In

such a case, one can use the option [noInduction] on one speciﬁc query to enforce that it is not proved

by induction.

6.1.2 Group of queries

Queries may also be stated in the form:

query x

: t

, . . . , x

: t

; q

; . . . ; q

where each q

is a query as deﬁned in Figure 4.3. Furthermore, it is also possible to prove a group of

queries by induction. However the output of ProVerif diﬀers from proving a single query by induction.

Coming back to our previous example, we would additionally prove that the values stored in the memory

cell Q and the value of j in P are also natural numbers. The input ﬁle docs/ex induction group.pv

partially displayed here integrates such queries.

9 event CheckNat ( nat ) .

10 event CheckNatQ ( nat ) .

12 query i : nat ;

13 event ( CheckNat ( i ) ) ==> i s n a t ( i ) ;

14 event ( CheckNatQ ( i ) ) ==> i s n a t ( i ) ;

15 mess( d Q , i ) ==> i s n a t ( i ) [ induction ] .

17 le t P =

18 in ( c , x : b i t s t r i n g ) ;

19 in ( d P , ( i : nat , j : nat ) ) ;

20 l e t j : nat = sd ec ( x , k ) in

21 event CheckNat ( i ) ;

22 event CheckNat ( j ) ;

23 event CheckNatQ ( j ) ;

24 i f j > j

25 then out ( d P , ( i +1, j ) )

26 el se out ( d P , ( i , j ) ) .

Verifying this protocol with ./proverif docs/ex induction group.pv | grep "RES" produces the

following output:

PARTIAL RESULT event ( CheckNat ( i 2 ) ) ==> i s n a t ( i 2 ) i s tr ue i f