Lecture 17: Lock invariants

10/3/24lectureliterateAbout 1685 words

Lecture 17: Lock invariants

Follow these notes in Coq at src/sys_verif/program_proof/invariants.v.

In this lecture we'll introduce concurrent separation logic and lock invariants, our first tool for reasoning about concurrent programs.

Learning outcomes

Understand how concurrent separation logic extends sequential separation logic.
Recall the rules for using lock invariants.

%% basic math \gdef\intersect{\cap} \gdef\union{\cup} \gdef\dom{\operatorname{dom}} \gdef\disjoint{\mathrel{\bot}} \gdef\finto{\overset{\text{fin}}{\rightharpoonup}} \gdef\listapp{\mathbin{+\mkern-10mu+}} \gdef\bool{\operatorname{bool}} \gdef\box{\Box} \gdef\iProp{\operatorname{iProp}} \gdef\Prop{\operatorname{Prop}} %% language \gdef\ife#1#2#3{\text{\textbf{if} } #1 \text{ \textbf{then} } #2 \text{ \textbf{else} } #3} \gdef\lete#1#2{\text{\textbf{let} } #1 := #2 \text{ \textbf{in} }} \gdef\letV#1#2{&\text{\textbf{let} } #1 := #2 \text{ \textbf{in} }} \gdef\num#1{\overline{#1}} \gdef\true{\mathrm{true}} \gdef\false{\mathrm{false}} \gdef\skip{\mathrm{skip}} \gdef\fun#1{\lambda #1.\,} \gdef\funblank{\fun{\_}} \gdef\rec#1#2{\text{\textbf{rec} } #1 \; #2.\;\,} \gdef\app#1#2{#1 \, #2} \gdef\then{;\;} \gdef\assert#1{\operatorname{assert} \, #1} \gdef\val{\mathrm{val}} \gdef\purestep{\xrightarrow{\text{pure}}} \gdef\spawn{\text{\textbf{spawn}}} %% hoare logic \gdef\False{\mathrm{False}} \gdef\True{\mathrm{True}} \gdef\hoare#1#2#3{\left\{#1\right\} \, #2 \, \left\{#3\right\}} \gdef\hoareV#1#2#3{\begin{aligned}% &\left\{#1\right\} \\ &\quad #2 \\ &\left\{#3\right\}% \end{aligned}} \gdef\wp{\operatorname{wp}} \gdef\outlineSpec#1{\left\{#1\right\}} \gdef\entails{\vdash} \gdef\bient{\dashv\vdash} \gdef\eqnlabel#1{\:\:\text{#1}} \gdef\lift#1{\lceil #1 \rceil} %% separation logic % imperative constructs \gdef\load#1{{!}\,#1} \gdef\store#1#2{#1 \mathbin{\gets} #2} \gdef\free#1{\operatorname{free} \, #1} \gdef\alloc#1{\operatorname{alloc} \, #1} % logic \gdef\sep{\mathbin{\raisebox{1pt}{$\star$}}} %% Iris actually uses \ast (6-pointed) not \star (5-pointed) %\gdef\sep{\mathbin{\ast}} \gdef\bigsep{\mathop{\vcenter{\LARGE\hbox{$\star$}}}} \gdef\bigast{\mathop{\vcenter{\LARGE\hbox{$\ast$}}}} \gdef\wand{\mathbin{\raisebox{1pt}{$-\hspace{-0.06em}\star$}}} \gdef\emp{\mathrm{emp}} \gdef\pointsto{\mapsto} \gdef\Heap{\mathrm{Heap}} \gdef\Loc{\mathrm{loc}} \gdef\valid{\text{\checkmark}} \gdef\pcore{\operatorname{pcore}} \gdef\errorval{\bot} \gdef\option{\operatorname{option}} \gdef\Some{\operatorname{Some}} \gdef\None{\operatorname{None}} \gdef\own{\operatorname{own}} %% concurrent separation logic \gdef\isLock{\mathrm{isLock}} % \pvs is the iris.sty macro for a basic update modality (without the super dot) \gdef\pvs{\mathord{{{\mid\kern-0.5ex\Rrightarrow\kern-0.25ex}}\kern0.2ex}} \gdef\vs{\Rrightarrow} \gdef\vsWand{\displaystyle\equiv\kern-1.6ex-\kern-1.5ex\smash{\bigast}\kern-0.2ex} % frame-preserving updates (macro matches iris.sty) \gdef\mupd{\rightsquigarrow}

Motivation

We have concurrent programs with go, we modeled them with $\spawn$ , now how do we prove something about them?

Concurrent separation logic (CSL) extends separation logic to handle this new language where multiple threads can be running. What do we need to adapt? We need a way to reason about the new spawn construct, and we need to adapt the soundness theorem to talk about the new threadpool semantics.

Soundness

Let's start with soundness:

Definition (CSL soundness): For some pure $Φ(v)$ (a Prop), if $\hoare{P}{e}{\fun{v} Φ(v)}$ and $([e], h) \leadsto_{tp} ([e'] \listapp T, h')$ , then if $e'$ is an expression then $([e'] \listapp T, h')$ is not stuck, or $e' = v'$ for some value $v'$ and $Φ(v')$ holds. Furthermore, no thread in $T$ is stuck in $h'$ .

This should look familiar to the definition of pure soundness for sequential separation logic. We only use the threadpool semantics, and describe the return value of the main thread. The spawned threads are mostly ignored but we do state that none of them is stuck.

Exercise: soundness for spawned threads

Suppose we omitted the last sentence of the soundness theorem, and defined $(T, h)$ to be stuck if no threads could take a step. What program and specification $\hoare{P}{e}{Q}$ would be true under the alternate definition that wasn't with the real definition? Why does this motivate the stronger definition of soundness that we're actually using?

Reasoning about spawn

The rule for reasoning about spawn is deceptively simple:

\hoareV{\wp(e, \True)}{\spawn \, e}{\fun{v} \lift{v = ()}} \eqnlabel{wp-spawn}

Let's see a derived rule that's a little easier to explain:

\dfrac{\hoare{P}{e'}{Q}}{ \hoare{\wp(e, \True) ∗ P}{\spawn \, e \then e'}{Q} }

Notice how we go from proving something about $\spawn \, e \then e'$ to proving a regular triple $\hoare{P}{e'}{Q}$ for the code after the spawn. To do so, we need to separately prove that (a) $e$ is safe to run, with just the postcondition $\True$ , and (b) establish the precondition for the rest of the code $e'$ .

The proof of $\wp(e, \True)$ will in general consume some of the resources available, whatever should be owned initially by the background thread. These are basically lost, since the spawned thread never needs to "join" with the parent (unlike a more complex thread-creation mechanism), but we will later see how the spawned thread can communicate with the parent.

The resources $P$ need to be proven right away, unlike if we were verifying $e; e'$ , since the scheduler could certainly choose to run $e'$ next (partly or even to completion). The postcondition of $Q$ makes sense for the whole construct because after spawning $e'$ takes over, and it establishes the postcondition $Q$ .

Let's see this in action.

From sys_verif.program_proof Require Import prelude empty_ffi.
From Goose.sys_verif_code Require Import concurrent.

Section goose.
Context `{hG: !heapGS Σ}.

Let N := nroot .@ "lock".

Lemma wp_SetX (x_l: loc) (x: w64) :
  {{{ x_l ↦[uint64T] #x }}}
    SetX #x_l
  {{{ RET #(); x_l ↦[uint64T] #(W64 1) }}}.
Proof.
  wp_start as "x".
  wp_store.
  iModIntro. iApply "HΦ". iFrame.
Qed.

Lemma wp_NoGo :
  {{{ True }}}
    NoGo #()
  {{{ RET #(); True }}}.
Proof.
  wp_start as "_".
  wp_alloc x_l as "x".
  wp_pures.
  wp_apply (wp_SetX with "[$x]").
  iIntros "x".

Goal

goal 1
  Σ : gFunctors
  hG : heapGS Σ
  N := nroot.@"lock" : namespace
  Φ : val → iPropI Σ
  x_l : loc
  ============================
  "HΦ" : True -∗ Φ #()
  "x" : x_l ↦[uint64T] #(W64 1)
  --------------------------------------∗
  WP #();; #() {{ v, Φ v }}

  wp_pures.
  iModIntro. iApply "HΦ". done.
Qed.

Lemma wp_FirstGo :
  {{{ True }}}
    FirstGo #()
  {{{ RET #(); True }}}.
Proof.
  wp_start as "_".
  wp_alloc x_l as "x".
  (* The actual GooseLang construct for creating threads is called Fork. The
  specification for Fork is equivalent to the wp-spawn above, but is written in
  continuation-passing style. *)
  wp_apply (wp_fork with "[x]").
  { iModIntro.
    wp_apply (wp_SetX with "[$x]"). iIntros "x".

Goal

goal 1
  Σ : gFunctors
  hG : heapGS Σ
  N := nroot.@"lock" : namespace
  Φ : val → iPropI Σ
  x_l : loc
  ============================
  "x" : x_l ↦[uint64T] #(W64 1)
  --------------------------------------∗
  True

    done.
  }
  wp_pures.
  iModIntro. iApply "HΦ". done.
Qed.

Lock invariants

Recall the API for mutexes:

new(sync.Mutex) // to create a new lock
func (m *sync.Mutex) Lock()
func (m *sync.Mutex) Unlock()

We can use mutexes (also commonly called locks) to ensure that critical sections of our code run atomically.

The way to reason about locked code in separation logic is via lock invariants. The intuition is that the program uses a lock to protect some memory, which will only be accessed with the lock held. We translate this idea to separation logic by associating a separation logic proposition called a lock invariant with each mutex. The proposition includes any memory protected by the lock; it can include any other separation logic propositions as well, which is what makes lock invariants both interesting and useful.

So what are the rules for lock invariants? The basic idea for the lock invariant $R$ associated with some mutex $\ell_m$ (which we're naming by the location of the pointer to that mutex) is that it is a separation logic assertion that holds whenever the lock is free. Because it holds when the lock is free, when a thread initially acquires the lock, it gets to assume $R$ . Separation logic assertions are in general not duplicable, but because of mutual exclusion, a thread that acquires a mutex gets full ownership over $R$ . However, when it wants to unlock the same mutex, it has to give up ownership over $R$ .

Formally, we have the following specification for Lock and Unlock:

\hoare{\isLock(\ell_m, R)}{\operatorname{Lock} \, \ell_m}{R} \\ \hoare{\isLock(\ell_m, R) ∗ R}{\operatorname{Unlock} \, \ell_m}{\True}

To initially get $\isLock(\ell_m, R)$ , which associates the lock invariant $R$ with the lock $\ell_m$ , we have to use the following rule:

\hoareV{R}{\operatorname{newMutex} \, ()}{\fun{v} ∃ \ell_m, v = \ell_m ∗ \isLock(\ell_m, R)}

When we create a new mutex, we pick the lock invariant $R$ that represents what the mutex protects, and we also have to prove and give up $R$ . This is what ensures the lock invariant holds initially.

An important aspect of this specification is that $\isLock(\ell_m, R)$ is persistent. This is needed since for mutexes to be useful, $\isLock(\ell_m, R)$ has to be available from multiple threads simultaneously. The fact that it is persistent also explains why we don't return it in the Lock and Unlock postconditions. Note that the assertion $\isLock(\ell_m, R)$ can safely be persistent even if $R$ is not persistent because it merely asserts that the lock invariant for the mutex $\ell_m$ is $R$ ; to actually get a copy of $R$ , the thread has to call Lock, and the implementation of mutexes guarantees mutual exclusion at that point.

Exercise: Suppose we could somehow acquire $\isLock(\ell_m, R_1) ∗ \isLock(\ell_m, R_2)$ (notice these are the same mutex pointer), for arbitrarily chosen $R_1$ and $R_2$ . What could go wrong?

Let's see our first example of using locks with Goose.

Code being verified:

func FirstLock() uint64 {
	var x uint64
	m := new(sync.Mutex)
	go func() {
		m.Lock()
		x = 1
		m.Unlock()
	}()
	m.Lock()
	y := x
	m.Unlock()
	return y
}

Let's try a first proof that just shows this code is safe. Even with no interesting postcondition, the GooseLang model requires us to prove in this example that there are no race conditions on x; due to weak memory considerations, it isn't quite sound to model loads and stores of even a single variable as being atomic. The mutex in this example ensures the absence of races.

Lemma wp_FirstLock_v1 :
  {{{ True }}}
    FirstLock #()
  {{{ (y: w64), RET #y; True }}}.
Proof.
  wp_start as "_".
  wp_alloc x_l as "x". wp_pures.
  wp_apply (wp_newMutex N _ (∃ (y: w64), x_l ↦[uint64T] #y)%I
           with "[x]").
  { iFrame. }
  iIntros (m_l) "#Hlock".
  wp_pures.
  wp_apply wp_fork.
  { wp_apply (wp_Mutex__Lock).
    { iExact "Hlock". }
    iIntros "[Hlocked Hinv]".

Goal

goal 1
  Σ : gFunctors
  hG : heapGS Σ
  N := nroot.@"lock" : namespace
  Φ : val → iPropI Σ
  x_l, m_l : loc
  ============================
  "Hlock" : is_lock N #m_l (∃ y : w64, x_l ↦[uint64T] #y)
  --------------------------------------□
  "Hlocked" : lock.locked #m_l
  "Hinv" : ∃ y : w64, x_l ↦[uint64T] #y
  --------------------------------------∗
  WP #();; #x_l <-[uint64T] #(W64 1);; Mutex__Unlock #m_l {{ _, True }}

After calling Lock, the lock invariant appears in our hypotheses.

    iNamed "Hinv".
    wp_store.
    wp_apply (wp_Mutex__Unlock with "[Hlocked Hinv]").
    { iFrame "Hlock Hlocked".

To call Unlock, we need to prove the same lock invariant.

      iModIntro.

Goal

goal 1
  Σ : gFunctors
  hG : heapGS Σ
  N := nroot.@"lock" : namespace
  Φ : val → iPropI Σ
  x_l, m_l : loc
  y : w64
  ============================
  "Hlock" : is_lock N #m_l (∃ y0 : w64, x_l ↦[uint64T] #y0)
  --------------------------------------□
  "Hinv" : x_l ↦[uint64T] #(W64 1)
  --------------------------------------∗
  ∃ y0 : w64, x_l ↦[uint64T] #y0

      iFrame. }
    done. }
  wp_pures.
  wp_apply (wp_Mutex__Lock with "[$Hlock]"). iIntros "[Hlocked Hinv]". iNamed "Hinv".
  wp_load.
  wp_apply (wp_Mutex__Unlock with "[$Hlock $Hlocked $Hinv]").
  wp_pures.
  iModIntro.
  iApply "HΦ". done.
Qed.

Lemma wp_FirstLock_v2 :
  {{{ True }}}
    FirstLock #()
  {{{ (y: w64), RET #y; ⌜uint.Z y = 0 ∨ uint.Z y = 1⌝ }}}.
Proof.
  wp_start as "_".
  wp_alloc x_l as "x". wp_pures.
  wp_apply (wp_newMutex N _ (∃ (y: w64),
                  "x" :: x_l ↦[uint64T] #y ∗
                  "%Hx" :: ⌜uint.Z y = 0 ∨ uint.Z y = 1⌝)%I
           with "[x]").
  { iFrame. iPureIntro. left; word. }
  iIntros (m_l) "#Hlock".
  wp_pures.
  wp_apply wp_fork.
  { wp_apply (wp_Mutex__Lock).
    { iExact "Hlock". }
    iIntros "[Hlocked Hinv]".
    iNamed "Hinv".
    wp_store.
    wp_apply (wp_Mutex__Unlock with "[Hlocked x]").
    { iFrame "Hlock Hlocked".
      iModIntro.
      iFrame.
      iPureIntro. right; word. }
    done. }
  wp_pures.
  wp_apply (wp_Mutex__Lock with "[$Hlock]"). iIntros "[Hlocked Hinv]". iNamed "Hinv".
  wp_load.
  wp_pures.
  wp_apply (wp_Mutex__Unlock with "[$Hlock $Hlocked $x]").
  { iPureIntro. auto. }
  wp_pures.
  iModIntro.
  iApply "HΦ". iPureIntro. done.
Qed.

End goose.