Interactive Proof Systems

The Verifier and the Prover

Interactive proof systems provide a probabilistic analog of the class $NP$ , similar to how probabilistic polynomial time algorithms provide an analog to $P$

These have important implications for cryptography and approximation algorithms

Recall that the languages in $NP$ are those whose members all have easily verifiable certificates of membership

We reformulate this concept as a Prover with unlimited time and a Verifier with polynomially bounded time

Informally, an interactive proof system takes this formulation, allows for two-way dialog, and lets the Verifier be a probabilistic polynomial time machine

This kind of system can decide problems such as the complement of $SAT$ (within the degree of accuracy we specify)

Example:
$ISO = \{\langle G, H \rangle \mid G \text{ and } H \text{ are isomorphic graphs}\}$ is in $NP$, but it has been proven neither to be in $P$ nor to be $NP$-complete

The complementary language $NONISO$ is not even known to be in $NP$

However our setup enables a Prover to convince a Verifier that two graphs aren’t isomorphic with a very simple scheme

The Verifier randomly selects either $G_{1}$ or $G_{2}$ , randomly reorders its nodes to produce $H$ , and asks the Prover whether $G_{1}$ or $G_{2}$ is the source of $H$

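With unlimited computational power, the Prover can always identify the source of $H$ when the graphs are nonisomorphic; if they are isomorphic, $H$ is equally likely to have come from either graph, so the Prover can only guess and is correct with probability $1/2$ per trial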

Therefore, if the Prover answers correctly over many trials, the Verifier has convincing evidence that the graphs are nonisomorphic
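The exchange above can be simulated on small graphs. The following is a minimal sketch, not a definitive implementation: the function names (`permute`, `isomorphic`, `prover_answer`, `run_trials`) and the edge-set representation are assumptions made for illustration, and brute force over all permutations stands in for the Prover's unlimited power.

```python
import itertools
import random

def graph(pairs):
    """Undirected graph as a frozenset of 2-element frozensets."""
    return frozenset(frozenset(p) for p in pairs)

def permute(edges, perm):
    """Relabel every vertex v as perm[v]."""
    return frozenset(frozenset(perm[v] for v in e) for e in edges)

def isomorphic(a, b, n):
    """Brute-force isomorphism test on n vertices (exponential time)."""
    return any(permute(a, p) == b for p in itertools.permutations(range(n)))

def prover_answer(g1, g2, h, n, rng):
    """Return 1 or 2: the graph the Prover claims H was built from."""
    from_g1 = isomorphic(g1, h, n)
    from_g2 = isomorphic(g2, h, n)
    if from_g1 and from_g2:
        return rng.choice([1, 2])  # H reveals nothing, so the Prover guesses
    return 1 if from_g1 else 2

def run_trials(g1, g2, n, trials, seed=0):
    """Verifier accepts only if the Prover is right in every trial."""
    verifier_rng = random.Random(seed)
    prover_rng = random.Random(seed + 1)
    for _ in range(trials):
        choice = verifier_rng.choice([1, 2])
        perm = list(range(n))
        verifier_rng.shuffle(perm)
        h = permute(g1 if choice == 1 else g2, perm)
        if prover_answer(g1, g2, h, n, prover_rng) != choice:
            return False
    return True

path = graph([(0, 1), (1, 2), (2, 3)])
star = graph([(0, 1), (0, 2), (0, 3)])
relabeled_path = graph([(1, 0), (0, 3), (3, 2)])

print(run_trials(path, star, 4, 20))            # nonisomorphic pair
print(run_trials(path, relabeled_path, 4, 20))  # isomorphic pair
```

For the nonisomorphic pair the Prover is never wrong, so the Verifier accepts; for the isomorphic pair the Prover survives 20 trials only with probability $2^{-20}$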

Definition:

The Verifier is a function $V(w, r, m_1 \# \dots \# m_i) = m_{i+1}$ that computes its next transmission (or accept/reject) from the input string, a random string (for convenience), and the message history
The Prover is a function $P(w, m_1 \# \dots \# m_i) = m_{i+1}$ that computes its next transmission from the input string and the message history
We write the interaction between the Prover and the Verifier as $V \leftrightarrow P$ and write $(V \leftrightarrow P)(w, r) = \text{accept}$ if a message sequence $m_1, \dots, m_k$ exists such that, for $0 \leq i < k$, $V(w, r, m_1 \# \dots \# m_i) = m_{i+1}$ for even $i$, $P(w, m_1 \# \dots \# m_i) = m_{i+1}$ for odd $i$, and $m_k = \text{accept}$

$\Pr[V \leftrightarrow P \text{ accepts } w] = \Pr[(V \leftrightarrow P)(w, r) = \text{accept}]$

We also assume that the length of the Verifier’s random input, the length of each message exchanged, and the total number of messages are all at most $p(n)$ for some polynomial $p$ that depends only on the Verifier

Definition: $A \in IP$ if some polynomial time computable function $V$ exists such that for some function $P$ and for every function $\tilde{P}$, for every string $w$, $w \in A \implies \Pr[V \leftrightarrow P \text{ accepts } w] \geq \frac{2}{3}$ and $w \notin A \implies \Pr[V \leftrightarrow \tilde{P} \text{ accepts } w] \leq \frac{1}{3}$

The second clause guarantees the Verifier has true evidence of $w \in A$ and can’t rely on the power of the Prover

Note that the specific constants chosen in this definition are arbitrary (and we may amplify the probability by repetition)
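To see how quickly repetition shrinks the error, the following sketch computes the probability that a majority vote over $k$ independent runs is wrong when each run errs with probability $1/3$. The function name `majority_error` is an assumption made for illustration, and the majority-vote rule is one standard way to amplify, not necessarily the only one.

```python
from math import comb

def majority_error(p_err, k):
    """Probability that more than half of k independent runs are wrong,
    when each run errs independently with probability p_err (k odd)."""
    return sum(comb(k, i) * p_err**i * (1 - p_err)**(k - i)
               for i in range(k // 2 + 1, k + 1))

for k in (1, 3, 11, 51, 101):
    print(k, majority_error(1 / 3, k))
```

With $k = 3$ the error is already $7/27 < 1/3$, and it decays exponentially in $k$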

$IP$ clearly contains both $NP$ and $BPP$, and it also contains $NONISO$, which is not known to be in either

IP = PSPACE

$IP$ is remarkably powerful, and in fact is proven to be equal in power to $PSPACE$

Theorem: $IP = PSPACE$

We divide this into two parts

IP ⊆ PSPACE

Let $A$ be a language in $IP$ , with Verifier $V$ that exchanges $p = p (n)$ messages on input $w$ with length $n$

We construct a $PSPACE$ machine that simulates $V$, calculating for any string $w$ the probability $\Pr[V \text{ accepts } w] = \max_{P} \Pr[V \leftrightarrow P \text{ accepts } w]$

Let $M_{j}$ denote the message history $m_{1} # \dots # m_{j}$

We write $(V \leftrightarrow P)(w, r, M_j) = \text{accept}$ if we can extend $M_j$ with messages that lead to an accept (maintaining that the messages in $M_j$ themselves are consistent)

We write $\Pr_r$ for the probability taken over all strings $r$ consistent with $M_j$,
$\Pr[V \leftrightarrow P \text{ accepts } w \text{ starting at } M_j] = \Pr_r[(V \leftrightarrow P)(w, r, M_j) = \text{accept}]$
$\Pr[V \text{ accepts } w \text{ starting at } M_j] = \max_{P} \Pr[V \leftrightarrow P \text{ accepts } w \text{ starting at } M_j]$

We define our final probability inductively, going from $j = p$ to 0

$N_{M_p} = 1$ if $M_p$ is consistent and $m_p = \text{accept}$, and 0 otherwise
For $j < p$, $N_{M_j} = \begin{cases} \max_{m_{j+1}} N_{M_{j+1}} & \text{odd } j < p \\ \text{wt-avg}_{m_{j+1}} N_{M_{j+1}} & \text{even } j < p \end{cases}$

We define $\text{wt-avg}_{m_{j+1}} N_{M_{j+1}} = \sum_{m_{j+1}} (\Pr_r[V(w, r, M_j) = m_{j+1}] \cdot N_{M_{j+1}})$

First, note that we can calculate $N_{M_{0}}$ in polynomial space, by recursively calculating $N_{M_{j}}$ for every $j$ and $M_{j}$ , since our messages take polynomial space

Second, $N_{M_0} = \Pr[V \text{ accepts } w]$ is our target value

To show this is true, we inductively prove $N_{M_j} = \Pr[V \text{ accepts } w \text{ starting at } M_j]$

The base case $j = p$ is trivial

Assuming our claim holds for $j + 1 \leq p$ and any message stream $M_{j + 1}$ , we prove it is true for $j$ and any $M_{j}$

If $j$ is even, $m_{j+1}$ is a message from $V$ to $P$,
$N_{M_j} = \sum_{m_{j+1}} (\Pr_r[V(w, r, M_j) = m_{j+1}] \cdot N_{M_{j+1}})$
$= \sum_{m_{j+1}} (\Pr_r[V(w, r, M_j) = m_{j+1}] \cdot \Pr[V \text{ accepts } w \text{ starting at } M_{j+1}])$
$= \Pr[V \text{ accepts } w \text{ starting at } M_j]$

If $j$ is odd, $m_{j+1}$ is a message from $P$ to $V$,
$N_{M_j} = \max_{m_{j+1}} N_{M_{j+1}} = \max_{m_{j+1}} \Pr[V \text{ accepts } w \text{ starting at } M_{j+1}]$
$= \Pr[V \text{ accepts } w \text{ starting at } M_j]$

This last equality holds because we are looking at probabilities maximizing over all possible Provers
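The recursion for $N_{M_j}$ can be sketched on a toy message tree. This is only an illustration under stated assumptions: the nested-list encoding of histories and the name `value` are hypothetical, and the Verifier's message probabilities are given explicitly rather than derived from a real $V$.

```python
from fractions import Fraction

def value(node, j=0):
    """N_{M_j} for the message history represented by `node`.

    A leaf is 1 (consistent history ending in accept) or 0.
    At even j the node is a list of (probability, subtree) pairs, one per
    possible Verifier message; at odd j it is a list of subtrees, one per
    possible Prover message."""
    if isinstance(node, int):
        return Fraction(node)
    if j % 2 == 0:
        # Verifier's move: weighted average over its next message
        return sum(Fraction(pr) * value(child, j + 1) for pr, child in node)
    # Prover's move: the best Prover picks the maximizing message
    return max(value(child, j + 1) for child in node)

# Two messages in total (p = 2): the Verifier sends one of two challenges
# with probability 1/2 each, then the Prover chooses a reply.
tree = [
    (Fraction(1, 2), [1, 0]),  # some reply leads to accept
    (Fraction(1, 2), [0, 0]),  # no reply leads to accept
]
print(value(tree))  # 1/2
```

The recursion is depth-first, so only the current path of at most $p$ messages is stored at any time, which is why polynomial space suffices even though the tree has exponentially many nodes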

The Counting Problem

Before showing the proof for $PSPACE \subseteq IP$ , we begin with a weaker result that illustrates the technique

The counting problem for satisfiability is the language $\#SAT = \{\langle ϕ, k \rangle \mid ϕ \text{ is a cnf-formula with exactly } k \text{ satisfying assignments}\}$

Theorem: $\#SAT \in IP$

Arithmetization associates $ϕ$ with a polynomial $p (x_{1}, \dots, x_{m})$ , where $p$ is designed to mimic $ϕ$ on Boolean assignments

$α \land β ⟹ α β$
$\neg α ⟹ 1 - α$
$α \lor β ⟹ α * β = 1 - (1 - α) (1 - β)$

The degree of any variable is at most $n$ , the length of $ϕ$
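The three rules can be applied mechanically to a cnf-formula. The sketch below is an illustration under assumptions: the clause encoding (signed integers, so `3` means $x_3$ and `-3` means $\neg x_3$) and the name `arithmetize` are choices made here, not part of the notes.

```python
from itertools import product

def arithmetize(cnf):
    """Return the polynomial p for a CNF given as a list of clauses,
    each clause a list of nonzero signed ints."""
    def p(*x):
        result = 1
        for clause in cnf:
            all_false = 1
            for lit in clause:
                v = x[abs(lit) - 1]
                term = v if lit > 0 else 1 - v   # NOT a  ->  1 - a
                all_false *= 1 - term
            result *= 1 - all_false              # OR: 1 - (1-a)(1-b); AND: product
        return result
    return p

def truth(cnf, x):
    """Ordinary Boolean evaluation, for comparison."""
    return all(any((x[abs(l) - 1] == 1) == (l > 0) for l in c) for c in cnf)

phi = [[1, -2], [2, 3]]          # (x1 OR NOT x2) AND (x2 OR x3)
p = arithmetize(phi)

for a in product((0, 1), repeat=3):
    assert p(*a) == int(truth(phi, a))   # p mimics phi on Boolean inputs

print(sum(p(*a) for a in product((0, 1), repeat=3)))  # 4 satisfying assignments
print(p(2, 3, 5))                                     # -28: a non-Boolean input
```

On Boolean inputs $p$ is 0 or 1, so summing it counts satisfying assignments; on other inputs it is simply a polynomial value with no logical meaning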

It’s not clear how we’d interpret assigning non-Boolean values to the variables, but the proof does exactly this, just as in the proof that $EQ_{ROBP} \in BPP$

This technique is powerful because while two different formulas can agree on many points, two different polynomials with limited degree can only agree on a small set of points

This idea becomes clearer when we write out and verify the protocol

We look at a finite field $F$ with $q > 2^{n}$ elements

Let $f_i(a_1, \dots, a_i) = \sum_{a_{i+1}, \dots, a_m} p(a_1, \dots, a_m)$ be the number of valid assignments with $a_1, \dots, a_i \in F$ fixed and $a_{i+1}, \dots, a_m \in \{0, 1\}$

$f_{0} ()$ is the number of satisfying assignments of $ϕ$ , and each $f_{i} (x_{1}, \dots, x_{i})$ can be expressed as a polynomial in $x_{1}$ through $x_{i}$ , with degrees at most that of $p$

In Phase 0: $P$ sends $f_0()$ to $V$, and $V$ rejects if $k \neq f_0()$

In Phase 1: $P$ sends the coefficients of $f_1(z)$ as a polynomial in $z$, and $V$ rejects if $f_0() \neq f_1(0) + f_1(1)$
$V$ sends over a random $r_1$ from $F$

In Phase 2: $P$ sends the coefficients of $f_2(r_1, z)$ as a polynomial in $z$, and $V$ rejects if $f_1(r_1) \neq f_2(r_1, 0) + f_2(r_1, 1)$
$V$ sends over a random $r_2$ from $F$

This continues through Phase $m$; in Phase $m + 1$, $V$ directly checks that $p(r_1, \dots, r_m) = f_m(r_1, \dots, r_m)$, accepting if so and rejecting otherwise

If $ϕ$ does have $k$ satisfying assignments, then $V$ clearly accepts

The tricky part is understanding why $\tilde{P}$ (a crooked Prover) cannot deceive $V$ if $ϕ$ doesn’t have $k$ assignments

To prevent $V$ from rejecting outright, $\tilde{P}$ sends an incorrect $\tilde{f}_{0} ()$

This lie immediately propagates, since one of the values $V$ calculates for $f_{1} (0)$ and $f_{1} (1)$ must be incorrect if $V$ is not to reject, so $\tilde{P}$ must send an incorrect $\tilde{f}_{1} (z)$

The key in our construction is showing that the lie must continue to propagate, i.e. $\tilde{f}_{1} (r_{1})$ is unlikely to equal $f_{1} (r_{1})$

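Since two distinct polynomials of degree at most $n$ over a field can agree on at most $n$ points, the probability that they happen to agree at the random point $r_1$ is $\leq n / 2^n < n^{-2}$ (for $n \geq 10$)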

In the general case, this means that if $\tilde{f}_{i-1}(r_1, \dots, r_{i-1}) \neq f_{i-1}(r_1, \dots, r_{i-1})$, then for $n \geq 10$ and for $r_i$ chosen at random in $F$, $\Pr[\tilde{f}_i(r_1, \dots, r_i) = f_i(r_1, \dots, r_i)] \leq n^{-2}$

The probability that $\tilde{P}$ gets lucky at some point is at most the number of phases $m$ times $n^{- 2}$ , or at most $1/ n$

Otherwise, $V$ will catch the incorrect value of $f_{m}$ by calculating it directly in Phase $m + 1$
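The whole protocol can be sketched with an honest Prover on a tiny formula. Several choices here are assumptions made for illustration and are not from the notes: the field is the integers modulo the prime $2^{61} - 1$, the Prover sends each polynomial as its values at $z = 0, 1, \dots, d$ instead of its coefficients (the two determine each other when the degree is at most $d$), and all function names are hypothetical. The honest Prover computes its sums by brute force.

```python
import random
from itertools import product

Q = 2**61 - 1  # a prime, so arithmetic mod Q is a field

def poly(cnf, x):
    """Arithmetization p(x) of a CNF (clauses of signed ints), mod Q."""
    result = 1
    for clause in cnf:
        all_false = 1
        for lit in clause:
            v = x[abs(lit) - 1]
            all_false = all_false * (1 - (v if lit > 0 else 1 - v)) % Q
        result = result * (1 - all_false) % Q
    return result

def f(cnf, m, prefix):
    """f_i(prefix): sum of p over Boolean values of the remaining variables."""
    return sum(poly(cnf, list(prefix) + list(tail))
               for tail in product((0, 1), repeat=m - len(prefix))) % Q

def interpolate(ys, z):
    """Value at z of the polynomial of degree < len(ys) taking the value
    ys[t] at t = 0, 1, ... (Lagrange interpolation mod Q)."""
    total = 0
    for t, y in enumerate(ys):
        num, den = 1, 1
        for s in range(len(ys)):
            if s != t:
                num = num * (z - s) % Q
                den = den * (t - s) % Q
        total = (total + y * num * pow(den, -1, Q)) % Q
    return total

def honest_prover(cnf, m, degree):
    """Prover message: f_i(r_1..r_{i-1}, z) evaluated at z = 0..degree."""
    return lambda rs: [f(cnf, m, rs + [t]) for t in range(degree + 1)]

def verify(cnf, m, k, prover, degree, rng):
    claim = k % Q                      # Phase 0: claimed value of f_0()
    rs = []
    for _ in range(m):                 # Phases 1..m
        ys = prover(list(rs))
        if len(ys) != degree + 1 or (ys[0] + ys[1]) % Q != claim:
            return False
        rs.append(rng.randrange(Q))    # random r_i from the field
        claim = interpolate(ys, rs[-1])
    return claim == poly(cnf, rs)      # Phase m + 1: direct check

cnf = [[1, -2], [2, 3]]   # (x1 OR NOT x2) AND (x2 OR x3), 4 satisfying assignments
degree = 4                # total number of literals bounds each variable's degree
prover = honest_prover(cnf, 3, degree)
rng = random.Random(1)
print(verify(cnf, 3, 4, prover, degree, rng))  # True: correct count
print(verify(cnf, 3, 5, prover, degree, rng))  # False: wrong count, honest messages
```

The second call shows only that an honest Prover cannot support a false $k$; a crooked Prover would have to send altered polynomials, which the random evaluation points then expose with high probability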

PSPACE ⊆ IP

We can show this by proving that $TQBF$ is in $IP$

Let’s start by following the same reasoning as in our proof above

Let $ψ$ be a quantified Boolean formula of the form $ψ = Q_{1} x_{1} Q_{2} x_{2} \dots Q_{m} x_{m} [ϕ]$

Again, we define $f_i(a_1, \dots, a_i) = \begin{cases} 1 & \text{if } Q_{i+1} x_{i+1} \dots Q_m x_m [ϕ(a_1, \dots, a_i)] \text{ is true} \\ 0 & \text{otherwise} \end{cases}$

By this definition, $f_{0} ()$ is our final truth value

Our quantifiers give us two new arithmetic identities,
$Q_{i + 1} = \forall$ : $f_{i} (a_{1}, \dots, a_{i}) = f_{i + 1} (a_{1}, \dots, a_{i}, 0) \cdot f_{i + 1} (a_{1}, \dots, a_{i}, 1)$
$Q_{i + 1} = \exists$ : $f_{i} (a_{1}, \dots, a_{i}) = f_{i + 1} (a_{1}, \dots, a_{i}, 0) * f_{i + 1} (a_{1}, \dots, a_{i}, 1)$ (recall $x * y = 1 - (1 - x) (1 - y)$ )
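These two identities are enough to evaluate a quantified formula on Boolean inputs. The following sketch is an illustration only: the encoding of quantifiers as a string of `'A'` and `'E'` and the names `star` and `evaluate` are assumptions made here.

```python
def star(x, y):
    """The operation x * y = 1 - (1 - x)(1 - y), i.e. arithmetic OR."""
    return 1 - (1 - x) * (1 - y)

def evaluate(quantifiers, phi, prefix=()):
    """f_i(prefix) for psi = Q_1 x_1 ... Q_m x_m [phi].

    quantifiers: one character per variable, 'A' for forall, 'E' for exists.
    phi: function mapping a tuple of 0/1 values to 0 or 1."""
    i = len(prefix)
    if i == len(quantifiers):
        return phi(prefix)
    lo = evaluate(quantifiers, phi, prefix + (0,))
    hi = evaluate(quantifiers, phi, prefix + (1,))
    return lo * hi if quantifiers[i] == 'A' else star(lo, hi)

differ = lambda a: int(a[0] != a[1])
print(evaluate("AE", differ))  # 1: for every x there is a y with x != y
print(evaluate("EA", differ))  # 0: no x differs from every y
```

Calling `evaluate` with an empty prefix computes $f_0()$, the truth value of $ψ$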

We run into an issue if we proceed as before, because these identities can double the degree of the resulting polynomial each time they are applied, leading to an exponential number of coefficients

To fix this, we include a new operator to help reduce the degree of our polynomials,
$ψ^{'} = Q x_{1} R x_{1} Q x_{2} R x_{1} R x_{2} Q x_{3} R x_{1} R x_{2} R x_{3} \dots Q x_{m} R x_{1} \dots R x_{m} [ϕ]$

We simplify as $ψ^{'} = S_1 y_1 S_2 y_2 \dots S_k y_k [ϕ]$ where $S_i \in \{\forall, \exists, R\}$ and $y_i \in \{x_1, \dots, x_m\}$

For each $i \leq k$ , we define $f_{i}$

$f_{k} (x_{1}, \dots, x_{m})$ is the polynomial $p (x_{1}, \dots, x_{m})$ obtained by arithmetizing $ϕ$

For $i < k$ ,
$S_{i + 1} = \forall$ : $f_{i} (\dots) = f_{i + 1} (\dots, 0) \cdot f_{i + 1} (\dots, 1)$
$S_{i + 1} = \exists$ : $f_{i} (\dots) = f_{i + 1} (\dots, 0) * f_{i + 1} (\dots, 1)$
$S_{i + 1} = R$ : $f_{i} (\dots, a) = (1 - a) f_{i + 1} (\dots, 0) + a f_{i + 1} (\dots, 1)$

While $f_{i}$ has one fewer input variable than $f_{i + 1}$ for $\forall$ and $\exists$ , this is not the case for $R$ , so $f_{i}$ does not generally depend on $i$ variables

In addition, we reorder the inputs to the functions so that input variable $y_{i + 1}$ is the last argument, such that $S_{i + 1} = R$ has the effect of producing a result linear in $y_{i + 1}$ but with the same truth value

This way, each degree is reset to 1 prior to squaring due to quantifiers
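A small sketch of the $R$ operator acting on the last argument of a function, following the identity for $S_{i + 1} = R$ above. The name `linearize` and the sample polynomial are assumptions made for illustration.

```python
def linearize(g):
    """Apply R to the last argument of g: the result is linear in that
    argument and agrees with g whenever the argument is 0 or 1."""
    def rg(*args):
        *rest, a = args
        return (1 - a) * g(*rest, 0) + a * g(*rest, 1)
    return rg

g = lambda x: x**4        # degree 4, but 0/1-valued on Boolean inputs
rg = linearize(g)

print(rg(0), rg(1))       # 0 1: same values as g on Boolean inputs
print(g(7), rg(7))        # 2401 7: the degree has dropped to 1
```

Because the quantifier identities only ever evaluate $f_{i + 1}$ at 0 and 1 in the quantified variable, replacing a polynomial by its linearization does not change any truth value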

We choose a field $F$ of size at least $n^{4}$ , where $n$ is the length of $ϕ$

In Phase 0: $P$ sends $f_0()$ to $V$, and $V$ rejects if $f_0() \neq 1$

In Phase $i$: $P$ sends the coefficients of $f_i(r_1, \dots, z)$ as a polynomial in $z$
$V$ evaluates $f_i(r_1, \dots, 0)$ and $f_i(r_1, \dots, 1)$

$V$ checks that $f_{i-1}(r_1, \dots) = \begin{cases} f_i(r_1, \dots, 0) \cdot f_i(r_1, \dots, 1) & S_i = \forall \\ f_i(r_1, \dots, 0) * f_i(r_1, \dots, 1) & S_i = \exists \end{cases}$
Or if $S_i = R$ then $f_{i-1}(r_1, \dots, r) = (1 - r) f_i(r_1, \dots, 0) + r f_i(r_1, \dots, 1)$
If this fails then $V$ rejects

$V$ picks a random $r$ in $F$ and sends it to $P$, continuing the protocol

In Phase $k + 1$: $V$ directly checks that $p(r_1, \dots, r_m) = f_k(r_1, \dots, r_m)$, accepts if so and rejects otherwise

The correctness of this protocol follows similarly to that of the $\#SAT$ protocol

Clearly if $ψ$ is true then $P$ can follow the protocol and $V$ will accept
If $ψ$ is false, $\tilde{P}$ must send a lie for $f_{0} ()$

At Phase $i$ , if $V$ has an incorrect value for $f_{i - 1} (r_{1} \dots)$ then the polynomial for $f_{i}$ must also be incorrect

Then, the chance that $f_i(r_1, \dots, r)$ happens to be the correct value is at most the polynomial degree divided by the field size, $n / n^4 = n^{-3}$

The protocol proceeds for $O (n^{2})$ phases, so the probability that $\tilde{P}$ gets lucky at some point is at most $1/ n$

Otherwise, $V$ will catch the incorrect value of $f_{k}$ by calculating it directly in Phase $k + 1$

Binyamin's Notes
