Computer scientists are interested in proving lower bounds for their algorithms. Lower bounds are important because they give algorithm designers some metric about how efficient their algorithms can be.
Computer scientists are usually interested in showing lower bounds for the running time of Turing Machines, which are computational models that emulate how our computers work (with some “small” overhead).
As is the usual motif in theoretical computer science, showing lower bounds for Turing Machines is considered difficult, and computer scientists have found it easier to analyse lower bounds in the setting of non-uniform circuits. In the non-uniform circuit model, we assume that we are given some circuit Cn which computes some function f:{0,1}n→{0,1}.
Our circuits are modeled as directed acyclic graphs with nodes representing inputs, outputs and intermediate computations. This computations are done using some simple operations (called gates) acting on bits, for example:
NOT gates (¬)
OR gates (∨)
AND gates (∧)
These gates are enough to do any computation, and as such are called universal gates. For our purposes, AND gates and OR gates can operate on a poly(n) bits.
Since circuits are directed graphs we can view them as layered computations. At the first layer we start with some subset of the input x∈{0,1}n, and at each layer we apply some “local” computation to nodes from the previous layer. The final layer is usually one node which we call the output of the circuit.
An example of a circuit with 3 layers.
The reason why circuits are easier to prove lower bounds than Turing Machines, is that we can easily model the local computation done between consecutive layers.
In contrast to the time complexity of Turing Machines: we usually quantify the running time of circuits by their number of layers (or depth). Think of the depth of the circuit as the number of consecutive computations performed on the input. We usually count the depth of the circuit as the number of layers (not including the input layer).
A major caveat is that the size of the circuit cannot be arbitrarily large (!), and we usually want it to be polynomial in the input’s size, as we cannot build arbitrarly big circuits.
We know how to prove non-uniform circuit lower bounds for some specific functions (such as parity), but in general we don’t know how this could be done.
To resolve this difficulty, we focus on an even simpler problem. That of finding lower bounds (depth lower bounds) for monotone circuits which compute monotone functions.
What are monotone functions? We can define an ordering on strings by the ordering (x1⋯xn≤y1⋯yn iff for every 1≤i≤n, xi≤yi)
a function is called monotone if x≤y, then f(x)≤f(y) (for example: 0≤1).
Monotone circuits are circuits which compute monotone functions. They consist of an even more restricted set of gates than we previously saw. In our case:
OR gates (∨)
AND gates (∧)
and as such, monotone circuits are not universal and cannot compute arbitrary functions only monotone functions. But how could a function like f(x)=1 be computed only OR and AND gates? Any circuit consisting of OR and AND gates which takes as input the string x=0n must return 0.
To fix this we augment the input to the circuit by adding advice, a constant string an.
I will finish this section with some questions for the reader.
Why do monotone circuits compute monotone functions?
Given some montone function f, think of a depth-2 monotone circuit which computes f.
We will now go on to talk about something that is seemingly unrelated: communication complexity. This concept has surprising connections to what we previously discussed.
In the setting of communication complexity, two players (Alice and Bob) want to solve some task together. Alice is given the clue x and Bob is given y and they need to compute some function of x and y. We assume that both players have unlimited computational power and are interested in the amount of information (bits) communicated between them.
We are interested in a specific class of games, these are called Karchmer-Wigderson (KW) games. The game is defined as follows for some monotone function f:
Karchmer-Wigderson Game Mf:
Alice gets x∈{0,1}n such that f(x)=1.
Bob gets y∈{0,1}n such that f(y)=0.
Goal: Find i such that xi=1 and yi=0.
Comment: Such an index i always exists because f is monotone.#
Connecting Communication Complexity and Monotone Circuit Depth#
Given a monotone function f we denote by Depth(f) the monotone circuit depth of the function and by CC(Mf) the communication complexity of the KW game for f. The following theorem gives a profound connection between circuits and communication complexity.
In fact
Depth(f)=CC(Mf)
In other words, to compute a lower bound for the run-time of an algorithms (in the monotone circuit model), we could try to show a lower bound for the number of bits communicated between two parties in the game Mf! This gives us a way to analyse circuits using communication complexity (🤯).
We will prove the equality by showing that CC(Mf)≤Depth(f) and Depth(f)≤CC(Mf).
Try to prove this inequality yourself before proceeding to read the proof.
Hint
Try proving this by induction.Proof
Suppose we are given a circuit C of depth Depth(f). We will design a protocol for Alice and Bob for the game Mf which uses at most Depth(f) bits of information. We will prove this claim by induction.
Alice and Bob are both going to hold the exact same circuit C, and are going to follow the following protocol:
Induction Base: If the circuit is of depth 1, than the circuit is xi for some i, so they need to communicate 0 bits.
Induction Step: suppose the gate at the output of the circuit is an OR gate, and denote the left left sub-circuit f0 and the right sub-circuit f1, and so f=f0∨f1. Alice and Bob know that f(x)=1 and f(y)=0, thus f0(y)=f1(y)=0 and f0(x)=1∨f1(x)=1. The condition f0(y)=f1(y)=0 about Bob’s clue gives no new information to Alice, because she already knows that this is true. But Alice could communicate a bit j such fj(x)=1. fj is monotone as it is a subcircuit of a monotone circuit and has depth Depth(fj)≤Depth(f)−1. Also, note that fj(x)=1 while fj(y)=0 so the initial condition for the game still holds.
Protocol for Alice and Bob in the case that the top-most gate is an OR gate
Hence, by the induction hypothesis, Alice and Bob could succeed in the game fj with Depth(fj) bits. In other words CC(Mf)≤1+CC(Mfj)≤1+Depth(f)−1=Depth(f)
Exercise: We are missing the case for AND in the induction, modify the argument for the OR case.
This inequality is considerably harder to prove. In this case suppose Alice and Bob have a protocol for the game Mf, we need to find a circuit C of depth at most Depth(f) which computes f.
Let’s assume for a moment that the protocol starts with Alice sending a bit to Bob j. Bob now knows that Alice’s input x is in some set Xj. Essentially, this bit partitions Alice’s input set A:=f−1(1) (all x`s such that f(x)=1) to two disjoint sets A=A0⊎A1 (i.e. the set of inputs for which Alice sends 0 and the set of inputs for which Alice sends 1).
The key insight is that we could consider a more general game then before, and conclude that the inequality holds as it is a special case.
With this newly found intuition we might want to define a more general game. Given a monotone function f, we define the following game
Back our previous goal, we want to show that Depth(f)≤CC(Mf), equivalently Depth(f)≤CC(MA,B) for A=f−1(1) and B=f−1(0). Furthermore, assume as before that protocol start by Alice sending the bit j to Bob.
We would want to leverage some induction hypothesis somehow. The induction should be about the communication complexity of the sub-protocol CC(MAj,B)=CC(MA,B)−1.
But now it seems we are stuck. Since f is fixed, it couldn’t possibly be that Depth(f)≤CC(MAj,B).
How could we solve this conundrum?
Solution
We observe that f need not be fixed, in fact we only care about gj that is equal to f on the set Aj∪B.
Thus, we come up with the following theorem:
Theorem:
If CC(MA,B)=d for some function monotone function f then there exists a monotone function g such that:
Depth(g)≤d.
g(x)=1 for x∈A.
g(y)=0 for y∈B.
In fact this theorem, gives us an algorithm for constructing a circuit for g≡f∣A∪B. Of course, the inequality we set out to prove follows from an application of this theorem with A=f−1(1) and B=f−1(0).
The proof of this theorem draws from the previous proof. Try to find a way of construction a circuit for g using a proof by induction.
Proof
Induction Base: If d=0, then the players don’t need to communicate any information. Therefore, the players know that there exists some coordinate i such that xi=yi thus the circuit is simply the circuit which computes the function f(z)=zi
Induction Step: In this case, we assume Alice sends the first bit to Bob. This partitions Alice’s input set A to A=A0⊎A1. By the induction hypothesis, there exists f0 such that:
Depth(f0)≤d−1.
f0(x)=1 for x∈A0.
f0(y)=0 for y∈B.
and also, there exists g1 such that:
Depth(f1)≤d−1.
f1(x)=1 for x∈A1.
f1(y)=0 for y∈B.
We know want to construct g such that g≡f∣A∪B. It is important to note that it is possible that f0(x)=0 for x∈A1 (and correspondingly it could be that it is possible that f1(x)=0 for x∈A0 ). Consider the case that g=f0∧f1, and f0(x)=1 and f0(x)=0 for some x∈A0, in this case g(x)=0∧1=0=1. However, if g=f0∨f1, then indeed g(x)=f0(x)∨f1(x)=1 for x∈A∪B. By the induction hypothesis Depth(g)=1+max(Depth(f0),Depth(f1))≤1+d−1=d.
Exercise: Show that g=f0∧f1 if Bob sends the first bit.
Counter-intuitively we proved the second inequality by proving a more general case. When I initially saw this proof, it confused me. But I don’t see another way to make the induction work.
Could you think of other examples where to prove a statement, we look for a more general statement?
This technique is called “strengthening the induction hypothesis” for more examples see.
Some results that were proved using this technique:
the function that takes as input a a bipartite graph and checks whether is contains a bipartite matching requires monotone depth Ω(n).
the function of st-Connectivity: given a graph G and two input nodes s and t whether s and t the function determines whether s and t are connected. This function requires monotone depth Ω(log(n)2).