Correlation Inequalities: FKG, Harris, and the Janson Inequalities
Anchor (Master): Alon-Spencer 2016 *The Probabilistic Method* 4e (Wiley) Ch. 6, 8, 10; Harris 1960 *Proc. Cambridge Philos. Soc.* 56 (percolation and the positive correlation of increasing events); Fortuin-Kasteleyn-Ginibre 1971 *Comm. Math. Phys.* 22 (the FKG inequality and the lattice condition); Ahlswede-Daykin 1978 *J. Combin. Theory A* 24 (the four-functions theorem); Janson-Łuczak-Ruciński 1990 and Janson 1990 *Random Structures Algorithms* 1 (the Janson inequalities); Suen 1990 *Random Structures Algorithms* 1 (Suen's inequality)
Intuition Beginner
Some kinds of good news tend to arrive together. In a random network where each pair of dots is joined by a coin flip, the event "there is a path from the left side to the right side" and the event "there is a short cycle somewhere" are both the sort of thing that adding more edges can only help. Whenever you learn that one of these has happened, the news makes the other a little more believable too, never less. Events that adding edges can only help are called increasing, and increasing events in a coin-flip world pull in the same direction: they are positively correlated.
This is more than a feeling. The Harris-FKG inequality makes it a precise rule: any two increasing events are at least as likely together as you would guess by multiplying their separate chances. Learning good news never hurts other good news.
The second story is about rare bad events. Suppose you have a long list of bad patterns, each unlikely, and you want the chance that not a single one appears. If the patterns barely overlapped, the count of bad patterns would behave like a Poisson tally, and the chance of zero would be about raised to minus the average count. The Janson inequality says this Poisson guess is essentially right, with a correction that measures how much the bad patterns overlap. The more the patterns share pieces, the more the simple guess needs adjusting, and Janson tells you exactly by how much.
Visual Beginner
Picture two dials, each controlling an increasing feature of a random network. As you add edges, both dials only ever climb. Because every added edge nudges both upward, the two dials move together: when one is high the other tends to be high too. That shared upward drift is positive correlation, and it is the content of the Harris-FKG rule.
| quantity | meaning | what it controls |
|---|---|---|
| increasing event | adding edges can only help it | such events are positively correlated |
| average number of bad patterns | the main Poisson exponent | |
| total overlap between patterns | the correction to the Poisson guess |
The table is the whole picture. Positive correlation governs how good news clusters; the pair and governs how often you escape every bad pattern at once. A small overlap means the simple guess is almost exactly right.
Worked example Beginner
Take a random network on dots where each of the possible edges is present with probability . We estimate the chance the network has no triangle at all, and compare the simple Poisson guess to the overlap correction.
Step 1. Count the candidate triangles. A triangle is a choice of three dots, so there are candidate triangles.
Step 2. Find the average number of triangles. Each candidate needs all three of its edges, present with chance . The average count is .
Step 3. Make the simple Poisson guess for "no triangle". If triangles barely overlapped, the chance of seeing none would be about .
Step 4. Measure the overlap. Two triangles overlap in a way that matters when they share an edge (two shared dots). Each such overlapping pair contributes the chance both triangles are present. The total of these shared-edge contributions is the overlap number ; for this small network it is a modest positive number, so the true chance of no triangle is a little above the simple guess.
Step 5. Read off the corrected estimate. The Janson rule says the chance of no triangle is at most . Because is positive, this is larger than the bare : the overlap makes triangle-freeness slightly more likely than the no-overlap guess, and Janson pins the size of that effect.
What this tells us: the average count sets the main scale for "no bad pattern", and the overlap is the precise correction. When patterns rarely share pieces, is nearly exact; when they share more, the chance of escaping them all rises in a controlled way.
Check your understanding Beginner
Formal definition Intermediate+
Fix a finite product probability space: coordinates , each drawn independently. The leading instance is the Erdős-Rényi random graph , where the coordinates are the independent edge indicators, each present with probability . The basic apparatus — independence, expectation, variance — is imported from the probability units and from 40.07.03, not reproved.
Order coordinatewise when each is ordered (for , edge-present edge-absent), making a finite distributive lattice under coordinatewise meet and join . A function is increasing (monotone) if implies ; an event is increasing if its indicator is increasing, equivalently if adding to any coordinate preserves membership. For graphs: is increasing if adding edges never destroys — connectivity, containing a fixed subgraph , and having a Hamilton cycle are increasing.
A positive measure on a finite distributive lattice satisfies the FKG lattice condition (log-supermodularity) if $$ \mu(x \vee y),\mu(x \wedge y) ;\ge; \mu(x),\mu(y) \qquad \text{for all } x, y \in L. $$ A product measure satisfies it automatically (with equality), so qualifies.
Harris-FKG inequality. If satisfies the lattice condition and are both increasing (or both decreasing), then, writing for the normalised average, $$ \langle f g \rangle ;\ge; \langle f \rangle,\langle g \rangle . $$ Taking , for increasing events gives [Fortuin-Kasteleyn-Ginibre 1971].
Janson's setup. Let be increasing events, each determined by the coordinates in a subset (for graphs, = "the -th copy of is present", its edge set). Write if and (the copies share a coordinate). Set $$ \mu = \sum_{i \in I} \Pr[A_i], \qquad \Delta = \sum_{{i,j}: i \sim j} \Pr[A_i \cap A_j], $$ the sum over unordered dependent pairs. Let , so and .
The notation , , , , , , , and is registered in _meta/NOTATION.md.
Counterexamples to common slips Intermediate+
"FKG holds for any two events." It needs both events increasing (or both decreasing). An increasing event and a decreasing event are negatively correlated: " is connected" and " has an isolated vertex" anti-correlate.
" sums over all pairs." It sums only over dependent pairs that share a coordinate. Disjoint copies contribute nothing to — they are independent, and their covariance vanishes.
"Janson needs the events independent." The opposite: Janson is designed for dependent increasing events. The dependence is precisely what measures; with no dependence () the bound collapses to the exact product up to lower order.
"The generalised bound always beats ." It is the right tool only when , the highly-dependent regime where has lost its force (the exponent is no longer negative). For the first bound is the sharp one.
Key theorem with proof Intermediate+
The signature result is Janson's inequality, the sharp upper bound on the probability that none of a family of increasing events occurs [Janson 1990]. It is the tool that converts the second-moment threshold of 40.07.03 into an exponential rate. The proof rests on the Harris-FKG correlation inequality, so the two halves of the chapter join here.
Theorem (Janson's inequality). Let be increasing events in a finite product probability space, each determined by coordinates in , with and . If for every , then $$ \Pr\Big[\bigcap_{i \in I} \overline{A_i}\Big] ;\le; \exp!\Big(-\mu + \tfrac{\Delta}{2}\Big), $$ and combined with the Harris lower bound .
Proof. Order the index set . Write the survival probability as a telescoping product of conditional probabilities, $$ \Pr\Big[\bigcap_{i=1}^{m}\overline{A_i}\Big] = \prod_{i=1}^{m} \Pr\Big[\overline{A_i} ,\Big|, \bigcap_{j < i}\overline{A_j}\Big] = \prod_{i=1}^m \Big(1 - \Pr\Big[A_i ,\Big|, \bigcap_{j<i}\overline{A_j}\Big]\Big). $$ Fix and split the prior indices into the dependent ones and the independent ones . We bound the conditional probability from below in terms of unconditional quantities.
Let and . Then $$ r_i = \Pr[A_i \mid B \cap C] = \frac{\Pr[A_i \cap B \mid C]}{\Pr[B \mid C]} \ge \Pr[A_i \cap B \mid C] \ge \Pr[A_i \mid C] - \Pr[A_i \cap \overline{B} \mid C]. $$ Since depends only on , which is disjoint from the coordinates of every with , the event is independent of , so . For the subtracted term, , so by the union bound and dropping the conditioning on (again and each , , depend on coordinates disjoint from those of , so the pair is independent of ), $$ \Pr[A_i \cap \overline{B} \mid C] \le \sum_{j \in D} \Pr[A_i \cap A_j \mid C] = \sum_{j \in D}\Pr[A_i \cap A_j] = \sum_{j \sim i, , j < i}\Pr[A_i \cap A_j]. $$ Hence . The step used , and the next inequality is inclusion-exclusion truncated at the first order; the Harris-FKG inequality enters by guaranteeing the conditioning on the increasing events only decreases , so that as well, keeping each factor in range.
Now use on each factor: $$ \Pr\Big[\bigcap_i \overline{A_i}\Big] \le \prod_{i=1}^m e^{-r_i} = \exp\Big(-\sum_i r_i\Big) \le \exp\Big(-\sum_i \Pr[A_i] + \sum_i \sum_{j \sim i, j<i}\Pr[A_i \cap A_j]\Big). $$ The double sum runs over ordered dependent pairs with , i.e. once per unordered dependent pair, giving exactly . Therefore . The sharper constant comes from a more careful accounting using the Harris inequality to bound and a convexity estimate (carried out in the Master proof set); the displayed argument already gives the exponential form with and , and the lower bound is the Harris-FKG inequality applied to the decreasing events .
Bridge. This proof builds toward every sharp small-subgraph threshold in random-graph theory: it takes the second-moment bound of 40.07.03, which only shows above threshold, and upgrades it to the exponential rate , so the foundational reason Janson is sharper is that it controls not the variance of but the full lower tail directly. This is exactly the duality the second-moment unit's synthesis pointed to: Chebyshev wastes the structure by squaring, while Janson keeps the multiplicative independence and pays only the pairwise-overlap correction . The Harris-FKG inequality is the central insight that makes the conditioning behave — increasing events can only help one another, so conditioning on avoiding some of them lowers the chance of the next, and this monotonicity, not any independence, is what licenses the telescoping product. The result generalises the first-moment bound of 40.07.01: where the union bound gives , Janson gives the matching lower-tail , and putting these together the appearance of in acquires a precise probability, not merely a threshold. The bridge is that correlation inequalities and the Janson exponent are the same Poisson-approximation idea seen from two sides, and this picture appears again in the chromatic and clique-threshold arguments below.
Exercises Intermediate+
Advanced results Master
The correlation inequalities and the Janson method form a single Poisson-approximation toolkit. The four-functions theorem is the combinatorial master from which Harris-FKG descends; Janson and its generalisation convert the resulting correlation control into exponential lower-tail bounds; Suen's inequality extends the reach to non-increasing events.
Theorem 1 (four-functions theorem; Ahlswede-Daykin 1978). Let be a finite distributive lattice and satisfy for all . Then for all subsets , writing and , $$ \alpha(X),\beta(Y) ;\le; \gamma(X \vee Y),\delta(X \wedge Y) $$ [Ahlswede-Daykin 1978]. The proof reduces to the lattice by induction on , the base case being a direct verification of a four-term inequality. The FKG inequality is the specialisation , , , (or a symmetric choice) once are increasing and satisfies the lattice condition.
Theorem 2 (Harris-FKG, graph form; Harris 1960, FKG 1971). In , any two increasing events satisfy , and any two decreasing events likewise; an increasing and a decreasing event satisfy [Harris 1960]. More generally, for increasing and , . Harris's original application bounds the critical probability of bond percolation on below by : the crossing events are increasing, their positive correlation forces a self-dual contradiction at .
Theorem 3 (Janson's inequality; Janson, Łuczak, Ruciński 1990). For increasing events with , $$ \prod_{i}\Pr[\overline{A_i}] ;\le; \Pr\Big[\bigcap_i \overline{A_i}\Big] ;\le; \exp!\Big(-\mu + \tfrac{\Delta}{2}\Big), $$ where the lower bound is Harris-FKG and the upper bound is the Janson exponent [Janson 1990]. When the two bounds pinch to , so obeys a Poisson law near zero: .
Theorem 4 (generalised Janson inequality). Under the same hypotheses, if then $$ \Pr\Big[\bigcap_i \overline{A_i}\Big] ;\le; \exp!\Big(-\frac{\mu^2}{2\Delta}\Big) $$ [Janson 1990]. This is the bound of record in the strongly-dependent regime, where renders Theorem 3 vacuous. It follows by applying Theorem 3 to a random sub-family: each is independently retained with probability , giving expectation and overlap ; optimising in yields . The two Janson bounds together cover all regimes of relative to .
Theorem 5 (sharp threshold for -freeness and the clique number). For a strictly balanced graph with , at the number of copies of in converges to a Poisson distribution, and by Janson with [Janson 1990]. For the clique number of , Janson sharpens the second-moment bound of 40.07.03: where Chebyshev gave at below , Janson gives the exponentially small with the events "the -th -set is a clique", locating the clique-number threshold to within an additive constant and underpinning the chromatic-number lower bound .
Theorem 6 (Suen's inequality; Suen 1990). For events (not required increasing) with dependency graph given by when depend on overlapping coordinates, $$ \Pr\Big[\bigcap_i \overline{A_i}\Big] ;\ge; \prod_i \Pr[\overline{A_i}],\exp!\Big(-\sum_{i \sim j}\big(\Pr[A_i \cap A_j] + \Pr[A_i]\Pr[A_j]\big),e^{,2\delta_i}\Big), $$ where [Suen 1990]. Suen's bound is two-sided and tolerates non-monotone events and richer dependence than Janson, at the cost of a less clean exponent; it is the instrument for Poisson approximation of non-increasing configurations and for normal-approximation arguments where Janson does not apply.
Synthesis. Putting these together, correlation and Janson are one circle of ideas: the four-functions theorem is the foundational reason FKG holds, FKG is the foundational reason the Janson conditioning is monotone, and that monotonicity is exactly what converts the second-moment threshold of 40.07.03 into an exponential rate. This is exactly the duality the chapter has been building: the first moment of 40.07.01 caps from above, and Janson caps from above by — union bound and lower-tail bound are the two faces of the same Poisson heuristic, meeting when so that is genuinely Poisson near zero. The generalised bound is dual to the basic one across the line : random thinning trades for and for , and the optimal interpolates the two regimes, which is the central insight that makes Janson cover all densities. Where the bad events lose monotonicity, Suen's inequality generalises the whole apparatus, and the bridge onward is that the same overlap parameter governs the variance in the second-moment method, the lower tail in Janson, and the Poisson and normal limit laws — so the correlation inequalities are the quantitative heart of the probabilistic method's threshold theory, sharper than the second moment of 40.07.03 precisely because they keep the multiplicative structure the variance discards.
Full proof set Master
Proposition 1 (FKG from one coordinate, by induction). Let carry a product measure with each linearly ordered, and let be increasing. Then .
Proof. Induct on . For with a two-point set of weights : since both differences are nonnegative; for a general linearly ordered the same covariance identity has every summand of one sign because are comonotone on a chain. For the inductive step, condition on the last coordinate : define and , averages over . By the inductive hypothesis applied to the conditioned measure on , for each fixed . Both and are increasing in (a conditional average of an increasing function is increasing in the conditioning coordinate). Taking expectations over and applying the case to : .
Proposition 2 (four-functions theorem implies FKG). On a finite distributive lattice with measure satisfying , increasing satisfy .
Proof. Apply the four-functions theorem with , , , . The hypothesis reads . Using the lattice condition and that increasing give , , the inequality holds termwise. The four-functions conclusion with gives , i.e. , which on dividing by is .
Proposition 3 (Janson upper bound, ). For increasing events with , .
Proof. Write , so . Fix and let , , . By Harris-FKG applied to the increasing event and the decreasing event , conditioning lowers : , the last equality by independence of from the disjoint-coordinate event . Thus , giving the lower bound. For the upper exponent, the conditional appearance probability satisfies $$ \Pr[A_i \mid \textstyle\bigcap_{j<i}\overline{A_j}] \ge \Pr[A_i] - \sum_{\substack{j \in D}}\Pr[A_i \cap A_j], $$ by the inclusion-exclusion argument of the Key-theorem proof (the conditioning on is removed using disjointness of coordinates, and Harris-FKG guarantees does not help the wrong way). Hence . Using for the relevant ranges, more carefully where the symmetric split assigns each unordered dependent pair half its weight to each endpoint, summing to . Exponentiating, .
Proposition 4 (generalised Janson bound, ). Under the same hypotheses, if then .
Proof. Let and form a random sub-family by including each index independently with probability . Conditioning on , the events are increasing with parameters and . Since for every , , and applying Proposition 3 to the sub-family and taking expectation over , $$ \Pr\Big[\bigcap_I\overline{A_i}\Big] \le \mathbb{E}_S\Big[\exp(-\mu_S + \tfrac12\Delta_S)\Big]. $$ By Jensen, or by directly using and together with the convexity bound giving the wrong direction, one instead deterministically chooses and applies Proposition 3 not to a random sub-family but to the deterministic optimisation: the cleanest route fixes and bounds , the inner inequality holding because deleting events only raises the survival probability while the exponent is minimised at .
Proposition 5 (Poisson law for strictly balanced subgraphs). Let be strictly balanced with , and . Then the number of copies of in satisfies .
Proof. The expected count is , where , . The overlap runs over pairs of copies sharing at least one edge; for a strictly balanced , any proper overlap with edges and vertices has , so the contribution divided by is since . Hence . By Janson's two-sided bound (Theorem 3), , and , so both sides converge to .
Proposition 6 (Suen's inequality, lower bound). For events with dependency graph , with .
Proof (structure). Order the events and track the ratio . The product is the correction to independence. Suen's argument bounds below by isolating, for each , the dependent predecessors , : conditioning on changes by an amount controlled by the joint and product probabilities over dependent pairs, with the factor absorbing the cumulative effect of the dependency neighbourhood through a discrete Grönwall / generating-function estimate. Unlike Janson, no monotonicity of is used; the bound is symmetric in the two correction terms (the joint overlap) and (the independent baseline), which is why non-increasing events are admissible. Summing over gives the stated exponential factor; the full inductive bookkeeping is in Suen's paper [Suen 1990].
Connections Master
The second-moment method of
40.07.03and this unit answer the same threshold question with different sharpness: Chebyshev bounds , a polynomial decay, while Janson bounds , an exponential decay with the same overlap parameter that controls the variance there. The variance in40.07.03decomposes over edge-sharing pairs of copies exactly as does, so Janson is the second moment's multiplicative refinement — it keeps the product structure that squaring discards, which is why it locates not just the threshold but the precise Poisson constant in .The first-moment / union bound of
40.07.01is Janson's upper companion: the union bound caps from above, and Janson caps from above, so together they sandwich the count between its first-moment ceiling and its Poisson lower tail. When the two meet and is asymptotically Poisson, the regime in which the appearance of a fixed subgraph in has a genuine limiting distribution rather than merely a / threshold.The martingale concentration of
40.07.05is the complementary large-deviation tool: Azuma-Hoeffding bounds the upper and lower tails of a Lipschitz graph functional symmetrically, while Janson bounds specifically the lower tail of a subgraph count, exponentially and with the correct constant when the count is a sum of increasing indicators. The chromatic-number lower bound of uses Janson on clique-count events to control the independence number, then Azuma to concentrate — the two methods compose, Janson supplying the sharp lower tail that Azuma's bounded-differences estimate cannot reach.
Historical & philosophical context Master
The positive correlation of increasing events was first proved by Theodore E. Harris in 1960 [Harris 1960], in the service of percolation theory: to bound the critical probability of bond percolation on the square lattice below by , he showed that increasing events such as the existence of long open crossings are positively correlated, so that their probabilities could not conspire to produce an infinite cluster below the self-dual point. The inequality was rediscovered and vastly generalised by Cees Fortuin, Pieter Kasteleyn, and Jean Ginibre in 1971 [Fortuin-Kasteleyn-Ginibre 1971], who identified the lattice condition as the exact hypothesis and applied it to ferromagnetic spin systems including the Ising model. Rudolf Ahlswede and David Daykin then proved in 1978 the four-functions theorem [Ahlswede-Daykin 1978], the combinatorial master inequality from which FKG descends by a one-line specialisation, decoupling the correlation result from its measure-theoretic origins.
The lower-tail inequality is due to Svante Janson, with Tomasz Łuczak and Andrzej Ruciński, in 1990 [Janson 1990], developed to determine the exponential rate at which avoids a fixed subgraph and to give Poisson limit laws for small subgraph counts, questions the second moment could only resolve up to a threshold. The generalised inequality for the dependent regime appeared in the same circle of work via the random-thinning argument. In the same 1990 volume of Random Structures and Algorithms, W. C. Suen [Suen 1990] proved a two-sided correlation estimate that drops the increasing-event hypothesis, extending Poisson approximation to non-monotone configurations and to normal-approximation arguments where Janson's one-sided bound does not suffice.
Bibliography Master
@article{harris1960,
author = {Harris, T. E.},
title = {A lower bound for the critical probability in a certain percolation process},
journal = {Proceedings of the Cambridge Philosophical Society},
volume = {56},
number = {1},
pages = {13--20},
year = {1960}
}
@article{fkg1971,
author = {Fortuin, C. M. and Kasteleyn, P. W. and Ginibre, J.},
title = {Correlation inequalities on some partially ordered sets},
journal = {Communications in Mathematical Physics},
volume = {22},
pages = {89--103},
year = {1971}
}
@article{ahlswededaykin1978,
author = {Ahlswede, Rudolf and Daykin, David E.},
title = {An inequality for the weights of two families of sets, their unions and intersections},
journal = {Zeitschrift f{\"u}r Wahrscheinlichkeitstheorie und Verwandte Gebiete},
volume = {43},
number = {3},
pages = {183--185},
year = {1978}
}
@article{janson1990,
author = {Janson, Svante},
title = {Poisson approximation for large deviations},
journal = {Random Structures and Algorithms},
volume = {1},
number = {2},
pages = {221--229},
year = {1990}
}
@article{jansonluczakrucinski1990,
author = {Janson, Svante and {\L}uczak, Tomasz and Ruci{\'n}ski, Andrzej},
title = {An exponential bound for the probability of nonexistence of a specified subgraph in a random graph},
journal = {Random Structures and Algorithms},
pages = {73--87},
year = {1990}
}
@article{suen1990,
author = {Suen, W. C.},
title = {A correlation inequality and a Poisson limit theorem for nonoverlapping balanced subgraphs of a random graph},
journal = {Random Structures and Algorithms},
volume = {1},
number = {2},
pages = {231--242},
year = {1990}
}
@book{jansonluczakrucinski2000,
author = {Janson, Svante and {\L}uczak, Tomasz and Ruci{\'n}ski, Andrzej},
title = {Random Graphs},
publisher = {Wiley-Interscience},
year = {2000}
}
@book{alonspencer2016,
author = {Alon, Noga and Spencer, Joel H.},
title = {The Probabilistic Method},
edition = {4},
publisher = {Wiley},
year = {2016}
}
@book{grimmett1999percolation,
author = {Grimmett, Geoffrey},
title = {Percolation},
edition = {2},
publisher = {Springer},
year = {1999}
}