Top Banner
Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT [email protected]
22

Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT [email protected].

Mar 26, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Optimal Space Lower Bounds for All

Frequency Moments

David Woodruff

MIT

[email protected]

Page 2: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

The Streaming Model

0113734 … Stream of elements a1, …, aq each in {1, …, m} Want to compute statistics on stream Elements arranged in adversarial order Algorithms given one pass over stream Goal: Minimum space algorithm

Page 3: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Frequency Moments

• q = stream size, m = universe size• fi = # occurrences of item i• Define k-th Frequency Moment:

Applications • F_0 = # distinct elements in stream, F_1 = q• F_2 = repeat rate

• Compute self-joins in database

Page 4: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

The Best Determininistic Algorithm

Trivial Algorithm for Fk

Store/update frequency fi of each item i

Space: m items i, log q bits for each fi

Total Space = O(m log q)

Negative Result [AMS96]: Any algorithm computing Fk exactly must use (m) space.

Can we do better?

Page 5: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Approximating Fk

Negative Result [AMS96]: Any deterministic algorithm that outputs x with |Fk – x| < Fk must use (m) space.

What about randomized approximation algorithms?

Randomized algorithm A -approximates Fk if A outputs x with Pr[|Fk – x| < Fk ] > 2/3

Page 6: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Previous Work

Upper Bounds: Can -approximate F0 [BJKST02], F2 [AMS96], Fk [CK04], k > 2 with space respectively:

Lower Bounds:

[AMS96] 8 k, –approximating Fk need (log m) space

[IW03] -approximating F0 requires space if

Questions: Does the bound hold for k 0?

Does it hold for F0 for smaller ?

Page 7: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

First Result

Optimal Lower Bound: 8 k 1 and any = (m-1/2), any -approximator for Fk must use (-2) bits of space.

F1 = q computed trivially in log q space

Fk computed in O(m log q) space, so need = (m-.5)

Technique: Reduction from 2-party protocol for computing Hamming distance (x,y)

Page 8: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Idea Behind Lower Bounds

x 2 {0,1}m

y 2 {0,1}m

Stream s(x) Stream s(y)

(1 § ) Fk algorithm A

(1 § ) Fk algorithm A

Internal state of A

• Compute (1 § ) Fk(s(x) ± s(y)) w.p. > 2/3 • Idea: If can decide f(x,y) w.p. > 2/3, space used by A at least randomized 1-way comm. Complexity of f(,)

S

Alice Bob

Page 9: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Randomized 1-way comm. complexity

Boolean function f: X £ Y ! {0,1} Alice has x 2 X, Bob y 2 Y. Bob wants f(x,y) Only 1 message sent: must be from Alice to Bob Comm. cost of protocol = expected length of longest

message sent over all inputs. -error randomized 1-way comm. complexity of

f, R(f), is comm. cost of optimal protocol computing f w.p. ¸ 1-

How do we lower bound R(f)?

Page 10: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

The VC Dimension [KNR]

F = {f : X ! {0,1}} family of Boolean functions f 2 F is length-|X | bitstring For S µ X, shatter coefficient SC(fS) of S is |{f |S}f 2

F| = # distinct bitstrings when F restricted to S

SC(F, p) = maxS 2 X, |S| = p SC(fS)

If SC(fS) = 2|S|, S shattered by F VC Dimension of F, VCD(F), = size of largest S

shattered by F

Page 11: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Shatter Coefficient Theorem

Notation: For f: X £ Y ! {0,1}, define: fX = { fx(y) : Y ! {0,1} | x 2 X },

where fx(y) = f(x,y)

Theorem [BJKS]: For every f: X £ Y ! {0,1}, every p ¸ VCD( fX ),

R1/3(f) = (log(SC(fX, p)))

Page 12: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Hamming Distance Decision Problem (HDDP)

We will lower bound R1/3(f) via SC(fX, t), but first, a critical lemma…

Set t = (1/2)

x 2 {0,1}t y 2 {0,1}t

Alice Bob

Promise Problem :

(x,y) · t/2 – t1/2 (x,y) > t/2 f(x,y) = 0 OR f(x,y) = 1

Page 13: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Main Lemma

S µ{0,1}n

y= T= S-T

Show 9 S µ {0,1}n with |S| = n s.t.

there exists 2(n) “good” sets T µ S so that: 9 a separator y 2 {0,1}n s.t

1. 8 t 2 T, (y, t) · n/2 – cn1/2 for some c > 0

2. 8 t 2 S – T, (y,t) > n/2

Page 14: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Lemma Solves HDDP Complexity

Theorem: R1/3(f) = (t) = (-2). Proof:

1. Alice gets yT for random good set T applying main lemma with n = t.

2. Bob gets random s 2 S

3. Let f: {yT }T £ S ! {0,1}.

4. Main Lemma =>SC(f) = 2(t)

5. [BJKS] => R1/3(f) = (t) = (-2)

Page 15: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Back to Frequency Moments

Idea: Use -approximator for Fk in a protocol to solve HDDP

y 2 {0,1}t s 2 S µ {0,1}t

Fk Alg Fk AlgState

ay as

ith universe element included exactlyonce in auxiliary stream ay (resp. as) if and only if yi (resp. si) = 1.

Page 16: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Solving HDDP with Fk

Alice/Bob compute -approx to Fk(ay ± as)

Fk(ay ± as) = 2k wt(y Æ s) + 1k (y,s)

For k 1,

Conclusion: -approximating Fk(ay ± as) decides HDDP, so space for Fk is (t) = (-2)

Alice also transmits wt(y) in log m space.

Page 17: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

But How to Prove Main Lemma?

Recall: show 9 S µ {0,1}n with |S| = n s.t.

there exists 2(n) sets T µ S so that: 9 a separator y 2 {0,1}n s.t

1) 8 t 2 T, (y, t) · n/2 – cn1/2 for some c > 0

2) 8 t 2 S – T, (y,t) > n/2

Use probabilistic method For S, choose n random elts in {0,1}n

Show probability arbitrary T µ S satisfies (1),(2) is > 2-zn for constant z < 1.

Hence expected such T is 2(n)

So exists S with 2(n) such T

Key

Page 18: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Proving the Main Lemma

Let T ={t1, …, tn/2} µ S be arbitrary

Let yi = majority(t1,i, ..., tn/2,i) for all i 2 [m]

What is probability p that both: 1) 8 t 2 T, (y, t) · n/2 – cn1/2 for some c > 0

2) 8 t 2 S – T, (y,t) > n/2

For 1, let x = Pr[8 t 2 T, (y,t) · n/2 – cn.5]

For 2, let y = Pr[8 t 2 S-T, (y,t) > n/2] = 2-n/2

By independence, p = x ¢ y.

It remains to lower bound x…

Page 19: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

The Matrix Problem

WLOG, assume y = 1n (recall y is majority word) Want lower bound Pr[8 t 2 T, (y,t) · n/2 – cn.5] Equivalent to matrix problem:

t1 ->t2 -> …tn/2 ->

101001000101111001100101011100011110001110111101010101101010111011100011

Given random n/2 x n binary matrix w/each column majority 1, what is probablity each row has at least n/2 + cn.5 1s?

Page 20: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Bipartite Graphs

Matrix Problem Bipartite Graph Counting Problem:

How many bipartite graphs exist on n/2 by n vertices s.t. each left vertex has degree > n/2 + cn.5 and each right vertex degree > n/2?

… …

Page 21: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Second Result

Bipartite graph count: Probabilistic argument shows at least 2n^2/2 – zn/2 –n such bipartite graphs for constant z < 1. Analysis generalizes to show # bipartite graphs on m +

n vertices w/each left vertex having degree > n/2 and each right vertex degree > m/2 is > 2mn-zm-n.

Previous known count: 2mn-m-n [MW – personal comm.] Follows easily from a correlation inequality of Kleitman.

Our proof uses correlation inequalities, but more involved analysis.

Page 22: Optimal Space Lower Bounds for All Frequency Moments David Woodruff MIT dpwood@mit.edu.

Summary

Results: Optimal Lower Bound: 8 k 1 and any =

(m-1/2), any -approximator for Fk must use (-2) bits of space.

Bipartite Graph Count: # bipartite graphs on m + n vertices w/each left vertex having degree > n/2 and each right vertex degree > m/2 is at least 2mn-zm-n for constant z < 1.