Low-Cost Learning via Active Data Procurement EC 2015 Jacob Abernethy Yiling Chen Chien-Ju Ho Bo Waggoner 1

Low-Cost Learning via Active Data Procurement

Feb 11, 2022

Transcript
Page 1: Low-Cost Learning via Active Data Procurement

Low-Cost Learning via Active Data Procurement

EC 2015

Jacob Abernethy, Yiling Chen, Chien-Ju Ho, Bo Waggoner

1

Page 2: Low-Cost Learning via Active Data Procurement

2

We Buy Data!

Learners LLC

h: hypothesis (predictor)

General problem: buy data for learning

Page 3: Low-Cost Learning via Active Data Procurement

3


Example: each person has medical data...

… learn to predict disease

Page 4: Low-Cost Learning via Active Data Procurement

Example task: classification

● Data point: pair (x, label), where label is one of two types

● Hypothesis h: hyperplane separating the two types

● Loss: 0 if h(x) = correct label, 1 if incorrect label

● Goal: pick h with low expected loss on a new data point
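The classification example above can be sketched in a few lines. This is only an illustration: encoding the two types as +1 and -1, and the names `predict`, `zero_one_loss`, and `empirical_loss`, are assumptions and do not come from the slides.

```python
from typing import List, Sequence, Tuple


def predict(w: Sequence[float], b: float, x: Sequence[float]) -> int:
    """Return +1 or -1 depending on which side of the hyperplane w.x + b = 0 x lies."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1


def zero_one_loss(w: Sequence[float], b: float, x: Sequence[float], label: int) -> int:
    """Loss is 0 if the hypothesis predicts the correct label, 1 otherwise."""
    return 0 if predict(w, b, x) == label else 1


def empirical_loss(w: Sequence[float], b: float,
                   data: List[Tuple[Sequence[float], int]]) -> float:
    """Average 0-1 loss over a list of (x, label) pairs."""
    return sum(zero_one_loss(w, b, x, y) for x, y in data) / len(data)


# Three hypothetical data points; the third is misclassified by w = (1, 1), b = 0.
data = [((1.0, 1.0), 1), ((-1.0, -1.0), -1), ((2.0, 0.0), -1)]
loss = empirical_loss((1.0, 1.0), 0.0, data)
```

The goal on the slide is low expected loss on a new data point; the empirical average here is only a stand-in computed on a sample.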


Page 5: Low-Cost Learning via Active Data Procurement

General Goal:

Learn a good hypothesis by purchasing data from the crowd

Page 6: Low-Cost Learning via Active Data Procurement

This paper:

1. price data actively based on value

2. machine-learning style bounds

3. transform learning algs to mechanisms

learning alg → mechanism

Page 8: Low-Cost Learning via Active Data Procurement

8

How to assess value/price of data?

Page 9: Low-Cost Learning via Active Data Procurement

9

Use the learner’s current hypothesis!

Page 11: Low-Cost Learning via Active Data Procurement

Our model

[Diagram: distribution → i.i.d. data points z1, z2, … with costs c1, c2, … → online arrival → mechanism → hypothesis h]

Cost of revealing data:

● lies in [0,1]

● worst-case, arbitrarily correlated with the data

Page 12: Low-Cost Learning via Active Data Procurement

12

Agent-mechanism interaction

At each time t = 1, …, T:

1. mechanism posts menu

data: 65 30 65

price: $0.22 $0.41 $0.88

Page 13: Low-Cost Learning via Active Data Procurement

Agent-mechanism interaction

At each time t = 1, …, T:

1. mechanism posts menu

data: 65 30 65

price: $0.22 $0.41 $0.88

2. agent arrives with data zt and cost ct

● accepts: mechanism learns (zt, ct) and pays price(zt)

● rejects: mechanism sees rejection and pays nothing
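One round of this interaction might be sketched as follows. The acceptance rule (the agent accepts exactly when the posted price for her data covers her cost) is an assumption about agent behavior, and the menu keys `"a"`, `"b"`, `"c"` are hypothetical placeholders for data points; only the three prices are taken from the slide.

```python
from typing import Dict, Optional, Tuple


def interact(menu: Dict[str, float], z: str, cost: float) -> Optional[Tuple[str, float, float]]:
    """Simulate one arrival.

    Returns (z, cost, payment) if the agent accepts, so the mechanism learns
    the data and the cost and pays price(z). Returns None on rejection, in
    which case the mechanism learns nothing and pays nothing.
    """
    price = menu[z]
    if price >= cost:  # assumed acceptance rule
        return (z, cost, price)
    return None


# Posted menu: one price per possible data point (keys are hypothetical).
menu = {"a": 0.22, "b": 0.41, "c": 0.88}

accepted = interact(menu, "b", 0.30)  # price 0.41 covers cost 0.30
rejected = interact(menu, "a", 0.50)  # price 0.22 is below cost 0.50
```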

Page 14: Low-Cost Learning via Active Data Procurement

This paper:

1. price data actively based on value

2. machine-learning style bounds

3. transform learning algs to mechanisms

learning alg → mechanism

Page 15: Low-Cost Learning via Active Data Procurement

15

What is the “classic” learning problem?

[Diagram: distribution → i.i.d. data points z1, z2, … → learning alg → hypothesis h]

Page 16: Low-Cost Learning via Active Data Procurement

16

Classic ML bounds

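$$\mathbb{E}\,\mathrm{loss}(h) \le \mathbb{E}\,\mathrm{loss}(h^*) + O\!\left(\sqrt{\frac{\text{VC-dim}}{T}}\right)$$

● h: alg’s hypothesis

● h*: optimal hypothesis

● T: # of data points

● VC-dim: measure of problem difficulty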

Page 17: Low-Cost Learning via Active Data Procurement

17

Main result

For a variety of learning problems:

$$\mathbb{E}\,\mathrm{loss}(h) \le \mathbb{E}\,\mathrm{loss}(h^*) + O\!\left(\frac{\gamma}{\sqrt{B}}\right)$$

● h: our hypothesis

● h*: optimal hypothesis

● B: budget constraint

● γ: measure of “problem difficulty”, in [0,1]

(Assume: γ is approximately known in advance)

Page 18: Low-Cost Learning via Active Data Procurement

18

Main result

For a variety of learning problems:

$$\mathbb{E}\,\mathrm{loss}(h) \le \mathbb{E}\,\mathrm{loss}(h^*) + O\!\left(\frac{\gamma}{\sqrt{B}}\right)$$

● h: our hypothesis

● h*: optimal hypothesis

● B: budget constraint

● γ: measure of “problem difficulty”, in [0,1]

(Assume: γ is approximately known in advance)

γ ≈ average cost × difficulty

“if problem is cheap or easy or has good correlations, we do well”

Page 19: Low-Cost Learning via Active Data Procurement

19

Related work in purchasing data

this work

Meir, Procaccia, Rosenschein 2012

Cummings, Ligett, Roth, Wu, Ziani 2015

Dekel, Fisher, Procaccia 2008

Ghosh, Ligett, Roth, Schoenebeck 2014

Horel, Ioannidis, Muthukrishnan 2014

Roth, Schoenebeck 2012

Ligett, Roth 2012

Cai, Daskalakis, Papadimitriou 2015

Type of goal

Model

Page 20: Low-Cost Learning via Active Data Procurement

This paper:

Key features/ideas:

1. price data actively based on value

2. machine-learning style bounds

3. transform learning algs to mechanisms

learning alg → mechanism

Page 21: Low-Cost Learning via Active Data Procurement

21

Learning algorithms: FTRL

● Follow-The-Regularized-Leader (FTRL) (Multiplicative Weights, Online Gradient Descent, …)

● FTRL algs do “no regret” learning:
  ○ output a hypothesis at each time
  ○ want low total loss

● we interface with FTRL as a black box … but analysis relies on “opening the box”
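To make the black-box interface concrete, here is a minimal sketch of Online Gradient Descent, one of the algorithms the slide lists under FTRL. The class and method names, the fixed step size `eta`, the absence of a projection step, and the squared-loss example are all assumptions chosen for illustration.

```python
from typing import List, Tuple


class OnlineGradientDescent:
    """Unprojected online gradient descent with a fixed step size (illustrative)."""

    def __init__(self, dim: int, eta: float) -> None:
        self.h = [0.0] * dim  # current hypothesis, started at the origin
        self.eta = eta

    def hypothesis(self) -> List[float]:
        """Output a hypothesis at each time, before seeing the next data point."""
        return list(self.h)

    def update(self, grad: List[float]) -> None:
        """Take one gradient step on the loss just suffered."""
        self.h = [hi - self.eta * gi for hi, gi in zip(self.h, grad)]


def squared_loss(h: List[float], z: Tuple[List[float], float]) -> float:
    x, y = z
    pred = sum(hi * xi for hi, xi in zip(h, x))
    return (pred - y) ** 2


def squared_loss_grad(h: List[float], z: Tuple[List[float], float]) -> List[float]:
    x, y = z
    pred = sum(hi * xi for hi, xi in zip(h, x))
    return [2.0 * (pred - y) * xi for xi in x]


alg = OnlineGradientDescent(dim=1, eta=0.25)
total_loss = 0.0
for z in [([1.0], 1.0)] * 4:  # the same hypothetical data point, four times
    h_t = alg.hypothesis()
    total_loss += squared_loss(h_t, z)
    alg.update(squared_loss_grad(h_t, z))
```

A caller only needs `hypothesis()` and `update()`, which is the sense in which the algorithm can be treated as a black box.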

Page 22: Low-Cost Learning via Active Data Procurement

22

Our mechanism

At each time t = 1, …, T:

1. post menu: price(z) ~ distribution(ht, z), where ht is Alg’s current hypothesis

Page 23: Low-Cost Learning via Active Data Procurement

Our mechanism

At each time t = 1, …, T:

1. post menu: price(z) ~ distribution(ht, z), where ht is Alg’s current hypothesis

2. agent arrives with data zt and cost ct

● accepts: de-biased data is passed to Alg

● rejects: null data point is passed to Alg
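The loop on this slide could be simulated roughly as below. Several pieces are assumptions and not the authors' stated method: the learner is a plain gradient-descent update, "de-biasing" is done by dividing the gradient by the purchase probability (importance weighting), a rejection is treated as a zero gradient, and the hard budget check is a simplification. The pricing rule is passed in as two hypothetical callables, `draw_price` and `prob_price_at_least`.

```python
import math
import random
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def run_mechanism(
    agents: Sequence[Tuple[float, float]],                 # (data z_t, cost c_t) in arrival order
    grad_loss: Callable[[Vector, float], Vector],          # gradient of loss(h, z) w.r.t. h
    draw_price: Callable[[float, random.Random], float],   # samples price given value of z
    prob_price_at_least: Callable[[float, float], float],  # Pr[price >= x] given value of z
    dim: int,
    eta: float,
    budget: float,
    rng: random.Random,
) -> Tuple[Vector, float]:
    h = [0.0] * dim
    spent = 0.0
    for z, cost in agents:
        grad = grad_loss(h, z)
        # Value of the data point under the current hypothesis (gradient norm).
        value = math.sqrt(sum(g * g for g in grad))
        # The posted price depends on h_t and z only, never on the private cost.
        price = draw_price(value, rng)
        if price >= cost and spent + price <= budget:
            spent += price
            q = prob_price_at_least(value, cost)  # chance this point was bought
            step = [g / q for g in grad]          # assumed de-biasing: importance weight
        else:
            step = [0.0] * dim                    # rejection: null data point
        h = [hi - eta * si for hi, si in zip(h, step)]
    return h, spent


# Degenerate usage: always post price 1 (so every agent accepts) until the budget runs out.
final_h, spent = run_mechanism(
    agents=[(1.0, 0.5)] * 3,
    grad_loss=lambda h, z: [2.0 * (h[0] - z)],  # squared loss (h - z)^2 in one dimension
    draw_price=lambda value, rng: 1.0,
    prob_price_at_least=lambda value, x: 1.0,
    dim=1,
    eta=0.25,
    budget=2.0,
    rng=random.Random(0),
)
```

Note that the hard budget cutoff used here can reintroduce bias once the budget is exhausted; it is included only to keep the sketch self-contained.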

Page 24: Low-Cost Learning via Active Data Procurement

24

Analysis idea: use no-regret setting!

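[Diagram: agents with data z1, z2, … and costs c1, c2, … arrive one at a time; the learner holds a hypothesis h at each step]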

● Propose regret minimization with purchased data

● Prove upper and lower bounds on regret

● low regret ⇒ good prediction on new data (main result)
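The last bullet can be illustrated with the standard online-to-batch argument. This is a sketch under assumptions not stated on the slide: the loss is convex in the hypothesis, the data are i.i.d., and the final hypothesis is the average $\bar h$ of the per-round hypotheses $h_1, \dots, h_T$. Whether the talk uses exactly this conversion is not confirmed by the slides.

```latex
\bar h = \frac{1}{T}\sum_{t=1}^{T} h_t ,
\qquad
\mathbb{E}\,\mathrm{loss}(\bar h)
  \;\le\; \frac{1}{T}\sum_{t=1}^{T} \mathbb{E}\,\mathrm{loss}(h_t, z_t)
  \;\le\; \mathbb{E}\,\mathrm{loss}(h^*) + \frac{\mathbb{E}[\mathrm{Regret}_T]}{T}
```

The first inequality uses convexity (Jensen) together with the fact that $h_t$ is chosen before $z_t$ is drawn; the second is the definition of regret against the fixed hypothesis $h^*$. A regret bound that grows sublinearly in $T$ therefore yields a vanishing excess loss on new data.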

Page 25: Low-Cost Learning via Active Data Procurement

25

Summary

Problem: learn a good hypothesis by buying data from arriving agents

For a variety of learning problems:

$$\mathbb{E}\,\mathrm{loss}(h) \le \mathbb{E}\,\mathrm{loss}(h^*) + O\!\left(\frac{\gamma}{\sqrt{B}}\right)$$

Page 26: Low-Cost Learning via Active Data Procurement

26

Key ideas

1. price data actively based on value

2. machine-learning style bounds

3. transform learning algs to mechanisms

learning alg → mechanism

Page 27: Low-Cost Learning via Active Data Procurement

27

Future work

- Improve bounds (no-regret: gap between lower and upper bounds)

- Propose “universal quantity” to replace γ in bounds (analogue of VC-dimension)

- Variants of the model, better batch mechanisms

- Explore black-box use of learning algs in mechanisms

Page 28: Low-Cost Learning via Active Data Procurement

28

Thanks!

Page 29: Low-Cost Learning via Active Data Procurement

Additional slides

29

Page 30: Low-Cost Learning via Active Data Procurement

What would you do before this work?

Naive 1: post price of 1, obtain B points, run a learner on them.

Naive 2: post lower prices, obtain biased data, do what?

Roth-Schoenebeck (EC 2012): draw prices from a distribution, obtain biased data, de-bias it.

● Batch setting (offer each data point the same price distribution)

● Each agent has a number; the task is to estimate the mean

● Derives price distribution to minimize variance of estimate
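The "obtain biased data, de-bias it" step for mean estimation can be sketched with inverse-probability weighting. This is an illustration under assumptions: prices are drawn uniformly from [0, 1), an agent sells exactly when the price is at least her cost, and the function names are hypothetical. It is not presented as the exact estimator or price distribution from the cited paper.

```python
import random
from typing import List, Sequence, Tuple


def collect(agents: Sequence[Tuple[float, float]], rng: random.Random) -> List[Tuple[float, float]]:
    """Offer each agent a random price; return (number, purchase probability) for each sale."""
    purchases = []
    for x, cost in agents:
        price = rng.random()      # assumed price distribution: uniform on [0, 1)
        if price >= cost:         # assumed acceptance rule
            q = 1.0 - cost        # Pr[price >= cost] under the uniform distribution
            purchases.append((x, q))
    return purchases


def debiased_mean(n_agents: int, purchases: Sequence[Tuple[float, float]]) -> float:
    """Inverse-probability-weighted estimate of the population mean."""
    return sum(x / q for x, q in purchases) / n_agents


# Hypothetical population: agents holding 1 have high cost, agents holding 0 have low cost.
agents = [(float(i % 2), 0.2 + 0.6 * (i % 2)) for i in range(20000)]
purchases = collect(agents, random.Random(0))

naive = sum(x for x, _ in purchases) / len(purchases)  # biased toward cheap data
estimate = debiased_mean(len(agents), purchases)       # corrects for who sold
```

Because costly data points are bought less often, the naive average of purchased numbers underestimates the true mean of 0.5 in this example, while the weighted estimate corrects for the selection.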

Page 31: Low-Cost Learning via Active Data Procurement

31

Related work

this work

Meir, Procaccia, Rosenschein 2012

Cummings, Ligett, Roth, Wu, Ziani 2015

Dekel, Fisher, Procaccia 2008

Ghosh, Ligett, Roth, Schoenebeck 2014

Horel, Ioannidis, Muthukrishnan 2014

Roth, Schoenebeck 2012

Ligett, Roth 2012

Cai, Daskalakis, Papadimitriou 2015

Type of goal:

● ML-style risk bounds

● Minimize variance or related goal

Model:

● can fabricate data (like in peer-prediction)

● principal-agent style, data depends on effort

● agents cannot fabricate data, have costs

Page 32: Low-Cost Learning via Active Data Procurement

32

Simulation results

MNIST dataset -- handwritten digit classification

Brighter green = higher cost

Toy problem: classify (1 or 4) vs (9 or 8)

Page 33: Low-Cost Learning via Active Data Procurement

33

Simulation results

● T = 8503

● train on half, test on half

● Alg: Online Gradient Descent

Naive: pay 1 until budget is exhausted, then run alg

Baseline: run alg on all data points (no budget)

Large γ: bad correlations

Small γ: independent cost/data

Page 34: Low-Cost Learning via Active Data Procurement

“value” and pricing distribution?

● Value of data = size of gradient of loss, rather than size of loss (“how much you learn from the loss”)

● Pricing distribution:

$$\Pr[\,\text{price} \ge x\,] = \frac{\|\nabla \mathrm{loss}(h_t, z_t)\|}{K\sqrt{x}}$$

● K = normalization constant proportional to $\gamma = \frac{1}{T}\sum_t \|\nabla \mathrm{loss}(h_t, z_t)\|\,\sqrt{c_t}$ (assume approximate knowledge of K; in practice, can estimate it online)

● Distribution is derived by optimizing regret bound of mechanism for “at-cost” variant of no-regret setting
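A price with this kind of tail can be sampled by inverse transform. The sketch below assumes the tail probability has the form min{1, ‖∇loss(ht, zt)‖ / (K·√x)}; the square root and the cap at 1 are assumptions about the formula, and `tail_prob` and `draw_price` are hypothetical names. Prices are capped at 1 because costs lie in [0,1].

```python
import math
import random


def tail_prob(value: float, K: float, x: float) -> float:
    """Pr[price >= x] under the assumed rule min(1, value / (K * sqrt(x)))."""
    if x <= 0.0:
        return 1.0
    return min(1.0, value / (K * math.sqrt(x)))


def draw_price(value: float, K: float, rng: random.Random) -> float:
    """Sample a price whose tail matches tail_prob for thresholds x in (0, 1].

    With u uniform on (0, 1], the quantity (value / (K * u))**2 is at least x
    exactly when u <= value / (K * sqrt(x)), which gives the desired tail.
    """
    u = 1.0 - rng.random()  # uniform on (0, 1], avoids division by zero
    return min(1.0, (value / (K * u)) ** 2)


# Empirical check with a hypothetical gradient norm 0.5 and constant K = 2.
rng = random.Random(1)
draws = [draw_price(0.5, 2.0, rng) for _ in range(20000)]
frac = sum(1 for p in draws if p >= 0.25) / len(draws)
```

Data points with a larger gradient norm get stochastically higher prices, so they are more likely to be bought, which matches the "price data actively based on value" idea.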

Page 35: Low-Cost Learning via Active Data Procurement

35

Pricing distribution