Top Banner
Read Montague Baylor College of Medicine Houston, TX www.hnl.bcm.tmc.edu Reward Processing and Social Exchange
27

Read Montague Baylor College of Medicine Houston, TX Reward Processing and Social Exchange.

Jan 20, 2016

Download

Documents

Todd Charles
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Read MontagueBaylor College of Medicine

Houston, TXwww.hnl.bcm.tmc.edu

Reward Processing and Social Exchange

Page 2: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

I love you

Let’s do lunch

BA

A BAB

Page 3: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Natural visual statistics

Specific algorithms evolved to solve efficiently the array of problems of early

vision

Page 4: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Recharge or die

The real world imposes another difficult requirement on all mobile organisms

Page 5: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Natural reward-harvesting statistics?

genericwoodlandcreature

Page 6: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Models of reinforcement learning can provide insight

What computations should we expect?

Reward harvesting is an economic problem involving both valuation and choice

Page 7: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Sustain goal

Select goal

Pursue goal

Goal-directed choice

Actual experience Counterfactual experience‘what could have been’

guidancesignals

Page 8: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Midbrain dopamine neurons

burst

pause

R timenaive

R timeAfter learning

timeAfter learning(catch trial)

Pause, burst, and ‘no change’ responses represent reward prediction errors – ongoing emission of information

ERROR SIGNAL = current reward + next prediction - current prediction

Page 9: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Can we detect a reward prediction error signal in a human subject?

Page 10: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Neural correlates of these error signals have been observed in conditioning and decision

tasks

Passive conditioning tasks

Instrumental conditioning tasks

Sequential decision tasks

Social and economic exchange tasks

Page 11: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Midbrain dopamine neurons

burst and pause responses encode reward prediction errors

ERROR SIGNAL = current reward + next prediction - current prediction

Can these systems be re-deployed for abstractly defined rewards (ideas)?

“I want to solve Fermat’s last theorem”

Page 12: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

What about the something more abstract like the expression and

repayment of trust?

Page 13: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Trust

Modeling

Must involve risk (uncertainty)

Harvesting returns from another agent

Page 14: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

TRUST

I love you

Let’s do lunch

BA

A BAB

Page 15: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Simplifying and quantifying Trust(Berg et al., 1995; Weigelt and Camerer, 1988)

Trust is the amount of money a sender sends to a receiver without external enforcement.

Agent 2Agent 1

I

R

Page 16: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

X 3

Investor Trustee

$20

A dynamic version of the Trust game (10 rounds)

Page 17: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

investperiod

8 s 8 s10 s 10 s 10 s8 s8 s4 s4 s

cue to invest

repaymentrevealed toboth brains

Gave

13

Kept

29

investmentrevealed toboth brains

Kept Gave

6 14

repayperiod

cue to repay

delayperiod

delayperiod

delayperiod

delayperiod

investment phase repayment phase

Totals

2919

totalsrevealed toboth brains

inter-rounddelay period

investperiod

8 s 8 s10 s 10 s 10 s8 s8 s4 s4 s

cue to invest

repaymentrevealed toboth brains

Gave

13

Kept

29

Gave

13

Kept

29

investmentrevealed toboth brains

Kept Gave

6 14

Kept Gave

6 14

repayperiod

cue to repay

delayperiod

delayperiod

delayperiod

delayperiod

investment phase repayment phase

Totals

2919

Totals

2919

totalsrevealed toboth brains

inter-rounddelay period

Structure of a round

Page 18: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

What is the behavioral signal that most strongly

influences changes in trust (money sent) ?

Agent 2Agent 1

I

R

Reciprocity = TIT-FOR-TAT

Page 19: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Reciprocity = TIT-FOR-TAT

mon

ey s

en

tto

part

ner

+Benevolent

signal

-Malevolent

signal

Neutralsignal

Page 20: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Questions about brain response

Responses that differentiate benevolent from neutral?

Responses that differentiate malevolent from neutral?

Responses that differentiate benevolent from malevolent?

Surprise response

Page 21: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

*

time (sec)-8 0 10

-0.2

0

0.2

Trustee Brain:

‘intention to increase trust’ shifts with reputation building

Reputation develops

Trustee will increase trust on next move

Trustee will decreasetrust on next move

submit reveal*

increases or decreasesin future trust by trustee

-0.2

0

0.2

time (sec)

Signal is nowanticipating the

outcome

Page 22: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Temporal shift resembles value transfer in reward learning experiments

burst

pause

R timenaive

R timeAfter learning

timeAfter learning(catch trial)

Page 23: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

-1

-.9

-.8

-.7

-.6

-.5

-.4

-.3

-.2

-.1

0

-1 -.8 -.6 -.4 -.2 0 .2 .4 .6 .8 1

r = .00TR

US

TE

E

decr

ease

s re

paym

en

t

Trustee decreases No information

0

.1

.2

.3

.4

.5

.6

.7

.8

.9

1

-1 -.8 -.6 -.4 -.2 0 .2 .4 .6 .8 1

r = .27

Change in next investment

TR

US

TE

E in

crease

s re

paym

en

t

Trustee decreases positive correlation

Why should intentions to increase repayment burn more energy?

Change in next investment

Page 24: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

1. Social trust is about modeling others - always some underlying currency.

2. Re-deploy reward-harvesting machinery that we share with all vertebrates (abstractions gain reward status)

3. Use to probe pathologies (addicted state, autism spectrum disorder, borderline…)

Trust?

Page 25: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

Baylor College of MedicinePearl Chiu

Amin KayaliBrooks King-Casas

Terry LohrenzSam McClure (Princeton)

Read MontagueDamon Tomlin

CaltechCedric Anen

Colin CamererSteve Quartz

UCLPeter Dayan

Nathaniel Daw

Salk InstituteTerry Sejnowski

Princeton UniversityJon Cohen www.hnl.bcm.tmc.edu/trust

Emory UniversityGreg Berns

University of AlabamaLaura KlingerMark Klinger

Families of autistic subjects

Funding SourcesNIDANIMH

Dana FoundationKane Family Foundation

Angel Williamson Imaging CenterInstitute for Advanced Study, Princeton NJ

Collaborators

Page 26: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

rounds

.1

.2

.3

.4

3 4 5 6 7 8 9 10

fractionof

highlyaccurateguesses

by trustee

Investorevents

Trusteeevents

How do we know a reputation is forming?

8 s 10 s4 s. . .

investperiod

cue to invest

investmentrevealed toboth brains

delayperiod

guessperiod

cue to guess

Investment phase

synchronized onsubmission time of both

partners

Trustee

Page 27: Read Montague Baylor College of Medicine Houston, TX  Reward Processing and Social Exchange.

What is Fair?

i

i

i

Split the profit

Split ‘loaf’Split the total 10 + i each

i

Investor Trustee

20-i

collective ownership? common goods?