Top Banner
Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 – Lecture 1 June 23, 2014
39

Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Oct 19, 2019

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Lecture 1Intro to Spatial and Temporal Processes

Dennis SunStats 253

June 23, 2014

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 2: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Outline of Lecture

1 What is Spatial and Temporal Data?Spatial DataTemporal DataDiscussion

2 Course Logistics

3 Linear Regression

4 Autoregressions

5 Recap

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 3: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data?

Where are we?

1 What is Spatial and Temporal Data?Spatial DataTemporal DataDiscussion

2 Course Logistics

3 Linear Regression

4 Autoregressions

5 Recap

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 4: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

Geostatistics

South African Witwatersrand Gold Reef (grams per ton)

●●

● 10

20

30

40

50

60

70

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 5: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

Geostatistics

South African Witwatersrand Gold Reef (grams per ton)

●●

● 10

20

30

40

50

60

70

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 6: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

Geostatistics

South African Witwatersrand Gold Reef (grams per ton)

●●

10

20

30

40

50

60

70

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 7: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

Geostatistics

South African Witwatersrand Gold Reef (grams per ton)

10

20

30

40

50

60

70

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 8: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

Lattice (Areal) Data

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 9: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

Point Processes

John Snow: 1854 Broad Street Cholera Outbreak

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 10: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

Point Processes

John Snow: 1854 Broad Street Cholera Outbreak

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 11: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Spatial Data

The Division of Spatial Statistics

Cressie (1993) organizes spatial statistics into these three categories.

I Geostatistics: continuous space, labeled observations, goal isprediction

II Lattice (areal) data: discrete space, labeled observations, goal isinference

III Point processes: continuous space, unlabeled observations, goal isinference

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 12: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Temporal Data

Time Series Example 1

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2−0.06

−0.04

−0.02

0

0.02

0.04

0.06

0.08

Time (seconds)

Human Speech

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 13: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Temporal Data

Time Series Example 2

Google Flu Trends

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 14: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Discussion

What do space and time have in common?

The observations yt (or ys) are correlated:

Cov(yt, yt′) 6= 0 for t 6= t′

Compare with the first assumption you see in most statistics courses:

Let yi be i.i.d....

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 15: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

What is Spatial and Temporal Data? Discussion

How are they different?

Time data is ordered, whereas there is no clear ordering for spatial data.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 16: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Course Logistics

Where are we?

1 What is Spatial and Temporal Data?Spatial DataTemporal DataDiscussion

2 Course Logistics

3 Linear Regression

4 Autoregressions

5 Recap

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 17: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Course Logistics

Organization

• Lectures: Mondays and Wednesdays 2:15-3:30pm in Education 334

• Instructors:• Dennis Sun• Edgar Dobriban• Jingshu Wang

• Contact? Office Hours? Sections?

• All information can be found on the course website:stats253.stanford.edu.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 18: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Course Logistics

Content

• This is a new course.

• The material that we will be covering has not really beensynthesized—because it is at the frontiers of statistics!

• We will be loosely following the books• Shumway and Stoffer. Time Series Analysis and Applications (with R

Applications).• Sherman. Spatial Statistics and Spatio-Temporal Data.

• You don’t have to purchase these books: they are available for freefor Stanford students. (Link on course website.)

• Other useful references:• Bivand et al. Applied Spatial Data Analysis with R. (also available free)• Cressie and Wikle. Statistics for Spatio-Temporal Data.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 19: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Course Logistics

Homeworks

• There will be about 4 short homeworks. Each homework will be acase study (data analysis).

• We will provide support for R, but you are free to use any computingenvironment (e.g., Python, Matlab, C....)

• You may work in pairs. If you do this, please turn in only one copywith both of your names on the front page.

• They will be graded on effort and completion only.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 20: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Course Logistics

Project

• The goal of this course is to make a small but useful contribution tothe world: e.g., a conference publication, open-source code, etc.

• This may sound intimidating, but there’s actually a lot of low-hangingfruit in this subject!

• Grading rubric: produce something useful ⇒ A+.

• I have no qualms about giving everyone an A+ if you all earn it!

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 21: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Course Logistics

Course Requirements

• The grade will be based on the final project.

• If taking this class CR/NC, the final project is optional, but you arerequired to complete all homeworks.

• Please enroll in this class if you are able: I promise that you will getmuch more out of it! The homeworks will be short and instructional.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 22: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Linear Regression

Where are we?

1 What is Spatial and Temporal Data?Spatial DataTemporal DataDiscussion

2 Course Logistics

3 Linear Regression

4 Autoregressions

5 Recap

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 23: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Linear Regression

Review of Linear Regression

Model: yi = β1x1i + ...+ βpxpi + εi, εiiid∼ N(0, σ2)

• Ordinary least squares: choose β1, ..., βp by solving

Estimation: argminβ1,...,βp

n∑i=1

(yi − (β1x1i + ...+ βpixpi))2.

• Write in vector notation as:

Model: y = Xβ + ε, ε ∼ N(0, σ2I)

Estimation: β = argminβ

||y −Xβ||2.

• Solve by differentiating: β must satisfy

2XT (y −Xβ) = 0 ⇒ β = (XTX)−1XTy.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 24: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Linear Regression

Regression: A Geometric Perspective

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 25: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Linear Regression

Regression: A Geometric Perspective

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 26: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Linear Regression

Regression: A Geometric Perspective

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 27: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Linear Regression

Properties of the Estimator

Recall that the model is

y = Xβ + ε, ε ∼ N(0, σ2I).

Is the estimator β any good for estimating β?

β = (XTX)−1XTy = β + (XTX)−1XT ε

E(β) = β

Var(β) = σ2(XTX)−1

It is the best linear unbiased predictor (i.e., with the smallest variance).

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 28: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Where are we?

1 What is Spatial and Temporal Data?Spatial DataTemporal DataDiscussion

2 Course Logistics

3 Linear Regression

4 Autoregressions

5 Recap

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 29: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Introducing Dependence

• In linear regression:

Cov(yi, yj) = Cov(xTi β + εi,xTj β + εj) = Cov(εi, εj) = 0

• How can we introduce dependence?

yt = xTt β + φyt−1 + εt

• How do we estimate φ?

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 30: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Autoregression

• Idea: Write asy2...yn

︸ ︷︷ ︸y

=

y1

— X2:n —...

yn−1

︸ ︷︷ ︸

X

βφ

+ ε

• Regress y on X to obtain estimates of β and φ.

• Hence, we call this an auto - regressive (AR) model, meaning“regress on itself.”

• Does this method “work”?

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 31: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Does it work?

Let’s set X ≡ 0 for now. So the model is

yt = φyt−1 + εt

The least squares estimate is

φ = (yT1:(n−1)y1:(n−1))−1yT1:(n−1)y2:n

• Is it true that E(φ) = φ? No.

• Is it true that Var(φ) = σ2(yT1:(n−1)y1:(n−1))−1? No.

• However, it turns out that φ is consistent for φ.

φp→ φ as n→∞.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 32: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Simulation Example

• Suppose we observe 1000 observations of a random walk:

yt = yt−1 + εt, εt ∼ N(0, 1)

(In this case, φ = 1.)

• Calculate φ by regression.

• R Code:

phi <- sapply(1:10000, function(iter) {

z <- cumsum(rnorm(1000))

x <- z[1:999]

y <- z[2:1000]

return(sum(x*y)/sum(x*x))

})

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 33: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Simulation Example

hist(phi, xlim=c(.9,1.1))

abline(v=mean(phi), col=’red’, lty=2)

Histogram of phi

phi

Frequency

0.90 0.95 1.00 1.05 1.10

0500

1000

1500

2000

2500

3000

3500

sd(phi) = .003Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 34: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Simulation Example

• The true standard error of φ is .003.• Does this agree with what linear regression would tell us?• R Code:

z <- cumsum(rnorm(1000))

x <- z[1:999]

y <- z[2:1000]

model <- lm(y~x-1)

summary(model)

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 35: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Simulation Example

• The true standard error of φ is .003.• Does this agree with what linear regression would tell us?• R Code:

z <- cumsum(rnorm(1000))

x <- z[1:999]

y <- z[2:1000]

model <- lm(y~x-1)

summary(model)

GoodDennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 36: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Simulation Example

• The true standard error of φ is .003.• Does this agree with what linear regression would tell us?• R Code:

z <- cumsum(rnorm(1000))

x <- z[1:999]

y <- z[2:1000]

model <- lm(y~x-1)

summary(model)

BadGoodDennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 37: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Autoregressions

Conclusions

• Linear regression gives good estimates for the coefficients of an ARprocess.

• However, it tends to underestimate the error (when observations arepositively correlated).

• This will make effects look more significant than they really are!

• How can we fix this? Next lecture.

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 38: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Recap

Where are we?

1 What is Spatial and Temporal Data?Spatial DataTemporal DataDiscussion

2 Course Logistics

3 Linear Regression

4 Autoregressions

5 Recap

Dennis Sun Stats 253 – Lecture 1 June 23, 2014

Page 39: Lecture 1 Intro to Spatial and Temporal Processes · Lecture 1 Intro to Spatial and Temporal Processes Dennis Sun Stats 253 June 23, 2014 Dennis Sun Stats 253 { Lecture 1 June 23,

Recap

What We’ve Learned

• The similarities and differences between spatial and temporal data.

• Linear regression

• AR processes: the simplest model for correlated data

• The advantages and shortfalls of using regression to estimateparameters in AR processes.

stats253.stanford.edu

Dennis Sun Stats 253 – Lecture 1 June 23, 2014