Shuming Bao Shuming Bao [email protected][email protected]China Data Center University of Michigan Aug 23-25, 2001 Fudan University Integrating Spatial Integrating Spatial Attribute Data and CHGIS Attribute Data and CHGIS for Spatial Analysis for Spatial Analysis
29
Embed
Shuming Bao [email protected] China Data Center University of Michigan Aug 23-25, 2001 Fudan University Integrating Spatial Attribute Data and CHGIS for.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
•Socioeconomic data– County/Province statistics– Census data– Social surveys
Spatial Data Sources:Spatial Data Sources:
•Geographic data (polygons, points and lines)•Arc/Info data•Shape files (*.shp, *.shx, and *.dbf)•Grid•Image data (ERDAS Image, JPEG, TIFF, BMP and Arc/Info Image)•Tabular data (dBASE, INFO and TEXT)•SQL•SDE (Spatial Data Engine)
Types of Spatial DataTypes of Spatial Data
Sample of Spatial Data
Elevation and Major Cities of ChinaElevation and Major Cities of China
The Integration of HGIS data with The Integration of HGIS data with other dataother data
Historical GIS – Boundaries – Settlements
Local attributes– Climate– Culture– Education– Languages– Agriculture– Business
Geographical data – River– Roads– Elevation
Remotely sensed data– Images– Grid
Statistical data–Socioeconomic data–Survey data–Census data
The Integration of HGIS data with The Integration of HGIS data with other data (b)other data (b)
Integration of Data: Spatial OperationsIntegration of Data: Spatial Operations
Is there any spatial cluster over space?Are spatial observations distributed randomly over space?Are spatial observations correlated ?Is there any spatial outlier?Is there any spatial trend?What is the interaction (statistically and theoretically) between different factors?How to predict an unknown spatial value at a specific location ?
QuestionsQuestions
Why spatial data is different from non-spatialWhy spatial data is different from non-spatial data ? (spatial neighborhood)
Statistical property for spatial data:Statistical property for spatial data: Spatial dependence (autocorrelation) Heterogeneity Spatial trend (non-stationarity)
Sensitive to spatial boundaries and spatial unitSensitive to spatial boundaries and spatial unit (Country, County, Tract) Lat / Long grid
Why Spatial is Special ?Why Spatial is Special ?
Spatial AnalysisSpatial Analysis
•Tests on spatial patterns:Tests on spatial non-stationarityTests on spatial autocorrelation Tests on Spatial stationarity and non-stationarity
•Data-driven approaches (Exploratory Spatial Data Analysis)
Global StatisticsLocal statistics
•Model-driven approachesSpatial linear and non-linear modelsSpace-temporal models
Visualization of Spatial DataVisualization of Spatial Data
Criteria: theoretical and empirical•Accessibility (roads, rivers, railways, airlines and Internet)•Economic linkage (commuter flows, migrations, trade
Adjacency criterion: 1 if location j is adjacent to i,wij = 0 if location j is not adjacent to i. Distance criterion: 1 if location j is within distance d from i,wij (d) = 0 otherwise. A general spatial distance weight matrices: wij (d) = dij
Moran I (Z value) is • positive: observations tend to be similar;• negative: observations tend to be dissimilar;• approximately zero: observations are arranged randomly over space. Geary C:• large C value (>>1): observations tend to be dissimilar;• small C value (<<1) indicates that they tend to be similar.
Moran I:
C d n w w x x x xijj
n
i
n
ij i jj
n
i
n
ii
n
( ) ( ) ( ){ ( ) ( ) }
1 2 2 2
Geary C:
Identifying Global Pattern Identifying Global Pattern of Spatial Distributionof Spatial Distribution
Local Moran: I d w Zi ij jj i
n
( )
Local Geary: C d w Z Zi ij i jj i
n
( ) ( ) 2
• significant and negative if location i is associated with relatively low values in surrounding locations;• significant and positive if location i is associated with relatively high values of the surrounding locations.
• significant and small Local Geary (t<0) suggests a positive spatial association (similarity);• significant and large Local Geary (t>0) suggests a negative spatial association
(dissimilarity).
Identifying Local Patterns of Identifying Local Patterns of Spatial DistributionSpatial Distribution
Identifying Factors for Spatial Identifying Factors for Spatial ChangesChanges
Spatially autoregressive model
Spatial moving average model
Semi-parametric model
Kriging
Y = WY +
where y is an observed variable over space D: {Y(si): si D, i=1?n },
W is a spatial weight matrix (nxn), is the spatial autoregressive parameter, and ~ N(0, 2).
OLSOLS estimates are biased and inconsistent: estimates are biased and inconsistent:
^
( )' ( ) ( )' ( )' ( ) ( )'
Wy Wy Wy y Wy Wy Wy1 1
E( )^
A Simple Spatial Autoregressive A Simple Spatial Autoregressive
ModelModel
A General Form of Spatial A General Form of Spatial
Process ModelProcess Model
where W1 and W2 are spatial weight matrices, ~ N(0,).
y W y X 1
W 2
Historical studies Socioeconomic development Environment Religion Anthropology studies Population studies Minority studies….
ApplicationsApplications
GIS Systems
Topological information
Tables
Statistical Systems
•Spatial Statistics
•Spatial modelsAttribute data
Analytical results
Statistical reports
Statistic graphics
Charts
GIS Maps
Spatial weights
Integration of Spatial Analysis with Integration of Spatial Analysis with HGISHGIS
•An enhanced version of S language specially for An enhanced version of S language specially for exploratory data analysis and statistics.exploratory data analysis and statistics.
•An integrated suite for data manipulation, data An integrated suite for data manipulation, data analysis and graphical display.analysis and graphical display.
•An interpretedAn interpreted language, in which individual language language, in which individual language expressions are read and then immediately executed.expressions are read and then immediately executed.
•Object-oriented programming(method, class, and object).Object-oriented programming(method, class, and object).
•S+SpatialStatsS+SpatialStats for geostatistical data, polygon data for geostatistical data, polygon data and point data (2000+ analytical functions).and point data (2000+ analytical functions).
S-PLUS for ArcView GISS-PLUS for ArcView GIShttp://www.mathsoft.com
China Data
Attribute data:
GIS map data:
Application Interface
ArcView GIS
S-PLUS/SpatialStats
Maps
Analysis
Reports
Statistical Graphics
S-PLUS for ArcViewS-PLUS for ArcView
Spatial data process (missing data, fuzzy data, large Spatial data process (missing data, fuzzy data, large volume of data, space-time data structure, references)volume of data, space-time data structure, references)
Spatial data sharing and management (Metadata, GIS Spatial data sharing and management (Metadata, GIS data, attribute data; distributed centers; update, search, data, attribute data; distributed centers; update, search, online analysis)online analysis)
Integration of CHGIS with natural and social informationIntegration of CHGIS with natural and social information
Development of new methodology and tools for spatial Development of new methodology and tools for spatial data analysis (sampling, survey, clustering, data analysis (sampling, survey, clustering, autocorrelation, association, modeling, simulation, web autocorrelation, association, modeling, simulation, web tools)tools)
Applications of GIS, database, and new technology in Applications of GIS, database, and new technology in historical and other studieshistorical and other studies