Top Banner
Distributed Geometric Data Structures Philip Levis Stanford Platform Lab Review Feb 9, 2017
31

Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Sep 29, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Distributed Geometric Data Structures

Philip LevisStanford Platform Lab Review

Feb 9, 2017

Page 2: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Big Control

Page 3: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

The Physical World

• Big control applications collect data on, and take action in, the physical world▶ There will be a lot of data: they need distributed data structures

to store, query, and compute on it

• Big control applications have high locality (literally)▶ Physical world data is geometric (2D, 3D) in nature; it has much

more complex data inter-dependencies than key-value stores

• Need new, distributed geometric data structures

Page 4: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Outline

• Example big control applications

• Geometric data structures

• Distributing geometric data structures

Page 5: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Outline

• Example big control applications

• Geometric data structures

• Distributing geometric data structures

Page 6: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Disaster Response

Page 7: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:
Page 8: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:
Page 9: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:
Page 10: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Beverages

Child

Page 11: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Data Requirements

• Altitude produces different data resolutions▶ Dynamically changing in response to application actions

• Data changes/decays over time: 4D

• Grid-based (e.g., temperature, pixels) as well as point-based data (people, objects, landmarks)

Page 12: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Outline

• Example big control applications

• Geometric data structures

• Distributing geometric data structures

Page 13: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Two Basic Approaches

Bounding VolumeHierarchy (BVH)

Spatial subdivision

BA

C

D

X

YZ

geometry

X

A B

Y

C D

Z

data structure

A B

C DF

EG

geometry

D E F G

Z

data structure

A B C Y

Page 14: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Two Basic Approaches

Bounding VolumeHierarchy (BVH)

Spatial subdivision

BA

C

D

X

YZ

geometry

X

A B

Y

C D

Z

data structure

A B

C DF

EG

geometry

D E F G

Z

data structure

A B C Y

Page 15: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Spatial subdivision

• Many variants: quad/oct-trees, kd-trees, binary space partitioning

• Oct-tree: subdivide each axis evenly

Page 16: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem with Oct-trees

Sparse, pointer structure: low locality, cache-poor

Page 17: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Another Problem0

1

2

3

4

5

6

7

8

9

10

Level

1,000-foldresolutionincrease

30-fold altitudeincrease

Many levels for large variations in resolution

Page 18: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

VDB (Museth, ACM TOG 2013 Vol 32, 3:27)

• Hierarchical data structure for the efficient representation of sparse, time-varying volumetric data discretized on a 3D grid

25

2423 = 212

4096-fold

Page 19: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

VDB (Museth, ACM TOG 2013 Vol 32, 3:27)

• Hierarchical data structure for the efficient representation of sparse, time-varying volumetric data discretized on a 3D grid

25

2423 = 212

4096-fold

Single host

Page 20: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Outline

• Example big control applications

• Geometric data structures

• Distributing geometric data structures

Page 21: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

?

Page 22: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem 1: Distribution

• Where should the system place each tree node?

• Option 1: Random/spray placement▶ Improves potential read bandwidth▶ Balances load easily

• Option 2: Locality-based placement▶ Better when computations pushed to data (granular computing)▶ Load balancing is an open problem▶ Strawman: balance data size (assumes uniform computation)

Page 23: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem 2: Not So Simple

• Staggered (MAC) grids store data on faces as well as in cells: important to represent flow

cijfij

fij

Page 24: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem 2: Not So Simple

• Staggered (MAC) grids store data on faces as well as in cells: important to represent flow

fij - wind

cij -temperature, burn state, etc.

Page 25: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem 2: Not So Simple

• Staggered (MAC) grids store data on faces as well as in cells: important to represent flow

fij - movement of people

cij - # of people

Page 26: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem 2: Not So Simple

• Cell and face values should have locality

• Distributed grids require replication (“ghost cells”)

• Minimizing surface area of volumesminimizes communication, butcomplicates load balancing cijfij

fij

Page 27: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem 3: Dynamic Updates

• Applications will dynamically subdivide and coarsen the data structure

• Operations may trigger load rebalancing: need to mask latency from application (asynchrony/replication)

Page 28: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Problem 4: Time• Big control applications require being able to look

backwards in time▶ Where did those people needing rescue go?▶ Where did the fire jump the fire break?▶ What is traffic downtown like in 15 minutes (at 5:30PM)?

• Complicates load balancing: historical data should be close to current data

Page 29: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Current Status

• Understanding bottlenecks/performance issues requires workloads (computations, hierarchy structure)▶ Have implemented multi-resolution FLIP simulation▶ Next step: simulator for drone exploration

• Integrating replication/ghost cells for distribution

Page 30: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Conclusion

• Big control applications use geometric data structures

• Dynamically distributing these data structures is an open problem

• We’re starting with space partitioning (have some prior results on BVHs)

Chinmayee Shah

Page 31: Distributed Geometric Data Structures Levis.pdfThe Physical World • Big control applications collect data on, and take action in, the physical world There will be a lot of data:

Hilbert Helix