Page 1
Emerging Dynamic Distributed Systems
and Challenges for Advanced Services
Engineering
Hong-Linh Truong
Distributed Systems Group,
Vienna University of Technology
[email protected] http://www.infosys.tuwien.ac.at/staff/truong
1 ASE WS 2012
Advanced Services Engineering,
WS 2012
Page 2
Outline
Today‘s Internet Computing
Some emerging models – properties and issues
Data provisioning models
Computational infrastructures/frameworks
provisioning
Human computation provisioning
Internet-scale service engineering
Single service/platform engineering
ASE WS 2012 2
Page 3
Today‘s Internet Computing
Internet infrastructure and software connect
contents, things, and people, each has different
roles (computation, sensing, analytics, etc.)
ASE WS 2012 3
People Software
Things
Size does matter
Large-scale interactions
Big data generated
Big quantities to be managed
Hard to control quality
Any * access behaviour does matter
Unpredictable workload
Scalability
Economic factors do matter
On-demand, pay-as-you-go
Complex contract
Internet infrastructure and
software
Page 4
Today‘s Internet Computing
ASE WS 2012 4
Social computing
Service Computing
Distributed Computing
Peer-to-Peer
Computing
Cloud Computing
converge
People Software
Things Emerging forms of
computing
models, systems
and applications introduces
Technologies and
computing models
Page 5
WHICH EMERGING FORMS OF
COMPUTING MODELS,
SYSTEMS AND APPLICATIONS
DO YOU SEE?
Discussion time:
ASE WS 2012 5
Page 6
Some emerging data provisioning
models (1)
ASE WS 2012 6
• Satellites and environmental/city sensor networks (e.g., from specific orgs/countries)
• Machine-to-machine (e.g., from companies)
• Social media (e.g., from people + platform providers)
Large (near-) realtime
data
• Open science and engineering data sets
• Open government data Open data
• Statistics and business data
• Commercial data in general
Marketable data
Data are assets
Page 7
Some emerging data provisioning
models (2)
ASE WS 2012 7
Social
Platforms
Things
Environtments
Infrastructures
....
Data/Service Platforms
APPs Data
Storage
Data Profiling
and Enrichment
Data
Analytics Data
Query
...
A lot A few A lot
Page 8
Examples of large-scale (near-)
realtime data
ASE WS 2012 8
Page 9
Large-scale (near-)realtime data:
properties and issues
Some properties
Having massive data
Requiring large-scale, big
(near-) real time
processing and storing
capabilities
Enabling predictive and
realtime data analytics
Some issues
Timely analytics
Performance and
scalability
Quality control
Handle of unknown data
patterns
Benefit/cost versus
quality tradeoffs
ASE WS 2012 9
Page 10
Example of open data
ASE WS 2012 10
Page 11
Open data: properties and issues
Some properties
Having large, multiple
data sources but mainly
static data
Having good quality
control in many cases
Usually providing the
data as a whole set
Some issues
Fine-grained content
search
Balance between
processing cost and
performance
ASE WS 2012 11
Page 12
Marketable data examples
ASE WS 2012 12
Page 13
Marketable data: properties and
issues
Some properties
Can be large, multiple
data sources but mainly
static data
Having good quality
control
Have strong data contract
terms
Some do not offer the
whole dataset
Some issues
Multiple levels of
service/data contracts
Compatible with other
data sources w.r.t.
contract
Cost w.r.t. up-to-date
data
ASE WS 2012 13
Page 14
Emerging computational
infrastructure/platform provisioning
models
Infrastructure-as-a-Service
Machine-as-a service
Storage as a Service
Database as a Service
Platform-as-a-Service
Middleware
Computational frameworks
Software Defined Networking
ASE WS 2012 14
Page 15
Examples of Infrastructure-as-a-
Service
ASE WS 2012 15
Source: Hong Linh Truong, Schahram Dustdar: Cloud computing for small research groups in computational science and engineering: current
status and outlook. Computing 91(1): 75-91 (2011)
And more MongoLab
Amazon S3
OKEANOS
Microsoft Aruze
Page 16
Examples of Platform-as-a-Service
ASE WS 2012 16
Source: Hong Linh Truong, Schahram Dustdar: Cloud computing for small research groups in computational science and engineering:
current status and outlook. Computing 91(1): 75-91 (2011)
And more Amazon Elastic MapReduce
StormMQ Globus Online (GO)
Page 17
SOCloud WS 2011 17
Examples of multiple clouds
aaa
Source: Katarzyna Keahey, Mauricio Tsugawa, Andrea Matsunaga, and Jose Fortes. 2009. Sky Computing. IEEE Internet
Computing 13, 5 (September 2009), 43-51. DOI=10.1109/MIC.2009.94 http://dx.doi.org/10.1109/MIC.2009.94
Page 18
Emerging computational
infrastructure/platform provisioning
models– properties and issues
Some properties
Rich types of services
from multiple providers
Better choices in terms of
functions and costs
Concepts are similar but
diverse APIs
Strong
dependencies/tight
ecosystems
Some issues
On-demand information
management from
multiple sources
APIs complexity
Cross-vendor integration
Data locality
ASE WS 2012 18
Page 19
Emerging human computation
models
Crowdsourcing platforms
(Anonymous) people computing capabilities exploited
via task bids
Individual Compute Unit
An individual is treated like „a processor“ or “functional
unit“. A service can wrap human capabilities to support
the communication and coordination of tasks
Social Compute Unit
A set of people and software that are initiated and
provisioned as a service for solving tasks
ASE WS 2012 19
The main point: humans are a computing element
Page 20
Examples of human computation
(1)
ASE WS 2012 20
Source: Salman Ahmad, Alexis Battle, Zahan Malkani, Sepandar D. Kamvar: The jabberwocky programming environment for structured
social computing. UIST 2011: 53-64
Page 21
Examples of human computation
(2)
ASE WS 2012 21
Source: Daniel W. Barowy, Charlie Curtsinger, Emery D. Berger, Andrew McGregor: AutoMan: a platform for integrating human-based
and digital computation. OOPSLA 2012: 639-654
Page 22
Examples of human computation
(3)
ASE WS 2012 22
Source: Muhammad Z.C. Candra, Rostyslav Zabolotnyi, Hong-Linh Truong, and Schahram Dustdar, Virtualizing Software and Human for
Elastic Hybrid Services, Web Services Handbook, (c)Springer-Verlag, 2012.
Page 23
Human computation models –
properties and issues
Some properties
Huge number of people
Capabilities might not
know in advance
Simple coordination
models
Some issues
Quality control
Reliability assurance
Proactive, on-demand
acquisition
Incentive strategies
ASE WS 2012 23
Page 24
Summary of emerging models wrt
advanced service-based systems
ASE WS 2012 24
People Software
Things
Engineering advanced service-
based systems
utilize/consist of
Emerging data provisioning models
Emerging computational infrastructure/platform provisioning models
Emerging human computation
models
Emerging data provisioning
models
Emerging data provisioning models
Page 25
WHERE ARE
OPPORTUNITIES?
DO I NEED TO TAKE
OPPORTUNITIES? WHY?
Discussion time:
ASE WS 2012 25
Page 26
Recall our motivating example (1)
ASE WS 2012 26
Equipment Operation and Maintenance
Civil protection
Building Operation Optimization
Cities, e.g. including:
10000+ buildings
1000000+ sensors
Near realtime analytics
Predictive data
analytics
Visual Analytics
Enterprise
Resource
Planning
Emergency
Management
Internet/public cloud
boundary
Organization-specific
boundary
Tracking/Log
istics
Infrastructure
Monitoring
Infrastructure/Internet of Things
...
Can we combine open government data
with building monitoring data?
Page 27
Recall our motivating
example (2)
ASE WS 2012 27
A lot of input data (L0):
~2.7 TB per day
A lot of results (L1, L2): e.g., L1 has ~140 MB per
day for a grid of
1kmx1km
Soil
moisture
analysis for
Sentinel-1
Michael Hornacek,Wolfgang Wagner, Daniel Sabel, Hong-Linh Truong, Paul Snoeij, Thomas Hahmann, Erhard Diedrich, Marcela Doubkova,
Potential for High Resolution Systematic Global Surface Soil Moisture Retrieval Via Change Detection Using Sentinel-1, IEEE
Journal of Selected Topics in Applied Earth Observations and Remote Sensing, April, 2012
Can we combine them
with open government
data?
Page 28
Recall our motivating example (3)
ASE WS 2012 28
Source: http://www.undata-api.org/ Source:
http://www.strikeiron.com/Catalog/StrikeIronServices.aspx
Source: http://docs.gnip.com/w/page/23722723/Introduction-
to-Gnip
Page 29
WHICH OPPORTUNITIES DO
YOU SEE?
Discussion time:
ASE WS 2012 29
Page 30
Internet-scale service engineering -
- the elasticity
ASE WS 2012 30
Page 31
Internet-scale service engineering -
- big/near-real time data impact
Which are data concerns that impact the data
processing?
How to use data concerns to optimize data
analytics and service provisioning?
How to use available data assets for advanced
services in an elastic manner?
What are the role of human-based servies in
dealing with complex data analytics?
ASE WS 2012 31
Page 32
Internet-scale service engineering -
- Steps
ASE WS 2012 32
Large-scale, multi-platform services engineering
Identify platform/application
problems
Identify the scale, complexity and *city
design units, selection of existing service
units;
development and Integration, Optimization
Understanding Properties/Concerns
Data /Service/Application concerns; their dependencies
Monitoring, evaluation and provisioning of concerns
Utilization of data/service concerns
Single service/platform engineering
Service units for representing fundamental things, people
and software
Provisioning of fundamental service units
Engineering with single service units
Page 33
WHAT ARE MISSING?
Discussion time:
ASE WS 2012 33
Page 34
Single service/platform engineering
– service unit (1)
The service model and the unit concept can be applied
to things, people and software
ASE WS 2012 34
Service model
Unit Concept
Service unit
„basic
component“/“basic
function“ modeling
and description
Consumption,
ownership,
provisioning, price, etc.
Page 35
Single service/platform engineering
– service units (2)
ASE WS 2012 35
Source: Stefan Tai, Philipp Leitner, Schahram Dustdar: Design by Units: Abstractions for Human and Compute Resources for Elastic Systems.
IEEE Internet Computing 16(4): 84-88 (2012)
Page 36
Single service/platform engineering
– service unit provisioning
Provisioning software under services
Provisioning things under services
Provisioning human under services
Crowd platforms of massive numbers of individuals
Individual Compute Unit (ICU)
Social Compute Unit (SCU)
ASE WS 2012 36
1. Mark Turner, David Budgen, and Pearl Brereton. 2003. Turning Software into a Service. Computer 36, 10 (October 2003), 38-44.
DOI=10.1109/MC.2003.1236470 http://dx.doi.org/10.1109/MC.2003.1236470
2. Luigi Atzori, Antonio Iera, and Giacomo Morabito. 2010. The Internet of Things: A survey. Comput. Netw. 54, 15 (October 2010), 2787-2805.
DOI=10.1016/j.comnet.2010.05.010 http://dx.doi.org/10.1016/j.comnet.2010.05.010
3. Dominique Guinard, Vlad Trifa, Stamatis Karnouskos, Patrik Spiess, Domnic Savio: Interacting with the SOA-Based Internet of Things:
Discovery, Query, Selection, and On-Demand Provisioning of Web Services. IEEE T. Services Computing 3(3): 223-235 (2010)
4. Schahram Dustdar, Kamal Bhattacharya: The Social Compute Unit. IEEE Internet Computing 15(3): 64-69 (2011)
5. Hong-Linh Truong, Schahram Dustdar, Kamal Bhattacharya "Programming Hybrid Services in the Cloud", Springer-Verlag, 10th
International Conference on Service-oriented Computing (ICSOC 2012), November 12-16, 2012, Shanghai, China
Page 37
Single service/platform engineering
– examples (1)
Service engineering with a single
system/platform
Using Excel to access Azure datamarket places
Using Boto to access data in Amazon S3
Using Hadoop within a cluster to process local data
Using workflows to process data (e.g.,
Trident/Taverna/ASKALON)
Using StormMQ to store messages
ASE WS 2012 37
Page 38
Single service/platform engineering
– examples (2)
ASE WS 2012 38
Page 39
Internet-scale multi-platform
services engineering – required
technologies
ASE WS 2012 39
Internet-scale, Multi-platform
Services Engineering for
Software, Things and People
Data analysis/Computation
services in cluster (e.g., Hadoop)
Data services (e.g., Azure, S3)
Middleware (e.g., StormMQ)
Workflows (e.g., Trident)
Crowd platforms, human-based service
platforms(e.g., Mturks, VieCOM)
Billing/Monitoring (e.g.,
thecurrencycloud)
Page 40
WHAT ARE MISSING?
Discussion time
ASE WS 2012 40
Page 41
Exercises
Read papers mentioned in slides
Get their main ideas
Check services mentioned in examples
Examine capabilities of the mentioned services
Including price models and underlying technologies
Examine their size and scale
Examine their ecosystems and dependencies
Work on possible categories of single service
units that are useful for your work
Some common service units with capabilities and
providers
ASE WS 2012 41
Page 42
42
Thanks for your attention
Hong-Linh Truong
Distributed Systems Group
Vienna University of Technology
[email protected]
http://www.infosys.tuwien.ac.at/staff/truong
ASE WS 2012