Top Banner
Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST
18

Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Dec 26, 2015

Download

Documents

Janis Gaines
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Data Analysis on Handheld Devices Using Clarens

Tahir AzimNUST

Page 2: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

BackgroundBackground

• High Energy Physics (HEP) data storage requirements

– ROOT files, HBOOK files, HEPREP files …

• Data not conclusive in its raw form

• Grid-enabled analysis

– Graphical Visualizations (2D – 3D)

• Desktop Analysis Applications Available

– Java Analysis Studio (histogramming and analysis package)

– WWW Interactive Remote Event Display (detector geometry and event display)

– ROOT (efficient file format)

• Requirement of efficient Grid-enabled analysis capabilities on handheld devices

Page 3: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

The Grid For Handheld Devices…The Grid For Handheld Devices…

• Handheld devices can enable ubiquity• Portable access to data• But they cannot provide all the

computing power to do data analysis and processing on the fly

• Grid & distributed computing can provide the power of much more resourceful computers constantly on your handheld

Page 4: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Challenges of Handheld DevicesChallenges of Handheld Devices

• Relatively slow processors • Small display screen• Small storage space (excluding add-on memory cards)• Intermittent, relatively low bandwidth network

connections• Lack of software support

– Bugs in JVMs– Java 1.1 instead of Java 1.4– Embedded Visual C++ instead of MS Visual C++– Only a small subset of operating system libraries and APIs

available– Relatively difficult installation procedures

Page 5: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

The Clarens Web Services FrameworkThe Clarens Web Services Framework

• Clarens is a high-performance, Grid-enabled web service framework

• Supports lightweight XML-RPC protocol for accessing all kinds of tools and services

• Provides a uniform way of accessing widely differing (e.g. in terms of access protocols, running platforms) Grid tools

• Provides a remote file access service and a catalog service to look for distributed datasets

• A shell service to provide shell-like access to remote machines

• Clarens services can be designed to act as lightweight wrappers to powerful backend utilities and to other services

• Available in Python and Java implementations

Page 6: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

View of a Collaborative Desktop

PDAPDA

ROOTROOT

ClarensClarensClarensClarens

External Services

MonaLisaMonaLisaBrowserBrowserIguanaIguana

VO

ManagementAuthenticatio

n Authorization LoggingKey

Escrow

File Access

Shell

Storage Resource

Broker

CMS ORCA/COBRA

Cluster Schedulers

ATLAS DIAL

Griphyn VDT

MonaLisa Monitoring

Page 7: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Architectural OverviewArchitectural Overview

Mod-python/Servlet

fileservice

echoservice

systemservice

groupservice

proxyservice

rendezvousservice

Clarens

Apache/Tomcat

Filesystem

Database

Binary data

XML-RPC

XML/binary data ( plaintext / SSL encrypted)

Data Transfer/Authentication Visualization

Handheld Clients

To MonALISA

Page 8: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

[ Technologies for Development on Handheld Clients][ Technologies for Development on Handheld Clients]

• Plethora of available technologies:

– IBM WebSphere

– MIDP

– SuperWaba

– Savaje

– Personal Java

• Insignia Jeode

• CrEme

Page 9: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Pocket JAS Pocket JAS

• Displays histograms and scatter plots from

data in ROOT files and flat files

• ROOT IO rewritten to make it PDA-

compatible

• Grid authentication and security built-in

• Uses Clarens ‘file’ services to look for and

download ROOT files

• Supports X.509 as well as PKCS12

certificates

• Displays various statistics about the data

sets and can fit function curves against the

histogram data

• Handles unreliable, intermittent data

transfers over wireless links

Page 10: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

[ Pocket JAS plug-in architecture ]

[ Pocket JAS plug-in architecture ]

Page 11: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Pocket Wired Pocket Wired

• 3D display of event data from HepRep and HepRep2 files

• Display of HepRep event tree

• Most functionality from WIRED ported including:

– 2D Translation & Rotation

– Scaling

– 3D rotation

– Projections

• Remarkable enhancement in speed by using CrEme JVM instead of Jeode

Page 12: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Using Clarens as the Data Processing EngineUsing Clarens as the Data Processing Engine

• Parsing libraries of certain file types could not be ported to Pocket PC/ WinCE, e.g. HBOOK files

• A Clarens service was written for processing of HBOOK files

• Handheld clients browse the remote file system for required files

• Selected files are processed by an ‘HBOOK’ service, and converted into a suitable format for handhelds

• Data is returned back for analysis to handheld device in supported format

Handheld ClientHandheld ClientHandheld ClientHandheld Client

XML-RPCXML-RPCXML-RPCXML-RPC

Web serverWeb serverWeb serverWeb server

SOAP/XML-RPCSOAP/XML-RPCSOAP/XML-RPCSOAP/XML-RPC

ClarensClarensClarensClarens

HBOOK files onHBOOK files onlocal file systemlocal file system

HBOOK files onHBOOK files onlocal file systemlocal file system

File

Serv

ice

File

Serv

ice

HB

OO

K fi

le

HB

OO

K fi

le

pro

cessor

pro

cessor

HTTPHTTPHTTPSHTTPS

Page 13: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Distributed Analysis with a Condor-based Execution EngineDistributed Analysis with a Condor-based Execution Engine

Page 14: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Features of the distributed analysis environmentFeatures of the distributed analysis environment

• Finds convenient places for the users “job” (computing task) to be run

• Seeks to use the widely dispersed resources efficiently

• Deals with authentication to the different sites that the scientists will be using

• Runs the jobs

• Monitors progress

• Recovers from problems

• Tells you when the work is complete and transfers the result back

Page 15: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Monitoring ServiceMonitoring Service

• Currently uses the end host monitoring agent (EMA) for monitoring– EMA is a plug-in of MonALISA developed at NUST

• Calculates an estimate of the load of each server

• Reports OS information, CPU Clock rate, load coefficient and other monitoring information and load coefficient to the resource broker

Page 16: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

Job Submission ServiceJob Submission Service

• Receives job requests from the handheld clients

• Decides the best site to submit the jobs based on monitoring

information and OS specified

• Forwards the job requests to remote site

• Maintains a mapping of job IDs and submission sites

• Uses CONDOR for job submission; thereby enabling parallel job

execution on farms and clusters

• Clients do not have to know where the job was finally submitted

• Provides status information to the handheld clients

• Enables the handheld clients to get back outputs of executed jobs

Page 17: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

ResultsResults

• Handheld clients are able to access and utilize the system transparently

• Handhelds are able to do analysis with exceptionally good performance

• Analysis tasks on large number of data files can be started in parallel for faster results

• Using Clarens, lightweight handheld clients, ranging from PDAs to mobile phones, can make use of powerful computational services

Page 18: Data Analysis on Handheld Devices Using Clarens Tahir Azim NUST.

ConclusionConclusion

• Work presented and used at major international workshops and exhibitions

– Demonstration at 1st GAE Workshop at Caltech in June 2003

– Work presented as part of “Grid Enabled Analysis” demo at ITU Telecom World 2003, carried out by Caltech, CERN, UERJ, UPB, KEK, Sinica and NUST

– Various demonstrations in workshops and conferences in Pakistan

• Clarens can enable lightweight devices, like PDAs and mobile phones, to access Grid resources and distributed datasets

• Work demonstrates how the Grid can be used to provide powerful computational and processing capabilities on handhelds