Top Banner
ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State University Moscow, Russia
28

ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

Jan 12, 2016

Download

Documents

Bernard Leonard
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Enabling GRID for Computer Algebra Applications

Victor EdneralSkobeltsyn Institute of Nuclear Physics, Moscow State University

Moscow, Russia

Page 2: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Acknowledgements

• This talk includes slides from previous tutorials and talks delivered by:

– the EDG training team

– Dr Fabrizio Gagliardi, EGEE 1 Project DirectorEGEE 1 Project Director

– Dr Bob Jones, EGEE 1Technical DirectorEGEE 1Technical Director

– Oxana SmirnovaOxana SmirnovaJINR / Lund / CERNJINR / Lund / CERN

• Prepared by Victor Edneral, SINP MSU

Page 3: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Overview

• Motivation for Grid Computing• The basic ideas of Grid technology• What is EGEE?• Some Examples• Grid and Computer Algebra

Page 4: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

The terms of the problem

• Technological progress produces more sophisticated digital sensors (particle physics detectors, satellites, radio-telescopes, synchrotrons…)

• Much of science is therefore becoming increasingly “data-intensive”

• Huge amounts of data need to be analyzed by large and geographically distributed scientific communities

• Consequently, single computers, clusters or supercomputers are not powerful enough for the necessary calculations and the data processing

Result: access to large facilities is difficult and expensive for the scientific community, particularly in less favored countries=> increase of the “electronic divide”

Page 5: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

CERN: an example of data intensive science and a large international facility

Mont Blanc(4810 m)

• CERN is building the Large Hadron Collider (LHC) the most powerful instrument ever built to investigate elementary particles physics

• Data Challenge:

– One Megabyte of data digitised for each collision

– 1010 collisions recorded each year = 10 Petabytes/year of data !!!

– LHC data correspond to about 20 million CDs each year!

• Simulation, reconstruction, analysis: LHC data handling requires a computing power equivalent to ~100,000 of today's fastest PC processors!

(106 mega; 109 giga; 1012 tera; 1015 peta)

Downtown Geneva

Page 6: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Just a comparison…

5 times the Eiffel Tower

˜1500 m

6-8 Petabytes ˜10.000.000 CD-ROM

Page 7: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

IT progress: some facts• Network vs. computer

performance:

– Computer speed doubles every 18 months

– Network speed doubles every 9 months

• 1986 to 2000:

– Computers: 500 times faster

– Networks: 340000 times faster

• 2001 to 2010 (projected):

– Computers: 60 times faster

– Networks: 4000 times faster

Slide adapted from the Globus Alliance

Bottom line: CPUs are fast enough; networks are very fast – gotta make use of it!

Page 8: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

What is Grid computing?

• “Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations” (I.Foster)

– Resources are controlled by their owners

– The Grid infrastructure provides access to collaborators

• A Virtual Organization is:

– People from different institutions working to solve a common goal

– Sharing distributed processing and data resources

• Enabling People to Work Together on Challenging Projects

– Science, Engineering, Medicine, …

– Public service, commerce too!

Page 9: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

The Grid Paradigm

• Distributed supercomputer, based on commodity PCs and fast Networks

• Access to the great variety of resources by a single pass – certificate

• A possibility to manage distributed data in a synchronous manner

• A new commodity

Supercomputer

WorkstationPC Farm

The Grid

Page 10: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

What is EGEE?

EGEE is funded by the European Union under contract IST-2003-508833

Enabling Grid to E-sciencE

Page 11: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Complex Infrastructure

• Users want access to compute power and data

– With security, reliability, trust, …

• This requires a complex infrastructure

– Registries

– Brokers

– Administration

– Policy

– Negotiation

– Etc.

• Users shouldn’t need to know the details

Page 12: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

12

Virtual Organizations

Slide from Ian Foster’s ssdbm 03 keynote

Page 13: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Some architectural thoughts

Storage

StorageUser

Interface

UserInterface

UserInterface

InformationServer

Data locationserver

WorkloadmanagerResource

broker

InformationServer

InformationServer

Resource center

Resource center

Page 14: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

• EGEE I. April 2004

– Large scale deployment of EGEE infrastructure to deliver production level Grid services with selected number of applications

• EGEE II. April 2006

– Natural continuation of the project’s first phase

– Opening up to a larger user community

increased multidisciplinary Grid infrastructure

more involvement from Industry

– Extending the Grid infrastructure world-wide

increased international collaboration

Enabling Grid to E-sciencE

From Phase I to II

Page 15: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

The Large Hadron Collider

http://www.cern.ch

~9 km

LHC

SPS

CERN

Page 16: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Applications• Medical and biomedical:

– Image processing (digital X-ray image analysis)– Simulation for radiation therapy– Protein folding

• Chemistry– Quantum– Organic– Polymer modelling

• Climate studies• Space sciences• Physics:

– High Energy and other accelerator physics– Theoretical physics, lattice calculations of all sorts– Neutrino physics– Combustion– Thermonuclear synthesis

• Genomics• Material sciences• Computer algebra?

Page 17: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Bio-medicine applications

Medical images

Exam image patient key ACL ...

1. Query the medical image database and retrieve a patient image

Metadata

3. Retrieve most similar cases

Similar images Low score images

2. Compute similarity measures over the database images

Submit 1 job per image

• Bio-informatics– Phylogenetics– Search for primers– Statistical genetics– Bio-informatics web portal– Parasitology– Data-mining on DNA chips– Geometrical protein comparison

• Medical imaging– MR image simulation– Medical data and metadata

management– Mammographies analysis– Simulation platform for

PET/SPECT

Applications deployedApplications tested

Applications under preparation

Page 18: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Bio-medicine applications

Page 19: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Earth observation applications

Grid contribute to EO:

• enhance the ability to access high level products

• allow reprocessing of large historical archives

• improve Earth science complex applications (data fusion, data mining, modelling …) Roberto Barbera

ESA missions:

• about 100 Gbytes of data per day (ERS 1/2)

• 500 Gbytes, for the ENVISAT mission (2002).

Page 20: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Earth observation applications

Page 21: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Grid Geography

Page 22: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

The Grid on This Monday

Page 23: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Grid for Computer Algebra

• Which problems

– Many similar calculations

• Experimental Mathematics. Scanning some domain in parameter space. For example searching domains of integrability of an ODE system.

• Massive calculation of Feynman diagrams

• Which software we can use

– Batch calculations

– Statically compiled programs for testing

– GNU license

Page 24: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Case of Resonance (1:3)

Page 25: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Russian Data Intensive Grid

491 Processors 34.42 TByte

Page 26: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Virtual Organization

• RDIG rather than EGEE

• The calg VO at Skobeltsyn Institute of Nuclear Physics (MSU) & Joint Institute of Nuclear Research (Dubna)

• We have hardware now

– 124 processors x (1-4) GByte RAM

– 19.13 TByte mass storage

• Appropriate software

– CompHEP (SINP MSU)

– GINV (JINR)

– Singular

– LISP

Page 27: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

To know more about EGEE:

http://www.eu-egee.org

Page 28: ACA2006, June 26-29, Varna, Bulgaria Enabling GRID for Computer Algebra Applications Victor Edneral Skobeltsyn Institute of Nuclear Physics, Moscow State.

ACA2006, June 26-29, Varna, Bulgaria

Questions?