You are on page 1of 23

Auger & XtremWeb:

Monte Carlo computation on


A Global Computing platform
O. Lodygensky, G. Fedak, V. Neri,
A. Cordier, F. Cappello

Laboratoire de l’Accelerateur Lineaire;


Laboratoire de Recherche en Informatique;
CNRS, Université Paris sud, France.
Sommaire
• Introduction
• XtremWeb
• Auger distributed computing
• Conclusion
Different GRID
Nodes
caracteristics
Traditionnal
« GRID » computing •<100
centers, •Stables
Clusters •Individually
identified
2 distributed •Trusted
system types
Global computing systems Windows, •~100 000
Linux, •Volatiles
Mac OS •No individual
« Desktop GRID » ident
« Internet Computing » •Not trusted

Peer to Peer systems


(P2P)

March 27, 2003 CHEP2003 - O.Lodygensky 3


Desktop GRID
• Dedicated applications
One server centralizes scheduling – SETI@Home, distributed.net,
On volunteer PCs – Decrypthon
• Production projects
Client Application
– Folding@home,
set params. / get results.
Genome@home,
server
– Folderol,
parametres
• Open source/research
Internet projects
Volunteer – XtremWeb, BOINC,
PC
• Commercial platforms
Volunteer – Entropia, Datasynapse,
Volunteer PC : PC
load & exec task
– United Devices, Grid systems

March 27, 2003 CHEP2003 - O.Lodygensky 4


Desktop Grid characteristics
• Scalability : up to 100 k, 1 M hosts
• Heterogeneity : different hardwares, OSes
• Volatility : unpredictable participant behaviour
 Seti@home, Napster, Kazaa, etc. : they work well despite volatility.
• Perenity : developments and upgrades must be easy
• Performances : Seti@home ~30 Tflops,
Kazaa (1 M users : 100kb/s, 1Mb/s  100 Gb/s, 1 Tb/s?).
• Sécurity :
 Volunteer PCs and servers integrity ;
 Prevent application & results corruption ;
 Authentication.

March 27, 2003 CHEP2003 - O.Lodygensky 5


Sommaire
• Introduction
• XtremWeb
• Auger distributed computing
• Conclusion

March 27, 2003 CHEP2003 - O.Lodygensky 6


XW : Architecture
• Centralized
• Global Computing (Peer to Peer)
• 3 entities : client/coordinator/worker
Hierarchical
P2P Coordinator Coordinator
Global Computing
coordinator
(client)

PC
Internet / LAN Client/worker

PC PC
PC Worker Worker
March 27, 2003 Client/Worker CHEP2003 - O.Lodygensky 7
XW : Technology
Data
Server Java SQL Base
Java JDBC

Communication XML-RPC
protocol SSL
Http Server PHP3-4

Worker Java GNU


Client Installation autotool

Pre requisite for installation: database (Mysql),


JAVA > jdk1.2.

March 27, 2003 CHEP2003 - O.Lodygensky 8


XW : Security
ssh ssh
Coordinat
Coordinat..

Worker

Client Sandbox Loaded


(SBLSM) App

March 27, 2003 CHEP2003 - O.Lodygensky 9


XW : fault tolerance model
• Every entity is volatile by essence
• Connectionless protocols
=> All entities are stand alone

Submit task Get work


Client
Client Worker1
Worker1
Sync/Retrieve result Coord. Put result
Coord.
re sult Sync
/Get
ve
c/R etrie Put work
Client2
Syn resu
lt
Client2
Sync Worker2
Worker2
/Subm
Sync it task t w ork
/Retr / G e
ieve r Coord.
Coord. Sync sult
esult /P u t re
Sync

March 27, 2003 CHEP2003 - O.Lodygensky 10


Sommaire
• Introduction
• XtremWeb
• Auger distributed computing
• Conclusion

March 27, 2003 CHEP2003 - O.Lodygensky 11


Pierre Auger Observatory
Understanding the origin of very high cosmic rays:
• Aires: Air Showers Extended Simulation
– Sequential, Monte Carlo. Time for a run: 5 to 10 hours

Air shower parameter


Traditional Super database (Lyon, France) XtremWeb
Computing Centers
Server
CINES (Fr) Estimated
air shower PC number ~ 5000

PC worker
Internet
Fermi Lab (USA) and LAN PC Client

PC Worker
CHEP2003 - O.Lodygensky
PC worker Aires
March 27, 2003 12
Auger-XW (AIRES): High Energy Physics
Application : Icluster Madison
Grenoble Wisconsin
AIRES
PBS Condor
Deployment:
• Coordinator at LRI
• Madison: 700 workers
U-psud
Pentium III, Linux Internet network
LRI
(500 MHz+933 MHz) Other
Condor Pool
Labs
(Condor pool)
• Grenoble Icluster: 146 workers
(733 Mhz), PBS
• LRI: 100 workers lri.fr
Pentium III, Athlon, Linux XW Coordinator
XW Client
(500MHz, 733MHz, 1.5 GHz)
(Condor pool)
March 27, 2003 CHEP2003 - O.Lodygensky 13
Auger-XW (AIRES): High Energy Physics

March 27, 2003 CHEP2003 - O.Lodygensky 14


Auger-XW (AIRES): High Energy Physics

March 27, 2003 CHEP2003 - O.Lodygensky 15


Auger-XW (AIRES): High Energy Physics

March 27, 2003 CHEP2003 - O.Lodygensky 16


Auger-XW (AIRES): High Energy Physics

March 27, 2003 CHEP2003 - O.Lodygensky 17


Auger-XW (AIRES): High Energy Physics

March 27, 2003 CHEP2003 - O.Lodygensky 18


Auger-XW (AIRES): High Energy Physics

March 27, 2003 CHEP2003 - O.Lodygensky 19


Auger-XW (AIRES): High Energy Physics

March 27, 2003 CHEP2003 - O.Lodygensky 20


Sommaire
• Introduction
• XtremWeb
• Auger distributed computing
• Conclusion

March 27, 2003 CHEP2003 - O.Lodygensky 21


Conclusion
XtremWeb : a « desktop Grid » platform
 Fault tolerance. XtremWeb : « connectionless » + « restartable »
 Security : certificats + crypto + sandbox +…

– What we have learned so far with XtremWeb:


– Deployment is critical
– When they understand the computational power potential ,
users rapidly ask for more resources!!!

XtremWeb Auger:
International Desktop GRID
Condor pools with XW as global infrastructure
Good performances (ratio 1:60 with several small hosts than the
reference)

=>Schedulling is a lack of XtremWeb <=


=>Strong need of results browsing tools <=

March 27, 2003 CHEP2003 - O.Lodygensky 22


Software
• XtremWeb : www.XtremWeb.net
– Since 2001
– Acual version : 1.2.rc0

March 27, 2003 CHEP2003 - O.Lodygensky 23

You might also like