Grid Information and Monitoring System using XML-RPC andInstant Messaging for DIRAC

I.Stokes-Rees, A.Tsaregorodtsev, V.Garonne

To be presented by I.Stokes-Rees

The DIRAC system developed for the CERN LHCb experiment is a grid infrastructure for managing generic simulation and analysis jobs. It enables jobs to be distributed across a variety of computing resources, such as PBS, LSF, BQS, Condor, Globus, LCG, and individual workstations. A key challenge of distributed service architectures is that there is no single point of control over all components. DIRAC addresses this via two complementary features: a distributed Information System, and an XMPP (Extensible Messaging and Presence Protocol) Instant Messaging framework.

The Information System provides a concept of local and remote information sources. Any information which is not found locally will be fetched from remote sources. This allows a component to define its own state, while fetching the state of other components directly from those components, or via a central Information Service. We will present the architecture, features, and performance of this system.

XMPP has provided DIRAC with numerous advantages. As an authenticated, robust, lightweight, and scalable asynchronous message passing system, XMPP is used, in addition to XML-RPC, for inter-Service communication, making DIRAC very fault-tolerant, a critical feature when using SOA. XMPP is also used for monitoring real-time behaviour of the various DIRAC components. Finally, XMPP provides XML-RPC like facilities which are being developed to provide control channels direct to Services, Agents, and Jobs. We will describe our novel use of Instant Messaging in DIRAC and discuss directions for the future.


Last modified Tue 18 May 2004 . View page history
Switch to HTTPS . Website Help . Print View . Built with GridSite 1.4.3
For more about GridPP please contact Neasan O'Neill