• Keine Ergebnisse gefunden

A Technical View

N/A
N/A
Protected

Academic year: 2022

Aktie "A Technical View"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

The Collaborative Climate Community Data and Processing Grid (C3Grid)

A Technical View

B. Bräuer, C. Grimme, A. Papaspyrou, S. Plantikow

Data Management System

The Data Management System (DMS) stages data from primary sources, manages replicas and inter-workspace data transfers.

DIS

discovery metadata:

metadata (local)

local institutes DMS

planned lifetime!

grid services

metadata (local)metadata

(local)

replication

replica metadata:

Challenges: Plan the future!

• at which sites will file A exist tomorrow at 11:00 am?

• copy file A from X to Y today between 6 pm and 10 pm

• How long will a transfer take?

• When can file X be restored from tape to disk earliest?

Co-scheduling

The Workflow Scheduler relies on the transfer plans of the DMS for its own scheduling.

Provenance tracking

Answers: How were these data produced? Which data were derived from this?

Implementation:

based on GridFTP, GSI, MDS, Apache Axis and Lucene

Import/Export

The DMS provides support for importing and exporting data into and out of the grid

Dynamic Resource Discovery

DMS dynamically integrates new providers using proven Globus Toolkit resource discovery services like MDS.

11

11

Portal

Workflow Scheduler

distributed grid infrastructure

DMS

workspace user interface

RIS DIS

data providers compute providers

Workflow Scheduling Service

The Workflow Scheduling Service accepts workflow descriptions from the Portal and coordinates resource allocation and data provision in collaboration with the DMS.

Workflow description

Climate workflows consist of data staging, transfer, and execution tasks which are described in JSDL. The dependencies between those tasks, are given by the proprietary C3Grid Workflow Specification Language (WSL).

DMS

Workflow Scheduler

Portal interface

GRAMWS

Data

C3Grid Architecture Overview

Transfer

Analysis

Data Transfer

Transfer

Workflow Description in WSL, Tasks in JSDL

Resource Information

coordination through negotiation

execution

registration query

local computing resources

Workflow scheduling

Based on the modular workflow concept, the scheduler decides where to transport data and when to execute analysis tasks with respect to the defined task inter-dependencies:

• negotiation-based coordina- tion of data provisioning with the DMS

• the selection of adequate resources for job execution is based on information stored in the Resource Information Service

• the C3Grid Production Environment (CPE) provides a modular, individual, and dynamical environment for user applications

• job execution is done via Globus WS-GRAM

C3Grid Production Environ- ment

Supported by the DEISA modules technologies to dynamically load required software and tools.

Capabilities of a resource are published to the Resource Information Service.

Status Messaging to Portal

Implementation of the OASIS WS- Notification standard for workflow and task status reports.

Portal – User Interface

Data Retrieval

The Data Information System (DIS) is the catalogue where the C3Grid metadata (ISO 19115) information are stored. It is built upon panFMP, a generic and flexible framework for building data portals independent of metadata formats and protocols. panFMP is based on Apache Lucene and the OAI Protocol for Metadata Harvesting. With the portal a user is able to search and browse for data within the DIS easily. The portal also provides the staging of files and gives the possibility to download the staged results.

Workflow Submission

The other main application of the portal provides the submission of workflows. There are some specific workflows which contain predefined datasets (this means that they are compatible with the corresponding workflow) and proper parameters. Based on the the selections made by the user the portlet builds a job description object and sends it to the Scheduler’s WebService interface.

After submission the WSS sends status messages to a notification service (based on WS- Notification) which reacts for example with sending an email to the user to inform him about the current state of his job.

GridSphere

Core Portlets C3Grid

Workflow Submission Portlets

C3Grid Data Retrieval

Portlets

Workflow Scheduler Data Information Service Portal

Search for metadata Submit job

The C3Grid Portal acts as a graphical user interface and allows users the access to C3Grid and its resources. It makes the C3Grid metadata catalogue searchable and sends workflow descriptions to the Workflow Scheduling Service (WSS).

Referenzen

ÄHNLICHE DOKUMENTE

T interval – the inter-packet interval for a thread; Stat – a structure that contains the information about time parameters and collected at the end of the sending session;

This concept clearly separates the resource view from the control flow view. Therefore, the actors and roles of a WFR-net resp. a WFDR-net should have no influence on the control flow

The EDIT Specimen Explorer, the second component examined, helps find taxonomically relevant specimen and observation data by searching the GBIF (Global Biodiversity

In the sense of strategic management, it is a question of dealing with knowledge, software resources and new economic impacts on employing “free” and (today) non-rare resources

Fine mapping, epigenetic landscape and chromatin architecture at rs8133843 as an exemplary category 1 locus containing RA credible SNPs.. The likely causal credible SNPs are

Both geographic (section 3.2.1) as well as product related measuring approaches (section 3.2.2) are analyzed and their appropriateness is discussed with regard to

Vectools reduces the need for custom scripts by offering an easy-to-use command-line tool with a wide range functions for manipulating tables, one of the most

Weirick T, Müller R, Uchida S: Data and analysis pipelines used in Increasing workflow development speed and reproducibility with Vectools [Data set]. Hewett-Emmett D, Tashian