Introduction

The SCIGNE platform <https://scigne.fr/>`_ offers a high-throughput computing (HTC) service based on technologies developed for computing grids. Data exchanges between the various computing centres that make up these grids are carried out using specific protocols (HTTPS and XROOTD).

This document details the use of the GFAL2 tool to interact with these protocols to manage your data from between your workstation and the SCIGNE platform.

HTC storage system

Several storage systems (dCache, Storm, etc) are used by the computing grid sites. In order to ensure that the systems are interoperable, they communicate using identical protocols.

The SCIGNE platform uses the dCache system to make HTC storage available. Although several protocols are available (HTTPS, XROOTD), this documentation focuses on the use of HTTPS.

The dCache storage system consists of a front-end server and a set of file servers. The front-end server centralises file access requests. The file servers physically store the files. Files are accessed by querying the front-end server using the commands detailed in this document.

Access to data is controlled by a certificate. It is therefore necessary to have a valid proxy when using this type of storage.

Data management with dCache

This section describes how to manage data stored on a dCache server.

Foreword

The following URL is used throughout this section. It will be abbreviated to ${BASE_URL}:

$ export BASE_URL="https://sbgdcache.in2p3.fr/vo.scigne.fr"

Client installation

Currently, packages are only available for the RedHat distribution and its derivatives. The packages are available in the EPEL repository. Once this repository is available on your system, the gfal2 tools can be quickly installed with :

$ dnf install gfal2-util gfal2-all

Reading the contents of a directory

The gfal-ls command lists the contents of a directory. The result of this command is similar to that of the ls command. The example below illustrates how this command works:

$ gfal-ls ${BASE_URL}
EOST
IBMC
IBMP
ICS
ICUBE
IPHC
$ gfal-ls -l ${BASE_URL}
drwxrwxrwx   0 0     0             0 Aug 22 07:42 EOST
drwxrwxrwx   0 0     0             0 Aug 22 07:42 IBMC
drwxrwxrwx   0 0     0             0 Aug 22 07:42 IBMP
drwxrwxrwx   0 0     0             0 Aug 22 07:42 ICS
drwxrwxrwx   0 0     0             0 Aug 22 07:42 ICUBE
drwxrwxrwx   0 0     0             0 Aug 22 07:43 IPHC

Note: As the contents of this directory change over time, the result displayed by the previous commands differs from the examples above.

Creation a directory

The gfal-mkdir command permits to create a directory:

$ gfal-mkdir ${BASE_URL}/lab/user/data

Note that the value /lab/user/data is given as an example and must be adapted to your specific use case. By convention:

  • lab is the acronym of the laboratory in uppercase ;

  • user is the concatenation of the first letter of the first name and the surname.

Copying a file

The gfal-copy command is used to copy data from or to the dCache server:

$ gfal-copy file://${PWD}/my_dataset.tgz ${BASE_URL}/lab/user/data/my_dataset.tgz
Copying file://${PWD}/my_dataset.tgz   [DONE]  after 2s

Deleting a file

The gfal-rm command permits to delete a file on the dCache server:

$ gfal-rm ${BASE_URL}/lab/user/data/my_dataset.tar.gz
https://sbgdcache.in2p3.fr/vo.scigne.fr/lab/user/my_dataset.tgz      DELETED

Deleting a directory

The gfal-rm -r command is used to delete a directory (and its contents):

$ gfal-rm -r ${BASE_URL}/lab/user/data

Additional documentation

Ces liens proposent des références complémentaires pour approfondir la gestion des données sur la plateforme SCIGNE :