All E-ARK tools are available via our GitHub site.

Pre-Ingest Tools 

This section lists the tools which are capable of preparing SIPs according to the E-ARK SIP specification. 


RODA-in is a tool specially designed for producers and archivists to create Submission Information Packages (SIP) ready to be submitted to an Open Archival Information System (OAIS). The tool creates SIPs from files and folders available on the local file system.

In version 2 we revolutionized the way SIPs are created to satisfy the need for mass processing of data. In this version you can create thousands of valid SIPs with just a few clicks, complete with data and metadata.

Installable package of RODA-in is available at 

ETP (ESSArch Tools for Producer)

SIP packaging and delivery tool for data producers

Download and installation instructions of ETP are available at

ETA (ESSArch Tools for Archives)

ETA (ESSArch Tools for Archive)


Database preservation tools

This section lists the tools and components which allow for the migration of live databases into the SIARD2 preservation format and reuse of databases stored in the SIARD2 format.

Database Preservation Toolkit (DBPTK)

The Database Preservation Toolkit allows conversion between Database formats, including connection to live systems, for purposes of digitally preserving databases. The toolkit allows conversion of live or backed-up databases into preservation formats such as SIARD 2.0, a XML-based format created for the purpose of database preservation. The toolkit allows also to upload the archived database into a live DBMS (Oracle, MySQL etc.).

Further information and the executable application can be found at


RESTful service to control the DBPTK by a graphical user interface.


Relational Database Viewer for databases based on SIARD 2


Digital repository solutions 

This section lists the repository solutions provided by E-ARK partners which implement the E-ARK Information Package specifications. 


RODA is a complete digital repository that delivers functionality for all the main units of the OAIS reference model. RODA is capable of ingesting, managing and providing access to the various types of digital objects produced by large corporations or public bodies. RODA is based on open-source technologies and is supported by existing standards. 

Further information about RODA can be found at

ESSArch Preservation Platform

ESSArch Preservation Platform - an practical implementation of the OAIS (Reference Model for an Open Archival Information System).

Further information about the platform is available at


E-ARK Integrated Platform components 

The E-ARK Integrated Platform is a reference implementation of all E-ARK specifications. The main component of the Integrated Platform is the E-ARK Web which is a digital preservation system supporting all OAIS functions. The E-ARK Web can be extended with additional components for scalable computing, data mining and natural language processing.


E-ARK Web is an open source web-based archiving, digital preservation and access system. It is OAIS-oriented which means that data ingest, archiving and dissemination functions operate on information packages bundling content and metadata in contiguous containers.

Access to a demo installation of E-ARK Web is available upon request. If interested, please contact us!


Tool of the full-scale E-ARK deployment to extract content from AIPs and load content into Lily.


Common Maven configuration of the Java-based tools used in the E-ARK full-scale deployment.


The data mining and natural language processing modules used in a cluster environment (Hadoop).


MapReduce job to run text classifier


A tool of the full-scale E-ARK deployment which provides a REST service for transfering large files into HDFS.


The map/reduce file ingest tool of the full-scale E-ARK deployment. It unpackages TAR packaged E-ARK information packages and initiates the indexing of the individual files using the Lily API. The Java-based tool runs as a service and consumes RabbitMQ messages notifying about new packages being available for indexing in HDFS.


Prototype for end user's interface 


Java example client for the file upload service (dm-hdfs-storage)


Shared Maven and other global configuration parameters of the Java-based tools.


E-ARK WP6 - reference implementation: generate csv from archived db


Other tools or components 

This section lists all other E-ARK sub-projects available to date. 


The RODA Commons IP provides a generic API to manipulate Information Packages in the E-ARK Common Specification format.


A search engine for the Pelagios universe, with a comprehensive JSON API.

E-ARK Information Package examples

Specification and example of the Submission, Archival and Dissemination Information Packages.

SIARD2 format

SIARD 2 Format specification and examples.

Latest News

RSS 2.0 Feed - Latest News