All E-ARK tools are available via our GitHub site.
This section lists the tools which are capable of preparing SIPs according to the E-ARK SIP specification.
ERMS Export Module
The ERMS Export Module is a tool which allows for the controlled and selective export of records and their metadata from any CMIS 1.0 compliant records or content management system.
The tool allows users to set up a CMIS connection to the source system, acquire an overview of the records and their aggregations, select required records, and finally export selected records and their metadata into the structure defined by the E-ARK Common Specification. As of now, the module supports the mapping of metadata into EAD3 format. Technically it’s possible to create mappings into any destination format.
RODA-in is a tool specially designed for producers and archivists to create Submission Information Packages (SIP) ready to be submitted to an Open Archival Information System (OAIS). The tool creates SIPs from files and folders available on the local file system.
In version 2 we revolutionized the way SIPs are created to satisfy the need for mass processing of data. In this version you can create thousands of valid SIPs with just a few clicks, complete with data and metadata.
Installable package of RODA-in is available at http://rodain.roda-community.org/
ETP is a tool designed to support the creation of Submission Information Packages (SIPs) by the data producer, and their delivery to archival repositories. The tool is highly configurable and allows for the setup of complex archival description profiles and validation. The tool is fully compliant with the E-ARK SIP specification.
Download and installation instructions of ETP are available at http://etp.essarch.org/.
ETA is the archives counterpart of ETP, developed to receive and validate packages created according to the E-ARK SIP Specification.
Database preservation tools
This section lists the tools and components which allow for the migration of live databases into the SIARD2 preservation format and reuse of databases stored in the SIARD2 format.
The Database Preservation Toolkit allows conversion between Database formats, including connection to live systems, for purposes of digitally preserving databases. The toolkit allows conversion of live or backed-up databases into preservation formats such as SIARD 2.0, a XML-based format created for the purpose of database preservation. The toolkit allows also to upload the archived database into a live DBMS (Oracle, MySQL etc.).
Further information and the executable application can be found at http://www.database-preservation.com/.
RESTful service to control the DBPTK by a graphical user interface.
Relational Database Viewer for databases based on SIARD 2
Digital repository solutions
This section lists the repository solutions provided by E-ARK partners which implement the E-ARK Information Package specifications.
RODA is a complete digital repository that delivers functionality for all the main units of the OAIS reference model. RODA is capable of ingesting, managing and providing access to the various types of digital objects produced by large corporations or public bodies. RODA is based on open-source technologies and is supported by existing standards.
Further information about RODA can be found at http://www.roda-community.org/.
ESSArch Preservation Platform - an practical implementation of the OAIS (Reference Model for an Open Archival Information System).
Further information about the platform is available at http://www.essolutions.se/ESSArch.
E-ARK Integrated Platform components
The E-ARK Integrated Platform is a reference implementation of all E-ARK specifications. The main component of the Integrated Platform is the E-ARK Web which is a digital preservation system supporting all OAIS functions. The E-ARK Web can be extended with additional components for scalable computing, data mining and natural language processing.
E-ARK Web is an open source web-based archiving, digital preservation and access system. It is OAIS-oriented which means that data ingest, archiving and dissemination functions operate on information packages bundling content and metadata in contiguous containers.
Access to a demo installation of E-ARK Web is available upon request. If interested, please contact us!
The following components listed in this section are optional add-ons which primarily provide support for scalable infrastructures (i.e. Lily and Hadoop technologies).
Tool of the full-scale E-ARK deployment to extract content from AIPs and load content into Lily.
Common Maven configuration of the Java-based tools used in the E-ARK full-scale deployment.
The data mining and natural language processing modules used in a cluster environment (Hadoop).
MapReduce job to run text classifier
A tool of the full-scale E-ARK deployment which provides a REST service for transfering large files into HDFS.
The map/reduce file ingest tool of the full-scale E-ARK deployment. It unpackages TAR packaged E-ARK information packages and initiates the indexing of the individual files using the Lily API. The Java-based tool runs as a service and consumes RabbitMQ messages notifying about new packages being available for indexing in HDFS.
Prototype for end user's interface
Java example client for the file upload service (dm-hdfs-storage)
Shared Maven and other global configuration parameters of the Java-based tools.
E-ARK WP6 - reference implementation: generate csv from archived db
Access tools or components
A search engine for the Pelagios universe, with a comprehensive JSON API.
The E-ARK CMIS Viewer facilitates remote access to any CMIS 1.0 compatible digital preservation system.
Access Software Platform
The E-ARK Order Management Tool allows for querying, ordering and viewing of E-ARK Dissemination Information Packages from any compliant repository. The tool consists of a backend, the Order Management Service, and a frontend called the E-ARK Platform UI.
Other tools or components
This section lists all other E-ARK sub-projects available to date.
The EAD Editor is a tool which allows for simple form-based editing and creation of EAD3 compliant archival descriptions
This project includes all documentation about the E-ARK data warehousing and OLAP pilot.
The RODA Commons IP provides a generic API to manipulate Information Packages in the E-ARK Common Specification format.
Specification and example of the Submission, Archival and Dissemination Information Packages.
SIARD 2 Format specification and examples.