Halyard

Halyard is an extremely horizontally scalable RDF store with support for named graphs, designed for storing and integrating extremely large semantic data models and for executing SPARQL 1.1 queries over snapshots of the whole linked data universe. Halyard is built on the Eclipse RDF4J framework and the Apache HBase database, and it is written entirely in Java.

Run in Amazon EMR

To get started with Halyard, try deploying it on an Amazon Elastic MapReduce (EMR) cluster.

See the Amazon EMR sample cluster setup.

Run locally

Download the latest halyard-sdk-<version>.zip bundle from the releases page and unzip it on an Apache Hadoop cluster node with a configured Apache HBase client.

Halyard is expected to run on an Apache Hadoop cluster node with a configured Apache HBase client. Apache Hadoop and Apache HBase are not bundled with Halyard and must be provided by the cluster. The runtime requirements are:

  • Apache Hadoop version 2.5.1 or higher
  • Apache HBase version 1.1.2 or higher
  • Java 8 Runtime

Note: The recommended Apache Hadoop distribution is Hortonworks Data Platform (HDP) version 2.4.2 or Amazon Elastic MapReduce (EMR).
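
The minimum versions listed above can be checked before unpacking the SDK. The following is a minimal sketch, not part of Halyard: the class name VersionCheck and its methods are hypothetical, and the sample version strings in main are illustrative values rather than output from a real cluster. It assumes version strings start with dot-separated numbers (for example, as reported by the hadoop version and hbase version commands) and ignores any vendor suffix.

```java
// Hypothetical helper, not part of Halyard: compares dot-separated version strings.
final class VersionCheck {

    // Minimum versions taken from the runtime requirements list.
    static final String MIN_HADOOP = "2.5.1";
    static final String MIN_HBASE = "1.1.2";

    // Parses the leading numeric components, e.g. "2.7.3-amzn-1" -> [2, 7, 3].
    public static int[] parse(String version) {
        // Keep only the prefix made of digits and dots; drop vendor suffixes.
        String numeric = version.trim().split("[^0-9.]", 2)[0];
        String[] parts = numeric.split("\\.");
        int[] result = new int[parts.length];
        for (int i = 0; i < parts.length; i++) {
            // Treat an empty component as 0 instead of failing.
            result[i] = parts[i].isEmpty() ? 0 : Integer.parseInt(parts[i]);
        }
        return result;
    }

    // Returns true when 'version' is equal to or newer than 'minimum'.
    public static boolean isAtLeast(String version, String minimum) {
        int[] v = parse(version);
        int[] m = parse(minimum);
        int n = Math.max(v.length, m.length);
        for (int i = 0; i < n; i++) {
            // Missing components count as 0, so "1.1" is older than "1.1.2".
            int a = i < v.length ? v[i] : 0;
            int b = i < m.length ? m[i] : 0;
            if (a != b) {
                return a > b;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Illustrative inputs; replace with the versions reported by your cluster.
        String hadoop = args.length > 0 ? args[0] : "2.7.3";
        String hbase = args.length > 1 ? args[1] : "1.1.2";
        System.out.println("Hadoop " + hadoop + " meets minimum " + MIN_HADOOP
                + ": " + isAtLeast(hadoop, MIN_HADOOP));
        System.out.println("HBase " + hbase + " meets minimum " + MIN_HBASE
                + ": " + isAtLeast(hbase, MIN_HBASE));
    }
}
```

The comparison is numeric per component rather than lexicographic, so a version such as 2.10.0 is correctly treated as newer than 2.5.1. The Java runtime is left out of the check because Java 8 reports its version in a different scheme (1.8) than later releases.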

Get involved