anssr data engine #1

9 machines, 18 units

Overview

The Anssr Data Engine bundle is our reference implementation of all the components we offer with the Anssr Data Platform.

All the components are aware of their relations and configure themselves automatically on any of the leading cloud platforms.

Bundle Composition

  • Apache Hadoop Client
  • Ganglia
  • Ganglia Node
  • Namenode
  • Apache Hadoop Plugin
  • Apache Hadoop Resource Manager
  • RSyslog
  • RSyslog Forwarder
  • Hadoop Slave
  • Apache Spark
  • Apache Zookeeper
  • Apache Drill
  • Apache Zeppelin
  • Pig
  • Hive
  • HBase
  • Mahout
  • Flume HDFS
  • Flume Kafka
  • Apache Kafka

Deploying

To deploy this stack, you can simply push the deploy button at the top of the page or run:

juju deploy cs:~spiculecharms/bundle/anssr-data-engine

This will spin up 9 machines and deploy the 20 components to their respective machines. Deployment time varies with cloud and network performance, but it usually takes about 20 minutes until you have a fully operational and scalable Hadoop platform.

Verifying

To check that all the components have deployed successfully, check the Status tab in the Juju GUI or run:

juju status

Then ensure that none of the units are reporting an error state.
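
With 18 units, scanning the status output by eye is easy to get wrong. The following is a minimal sketch of an automated check, not part of the bundle itself. The helper name find_error_lines is hypothetical, and the check assumes the default tabular output of juju status, where a state of "error" appears in the second or third column. It is a rough heuristic and may miss errors reported in other columns.

```shell
# Hypothetical helper: reads tabular `juju status` output on stdin and
# prints every line whose second or third column is exactly "error".
# Returns 0 when no such line is found and 1 otherwise.
find_error_lines() {
  awk '
    $2 == "error" || $3 == "error" { print; found = 1 }
    END { exit (found ? 1 : 0) }
  '
}
```

A possible use is: juju status | find_error_lines && echo "no units in error state"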

Monitoring

Scaling

To scale a charm, select it in the GUI, then in the menu on the left select the units and enter the number of extra units you require. Alternatively, run:

 juju add-unit -n 1 <charm name>

Where 1 is the number of new units you want and <charm name> is the name of the charm you want to scale.
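
When scaling from a script, it can help to validate the arguments before calling Juju. The sketch below is illustrative only. The function name scale_cmd is hypothetical, and it only prints the juju add-unit command instead of running it, so the result can be reviewed first. The name "slave" used in the example is an assumed application name; check juju status for the real names in your deployment.

```shell
# Hypothetical helper: validates a unit count and a charm name, then
# prints the matching `juju add-unit` command without executing it.
scale_cmd() {
  count="$1"
  charm="$2"
  # Reject an empty count, any non-digit characters, and a count of zero.
  case "$count" in
    ''|*[!0-9]*|0)
      echo "count must be a positive integer" >&2
      return 1
      ;;
  esac
  # Reject a missing charm name.
  if [ -z "$charm" ]; then
    echo "charm name is required" >&2
    return 1
  fi
  echo "juju add-unit -n $count $charm"
}
```

For example, scale_cmd 2 slave prints the command for adding two units of the assumed "slave" application.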

Issues

Contact Information

You can get help and support for this bundle from:

info@anssr.io

Resources

Bundle configuration