apache analytics sql #24

Apache Analytics with SQL

This bundle is an 8 node cluster designed to scale out. Built around Apache
Hadoop components and MySQL, it contains the following units:

  • 1 NameNode
  • 1 ResourceManager
  • 3 Slaves
  • 1 Hive
  • 1 Plugin (colocated on the Hive unit)
  • 1 MySQL

Usage

Deploy this bundle using juju-quickstart:

juju quickstart apache-analytics-sql

See juju quickstart --help for deployment options, including machine
constraints and how to deploy a locally modified version of the
apache-analytics-sql bundle.yaml.

Verify the deployment

The services provide extended status reporting to indicate when they are ready:

juju status --format=tabular

This is particularly useful when combined with watch to track the on-going
progress of the deployment:

watch -n 0.5 juju status --format=tabular

The charm for each core component (namenode, resourcemanager, hive)
also each provide a smoke-test action that can be used to verify that each
component is functioning as expected. You can run them all and then watch the
action status list:

juju action do namenode/0 smoke-test
juju action do resourcemanager/0 smoke-test
juju action do hive/0 smoke-test
watch -n 0.5 juju action status

Eventually, all of the actions should settle to status: completed. If
any go instead to status: failed then it means that component is not working
as expected. You can get more information about that component's smoke test:

juju action fetch <action-id>

Scale out

This bundle was designed to scale out. To increase the amount of
slaves, you can add units to the slave service. To add one unit:

juju add-unit slave

Or you can add multiple units at once:

juju add-unit -n4 slave

Contact Information

Help