Apache Drill Drillbit
Query any non-relational datastore (well, almost...)
Drill supports a variety of NoSQL databases and file systems,
including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3,
Azure Blob Storage, Google Cloud Storage, Swift, NAS and local files.
A single query can join data from multiple datastores. For example,
you can join a user profile collection in MongoDB with a directory
of event logs in Hadoop.
Drill's datastore-aware optimizer automatically restructures a
query plan to leverage the datastore's internal processing capabilities.
In addition, Drill supports data locality, so it's a good idea to
co-locate Drill and the datastore on the same nodes.
To deploy this charm simply run:
juju deploy cs:openjdk
juju deploy apache-zookeeper zookeeper
juju add-unit -n 2 zookeeper  # optional, but recommended for a quorum
juju deploy cs:~spicule/drillbit
juju add-relation drillbit zookeeper
juju add-relation drillbit openjdk
juju expose drillbit
(If you run this on LXD Local, see the known issue below.)
Currently there isn't much in the way of action and relation support;
this will come shortly.
There is a web console running on port 8047 of the Drillbit unit: http://:8047/
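Port 8047 also serves Drill's REST API, which can be handy for a quick smoke test after deployment. The sketch below is not part of the charm: `DRILL_HOST` is an assumed variable holding the address of an exposed drillbit unit (as shown by `juju status`), and the helper names `drill_payload` and `drill_query` are made up for illustration.

```shell
# Assumption: DRILL_HOST holds the address of an exposed drillbit unit.
DRILL_HOST="${DRILL_HOST:-127.0.0.1}"

# Build the JSON body for Drill's /query.json endpoint.
# Note: this naive version does not escape double quotes inside the query.
drill_payload() {
    printf '{"queryType": "SQL", "query": "%s"}' "$1"
}

# POST a query to the Drill REST API and print the JSON response.
drill_query() {
    curl -s -X POST \
        -H 'Content-Type: application/json' \
        -d "$(drill_payload "$1")" \
        "http://${DRILL_HOST}:8047/query.json"
}

# Example (requires a running cluster):
#   drill_query 'SELECT * FROM sys.drillbits'
```

Querying `sys.drillbits` is a reasonable first check because it lists the Drillbits that have registered with the cluster.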
If you are running a Juju-hosted MongoDB charm, you can test the MongoDB
SQL support by running:
juju add-relation mongodb drillbit
This will create a new storage entry on your Drill cluster with connections
to your MongoDB cluster.
To query it, you can either connect to Drill via JDBC or open a shell on a unit:
juju ssh drillbit/0
You should see a connection called something like: juju_mongo_mongodb..
Now you can do:
select * from mytable;
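A typical session lists the available schemas, switches to the MongoDB one, and then queries a collection. The helper below only prints such a script so it can be pasted into a Drill session. Its name is hypothetical, and the schema and collection passed to it are placeholders: use whatever `SHOW DATABASES` reports on your cluster.

```shell
# Print a small Drill SQL script for exploring a MongoDB-backed schema.
# $1: schema name as reported by SHOW DATABASES (placeholder)
# $2: collection to query (placeholder)
mongo_query_script() {
    printf 'SHOW DATABASES;\nUSE %s;\nSELECT * FROM `%s` LIMIT 10;\n' "$1" "$2"
}

# Example:
#   mongo_query_script myschema mytable
```

The `LIMIT 10` is only a precaution so that a first test query does not pull back an entire collection.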
If you are running a Hadoop setup, you can also test the HDFS connectivity.
juju add-relation drillbit namenode
This will add a datasource entry for your Hadoop namenode. You can then query CSV/JSON/Parquet files.
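Drill addresses files by path inside a storage plugin, with the path quoted in backticks. As a sketch, the function below prints such a query. The function name, plugin name and file path are all placeholders, since the name of the datasource entry created by the relation is not stated here.

```shell
# Print a Drill query that samples a file exposed through a file-system plugin.
# $1: storage plugin (datasource) name, placeholder
# $2: path to a CSV, JSON or Parquet file or directory, placeholder
file_query() {
    printf 'SELECT * FROM %s.`%s` LIMIT 10;\n' "$1" "$2"
}

# Example:
#   file_query dfs /data/events.json
```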
You can simply add new units and they will be added to the cluster automatically:
juju add-unit -n 2 drillbit
If you run this on LXD Local, there is a bug where the hostname of the LXD
container is not set and Drill fails to start. For now, you need to edit /etc/hosts
and add the hostname to the localhost line, ensuring that the hostname
resolves. Once that works:
drill_url: Allows you to set an alternative download URL for Apache Drill.
cluster_id: Allows you to set an alternative cluster ID for ZooKeeper.
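Both options can be supplied at deploy time through a YAML file keyed by the application name, which is the usual Juju 2.x convention. The snippet below is a sketch under that assumption. The file name, URL and cluster ID are placeholder values, not defaults of the charm.

```shell
# Write a Juju config file for the drillbit application.
# All values below are placeholders; replace them with your own.
CONFIG_FILE="${CONFIG_FILE:-drillbit-config.yaml}"
cat > "$CONFIG_FILE" <<'EOF'
drillbit:
  drill_url: "https://mirror.example.org/apache-drill.tar.gz"
  cluster_id: "my-drill-cluster"
EOF

# Then deploy with:
#   juju deploy cs:~spicule/drillbit --config "$CONFIG_FILE"
```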
If you require commercial support for this charm or Apache Drill, please contact us and we'd be happy to help.
Email us at firstname.lastname@example.org and we can arrange a call to discuss your requirements.