big data cloudera cluster kerberos hive oracle python linux spark security hortonworks impala zeppelin jupyterlab jupyterhub etl red hat hive.server2.zookeeper.namespace jupyter architecture high availability framework jdbc application yum bda data analysys pyspark it big data sql centos data science nifi environment livy variables sql multi tenancy tuning solutions open source encrypting masking gdpr tools hive on spark performance hive.zookeeper.quorum hive.zookeeper.client.port software collections hive metadata zookeeper repos concurrency load balancing hive interpreter packaging hivedriver apache scl hdp hdf latest custom versions mpack repository ssl tls keystore business intelligence development devops howto machine learning parallel ssh systemd grafana prometheus monitoring jmx pushgateway webservice metrics kafka script authentication ldap active directory impersonation scala r pip hadoop technology java maria db mysql sentry hue cdh scheduler oozie airflow mariadb hiveserver2 ad hoc query data layer erlang rabbitmq rpm ambari
See more