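Cloudera Manager is an end-to-end application for managing CDH clusters. It sets the standard for enterprise deployment by providing granular visibility into and control over every part of the CDH cluster, enabling operators to improve performance, enhance quality of service, increase compliance, and reduce administrative costs. With Cloudera Manager, you can easily deploy and centrally operate the complete CDH stack and other managed services. The application automates the installation process, reducing deployment time from weeks to minutes; provides a cluster-wide, real-time view of hosts and running services; provides a single, central console for making configuration changes across the cluster; and integrates a range of reporting and diagnostic tools to help you optimize performance and utilization. This primer introduces the basic concepts, structure, and functions of Cloudera Manager.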
Terminology:
To use Cloudera Manager effectively, you should first understand its terminology. The terms and their definitions are as follows:
deployment
A configuration of Cloudera Manager and all the clusters it manages.
dynamic resource pool
In Cloudera Manager, a named configuration of resources and a policy for scheduling the resources among YARN applications or Impala queries running in the pool.
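cluster
In Cloudera Manager, a logical entity that contains a set of hosts, a single version of CDH installed on the hosts, and the service and role instances running on the hosts.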
host
In Cloudera Manager, a physical or virtual machine that runs role instances. A host can belong to only one cluster.
rack
In Cloudera Manager, a physical entity that contains a set of physical hosts typically served by the same switch.
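service
In Cloudera Manager, a category of managed functionality, which may be distributed or not, running in a cluster. Sometimes referred to as a service type. For example: HDFS and YARN.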
service instance
In Cloudera Manager, an instance of a service running on a cluster. For example: "HDFS-1" and "yarn". A service instance spans many role instances.
role
In Cloudera Manager, a category of functionality within a service. For example, the HDFS service has the following roles: NameNode, SecondaryNameNode, DataNode, and Balancer. Sometimes referred to as a role type. See also user role.
role instance
In Cloudera Manager, an instance of a role running on a host. It typically maps to a Unix process. For example: "NameNode-h1" and "DataNode-h1".
role group
In Cloudera Manager, a set of configuration properties for a set of role instances.
host template
A set of role groups in Cloudera Manager. When a template is applied to a host, a role instance from each role group is created and assigned to that host.
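gateway
In Cloudera Manager, a role that designates a host that should receive a client configuration for a service when the host does not have any role instances for that service running on it.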
parcel
A binary distribution format that contains compiled code and meta-information such as a package description, version, and dependencies.
static service pool
In Cloudera Manager, a static partitioning of total cluster resources—CPU, memory, and I/O weight—across a set of services.
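The relationships among several of these terms can be sketched as a small data model. This is only an illustration of the glossary, not Cloudera Manager's actual schema or API: the class names, field names, and the `add_role_instance` helper are hypothetical, and the sample names reuse the examples from the definitions above.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class RoleInstance:
    name: str       # e.g. "DataNode-h1"
    role_type: str  # the role, e.g. "DataNode"
    host: str       # the host this instance runs on


@dataclass
class ServiceInstance:
    name: str          # e.g. "HDFS-1"
    service_type: str  # the service, e.g. "HDFS"
    role_instances: List[RoleInstance] = field(default_factory=list)


@dataclass
class Cluster:
    name: str
    hosts: Set[str] = field(default_factory=set)
    services: List[ServiceInstance] = field(default_factory=list)

    def add_role_instance(self, service: ServiceInstance, role: RoleInstance) -> None:
        # A role instance runs on a host, and that host must belong to this cluster.
        if role.host not in self.hosts:
            raise ValueError(f"host {role.host!r} is not in cluster {self.name!r}")
        service.role_instances.append(role)


cluster = Cluster(name="cluster-1", hosts={"h1", "h2"})
hdfs = ServiceInstance(name="HDFS-1", service_type="HDFS")
cluster.services.append(hdfs)

# One service instance spans several role instances on different hosts.
cluster.add_role_instance(hdfs, RoleInstance("NameNode-h1", "NameNode", "h1"))
cluster.add_role_instance(hdfs, RoleInstance("DataNode-h1", "DataNode", "h1"))
cluster.add_role_instance(hdfs, RoleInstance("DataNode-h2", "DataNode", "h2"))
```

In this sketch a host is just a name in the cluster's host set; a fuller model would also track racks, role groups, and host templates.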
Architecture:
- Agent - installed on every host. The agent is responsible for starting and stopping processes, unpacking configurations, triggering installations, and monitoring the host.
- Management Service - a service consisting of a set of roles that perform various monitoring, alerting, and reporting functions.
- Database - stores configuration and monitoring information. Typically, multiple logical databases run across one or more database servers. For example, the Cloudera Manager Server and the monitoring roles use different logical databases.
- Cloudera Repository - repository of software for distribution by Cloudera Manager.
- Clients - the interfaces for interacting with the server:
  - Admin Console - Web-based UI with which administrators manage clusters and Cloudera Manager.
  - API - API with which developers create custom Cloudera Manager applications.
State Management:
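The Cloudera Manager Server maintains the state of the cluster. This state can be divided into two categories, "model" and "runtime", both of which are stored in the Cloudera Manager Server database.
- Model state captures what is supposed to run where, and with what configuration.
- Runtime state is what processes are running where, and what commands are currently running.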
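One way to picture the difference between the two kinds of state is to compare a desired placement with what is actually running. The sketch below is an assumption-laden illustration, not how Cloudera Manager is implemented: the dictionary layout and the `plan_actions` function are hypothetical, and configuration and in-flight commands are left out.

```python
from typing import Dict, List, Tuple


def plan_actions(
    model: Dict[str, List[str]], runtime: Dict[str, List[str]]
) -> Dict[str, List[Tuple[str, str]]]:
    """Compare desired role placement (model) with running roles (runtime)."""
    desired = {(host, role) for host, roles in model.items() for role in roles}
    running = {(host, role) for host, roles in runtime.items() for role in roles}
    return {
        "start": sorted(desired - running),  # should run but is not running
        "stop": sorted(running - desired),   # is running but should not be
    }


# Hypothetical model state: what is supposed to run on each host.
model_state = {"h1": ["NameNode", "DataNode"], "h2": ["DataNode"]}
# Hypothetical runtime state: what is actually running on each host.
runtime_state = {"h1": ["NameNode"], "h2": ["DataNode", "Balancer"]}

actions = plan_actions(model_state, runtime_state)
print(actions)
```

With these sample inputs, the DataNode on h1 appears under "start" and the Balancer on h2 under "stop".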
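Configuration Management:
Cloudera Manager defines configuration at several levels:
- The service level can define configurations that apply to the entire service instance, such as the HDFS service's default replication factor (dfs.replication).
- The role group level can define configurations that apply to the member roles, such as the DataNodes' handler count (dfs.datanode.handler.count). This can be set differently for different groups of DataNodes. For example, DataNodes running on more capable hardware may have more handlers.
- The role instance level can override configurations inherited from its role group. This should be used sparingly, because it easily leads to configuration divergence within the role group. One example usage is to temporarily enable debug logging in a specific role instance to troubleshoot an issue.
- Hosts have configurations related to monitoring, software management, and resource management.
- Cloudera Manager itself has configurations related to its own administrative operations.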
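The override relationship between role groups and role instances can be sketched as a simple lookup: an instance-level value wins if present, otherwise the role group's value applies. This is a minimal illustration under stated assumptions, not Cloudera Manager's resolution logic. The function `effective_value`, the numeric values, and the `log_level` key are hypothetical; only `dfs.datanode.handler.count` comes from the text above.

```python
from typing import Dict, Optional


def effective_value(
    key: str,
    role_group_config: Dict[str, str],
    instance_overrides: Dict[str, str],
) -> Optional[str]:
    """Return the instance override if set, else the role group's value."""
    if key in instance_overrides:
        return instance_overrides[key]
    return role_group_config.get(key)


# Two hypothetical DataNode role groups with different handler counts.
default_group = {"dfs.datanode.handler.count": "10", "log_level": "INFO"}
big_hardware_group = {"dfs.datanode.handler.count": "30", "log_level": "INFO"}

# Temporary override on one role instance to troubleshoot an issue.
datanode_h1_overrides = {"log_level": "DEBUG"}

handlers = effective_value(
    "dfs.datanode.handler.count", default_group, datanode_h1_overrides
)
log_level = effective_value("log_level", default_group, datanode_h1_overrides)
```

Removing the entry from `datanode_h1_overrides` returns the instance to its role group's value, which is why temporary overrides are easy to undo but also easy to forget.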