Lenovo Big Data Validated Design for Cloudera Enterprise and VMware with System x ServersReference Architecture
This document describes the reference architecture for Cloudera Enterprise including virtualization with VMware. It provides a predefined and optimized hardware infrastructure for the Cloudera Enterprise, a distribution of Apache Hadoop and Apache Spark with enterprise-ready capabilities from Cloudera.
This reference architecture provides the planning, design considerations, and best practices for implementing Cloudera Enterprise with Lenovo products including System x servers. Jointly tested and validated by Lenovo, Cloudera and VMware, the predefined configuration provides a baseline configuration for a big data solution, which can be modified, based on the specific customer requirements, such as lower cost, improved performance, and increased reliability.
The intended audience of this document is IT professionals, technical architects, sales engineers, and consultants to assist in planning, designing, and implementing the big data solution with Lenovo hardware.
Note: For Cloudera Enterprise with ThinkSystem servers, see http://lenovopress.com/lp0776.
Table of Contents
Business problem and business value
Appendix: Bill of Materials
Changes in the June 20 update:
- Added sections on virtualized Hadoop with CDH 5.11 on VMware vSphere 6.5