Lenovo Big Data Reference Design for Cloudera Data Platform on ThinkSystem ServersReference Architecture

30 Jun 2021
Form Number
PDF size
32 pages, 1.2 MB


This document describes the reference design for Cloudera Data Platform software on ThinkSystem servers. It provides architecture guidance for designing optimized hardware infrastructure for the Cloudera Data Platform Private Cloud edition, a distribution of Apache Hadoop and Apache Spark with enterprise-ready capabilities from Cloudera. This reference design provides the planning, design considerations, and best practices for implementing Cloudera Data Platform with Lenovo products.

The intended audience for this reference architecture is IT professionals, technical architects, sales engineers, and consultants to assist in planning, designing, and implementing the big data solution with Lenovo hardware. It is assumed that you are familiar with Hadoop components and capabilities.

Table of Contents

  1. Introduction
  2. Business problem and business value
  3. Requirements
  4. Architectural Overview
  5. Component Model
  6. Operational Model
  7. Resources

To view the document, click the Download PDF button.

Change History

Changes in the June 30, 2021 update:

  • Add new Server Selection, section 6.1.1
  • Add more description in 2 Business problem and business value, section 2
  • Update software version and reference link

Related product families

Product families related to this document are the following: