The requirements for data management are constantly evolving. Enterprises now need to manage data and data-centric workloads in a unified and comprehensive manner.
Use cases previously focused on storing and processing data efficiently in batch. Now there is a growing need to integrate the entire data life cycle and to process data in both real time and batch.
Technology infrastructure used to demand the co-location of compute and storage to avoid costly network transfers. Now the needs of high-performance analytics drive a move toward disaggregated storage, and segregated compute, memory, and SSD.
From a user experience perspective, it used to be acceptable to deploy and run in timeframes of weeks, months, or even quarters. Now the expectation is to be able to spin up services in minutes, give users their own clusters, and get insights quickly.
From the privacy, security, and governance perspectives, the primary concerns were formerly about network perimeter and physical access controls. Now, with the entire data life cycle being managed, operators need fine-grained authentication and authorization at the workload and data layers.
CDP Data Center unifies Cloudera Distribution for Apache Hadoop (CDH) and Hortonworks Data Platform (HDP). It combines the best technologies from Cloudera and Hortonworks, with new features and enhancements across the stack, to form a comprehensive data platform that encompasses the entire data life cycle. This unified distribution is a scalable and customizable platform where you can securely run many types of data analytics workloads.
CDP Data Center, as a comprehensive data management and analytics platform for on-premises IT environments, includes such capabilities as:
CDP Data Center contains several preconfigured packages of data services, or shapes, for common workloads, including: