CDP Public Cloud Preview Features
The information in these pages is released as part of a preview for the features described. Access to preview features is provided upon request to customers for trial and evaluation. The components are provided ‘as is’ without warranty or support. Further, Cloudera assumes no liability for the use of preview components, which should be used by customers at their own risk. Please contact your Cloudera account team to have a preview feature enabled in your CDP account.
Data Hub
- Fine-grained Access Control from ABFS File Browser in Hue
-
published: 2021-10-21;
modified: 2021-10-21
Learn how to enable fine-grained access to ADLS Gen2 containers from the ABFS File Browser and Importer in Hue. - Fine-grained Access Control from S3 File Browser in Hue
-
published: 2021-10-21;
modified: 2021-10-21
Learn how to enable fine-grained access to S3 buckets from the S3 File Browser and Importer in Hue.
Data Engineering
- CDE In-place Upgrades
-
published: 2022-07-20;
modified: 2022-07-20
Cloudera Data Engineering (CDE) now supports upgrades from CDE 1.14 on both AWS and Azure.
DataFlow
- CDF Service Upgrade
-
published: 2022-06-28;
modified: 2022-06-28
Cloudera DataFlow (CDF) now supports upgrades from CDF 2.0.0 on both AWS and Azure.
Data Warehouse
- Enabling Fine-grained Access Control for Hue in Cloudera Data Warehouse for AWS environments
-
published: 2022-04-27;
modified: 2022-04-27
Learn how to enable fine-grained access to S3 buckets from Hue in Cloudera Data Warehouse. - Enabling Fine-grained Access Control for Hue in Cloudera Data Warehouse for Azure environments
-
published: 2022-04-27;
modified: 2022-04-27
Learn how to enable fine-grained access to ADLS Gen2 containers from Hue in Cloudera Data Warehouse. - Add Access to External S3 Buckets for CDW Clusters on AWS
-
published: 2021-05-12;
modified: 2021-05-25
Learn how to use the CDW UI to add access to external S3 buckets for CDW environments that run on AWS. - Azure Spot instances for Virtual Warehouses
-
published: 2021-09-28;
modified: 2021-09-28
Cloudera Data Warehouse (CDW) now supports using Azure spot instances for Virtual Warehouses to reduce costs if you do not need fault tolerance. - Enable SSO for JDBC/ODBC Connections to Virtual Warehouses
-
published: 2021-05-21;
modified: 2021-05-25
Enable single sign-on (SSO) for third-party BI tool connections to Virtual Warehouses that use JDBC and ODBC. - Managed Storage Access for AWS
-
published: 2021-10-21;
modified: 2022-02-01
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for AWS. - Managed storage access for Azure
-
published: 2021-10-21;
modified: 2021-10-21
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for Azure. - Reserving nodes for auto-scaling
-
published: 2022-07-26;
modified: 2022-07-26
To speed up Virtual Warehouse startup and autoscaling, keep some number of compute instances on standby. You configure extra buffer nodes to stand by, ready to join a new compute or autoscaled cluster.
Governance
- Integrating CDP Data Catalog with AWS Glue Data Catalog
-
published: 2021-08-09;
modified: 2021-12-08
While using AWS Glue in Data Catalog, you will be able to experience a complete snapshot metadata view, along with other visible attributes that can power your data governance capabilities. - Navigating to tables and databases in Hue using Data Catalog
-
published: 2021-08-07;
modified: 2021-08-07
The integration between Data Catalog and Cloudera Data Warehouse (CDW) service provides a direct web link to the Hue instance from the Data Catalog web UI, making it easy to navigate across services. - Support for CDP Private Cloud Base clusters in Data Catalog
-
published: 2022-02-24;
modified: 2022-04-06
Data Catalog now supports discovering and profiling assets that reside in CDP Private Cloud Base clusters. - Supporting High Availability for Profiler services
-
published: 2021-08-07;
modified: 2021-08-07
The Data Catalog profiler services is now supported by enabling the High Availability (HA) feature. - Transitioning Profiler Manager Service into SDX
-
published: 2022-02-24;
modified: 2022-02-24
The Profiler Manager Service is moved to the SDX infrastructure. - Using the Download CSV option
-
published: 2022-02-24;
modified: 2022-02-24
Using the selected data lake, the search result for the current query can be downloaded.
Machine Learning
- PBJ Workbench
-
published: 2022-04-21;
modified: 2022-04-28
The PBJ Workbench features a Jupyter Notebook editor pre-packaged with a runtime image. Data Scientists can easily choose this runtime image when launching a session, and then they can use the familiar Jupyter environment in their Cloudera Machine Learning workspace. - Data Discovery and Exploration
-
published: 2022-04-21;
modified: 2022-04-21
Data Discovery and Exploration enables you to connect to data sources, explore them with SQL commands, and build visualizations and dashboards with that data, all from within CML. - Private Cluster Support
-
published: 2022-01-06;
modified: 2022-01-06
Private Clusters provide a simple way to create a secure cluster, where the API server and the workloads themselves only rely on private IP addresses that are not accessible from the internet. - Experiments with MLflow
-
published: 2021-10-27;
modified: 2021-10-27
Cloudera Machine Learning now supports the MLflow tracking API and makes use of the MLflow client library as the default method to log experiments. - CMK Encryption on AWS
-
published: 2021-08-10;
modified: 2022-08-10
Cloudera Machine Learning on AWS is now able to use a Customer Master Key (CMK) to encrypt data.
Management Console
- Azure VM Encryption at Host
-
published: 2022-06-06;
modified: 2022-06-06
Description: You can optionally enable encryption at host for Data Lake, FreeIPA, and Data Hubs. Currently, you need to enable it individually for each Virtual Machine (VM) on Azure Portal. - Data Lake Scaling
-
published: 2022-05-11;
modified: 2022-05-11
Data Lake scaling is the process of scaling up a light duty Data Lake to the medium duty form factor, which has greater resiliency than light duty and can service a larger number of clients. - New UI for adding a CDP Private Cloud Base cluster
-
published: 2022-03-29;
modified: 2022-03-29
Register a CDP Private Cloud Base cluster as a classic cluster using Cloudera Manager and Knox endpoints so that you can use this cluster in Replication Manager and Data Catalog services. - Public Endpoint Access Gateway for GCP
-
published: 2021-12-17;
modified: 2021-12-17
You can enable Public Endpoint Access Gateway for GCP during GCP environment registration after enabling Cluster Connectivity Manager (CCM). Once activated, the gateway will be used for the Data Lake and all the Data Hubs within the environment.
Replication Manager
- Snapshot Policies in Replication Manager
-
published: 2022-02-25;
modified: 2022-02-25
You can create HDFS and HBase snapshot policies in Replication Manager to schedule taking snapshots of snapshottable HDFS directories and HBase tables at regular intervals. An HDFS directory is snapshottable after it has been enabled for snapshots, or because a parent directory is enabled for snapshots in Cloudera Manager.