Carlos Aguni

Highly motivated self-taught IT analyst. Always learning and ready to explore new skills. An eternal apprentice.


Cloudera Courses

03 Apr 2022
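// Intended to be pasted into the browser console on a course outline page.
// Assumes jQuery ($) is loaded on the page.
// Builds a Markdown checklist of sections and their subsections.
const txt = []
$('li.outline-item').each(function () {
    // Top-level section title
    const h3 = $(this).find('h3.section-title')
    txt.push(`* [ ] ${h3.text().trim()}`)
    // jQuery's .each passes (index, element) to the callback
    $(this).find('li.subsection').each((i, obj) => {
        const h4 = $(obj).find('h4.subsection-title')
        txt.push(`    * [ ] ${h4.text().trim()}`)
    })
})
console.log(txt.join('\n'))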
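On a page without jQuery, the same checklist could be built with plain DOM APIs. The following is a minimal sketch, not the original script: `toChecklist` and `extractOutline` are hypothetical helper names, and the selectors are simply copied from the snippet above, so they may not match every page.

```javascript
// Hypothetical helper: turn [{ title, subsections: [...] }] into a Markdown checklist.
function toChecklist(sections) {
    const lines = []
    for (const section of sections) {
        lines.push(`* [ ] ${section.title}`)
        for (const sub of section.subsections) {
            lines.push(`    * [ ] ${sub}`)
        }
    }
    return lines.join('\n')
}

// Hypothetical helper: read the outline from the page without jQuery.
// Selectors are assumptions copied from the snippet above.
function extractOutline(root) {
    return Array.from(root.querySelectorAll('li.outline-item')).map((item) => {
        const h3 = item.querySelector('h3.section-title')
        const subs = Array.from(item.querySelectorAll('li.subsection')).map((li) => {
            const h4 = li.querySelector('h4.subsection-title')
            return h4 ? h4.textContent.trim() : ''
        })
        return { title: h3 ? h3.textContent.trim() : '', subsections: subs }
    })
}

// Only touch the DOM when one exists (for example, in the browser console).
if (typeof document !== 'undefined') {
    console.log(toChecklist(extractOutline(document)))
}
```

Keeping the formatting in `toChecklist` separate from the DOM lookup means the checklist format can be checked with plain data, without opening a course page.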

[x] Cloudera Essentials for CDP

slides: file.pdf

  • Introduction
    • About This Course
  • Introducing the Enterprise Data Cloud
    • The Evolution of CDP
    • Data access and control layer that provides consistent security and governance for all applications running within CDP, in all locations, both on-premises and in the cloud.
      • Control Plane: also provides a command line and APIs that let admins automate common tasks.
        • Data Catalog
          • Helps users understand, manage, and secure data assets
        • Replication Manager
          • Used to replicate and migrate data and metadata between environments
        • Workload Manager
          • Used to analyze, troubleshoot, and optimize workloads
        • Management Console
          • Provides a single pane of glass for managing all clusters
    • Characteristics of an Enterprise Data Cloud
    • From the Edge to AI: An End-to-End Use Case
    • Test Your Learning (3 Questions)
  • Cloudera Data Platform Overview
    • Learn: Cloudera Data Platform Overview
    • Test Your Learning: Cloudera Data Platform Overview (3 Questions)
      • q1
      • q2
      • q3
    • Role of Cloudera Runtime in CDP?
      • Cloudera Runtime is the distribution package of open source software that combines the best open source software from CDH and HDP and provides the foundation for all CDP deployments.
    • CDP Public Cloud differs in some ways from our older distributions, CDH and HDP. Which two of the following indicate some ways in which CDP Public Cloud differs from these other distributions?
      • CDP Public Cloud emphasizes the use of the cloud provider’s object store, such as S3 on Amazon Web Services or ADLS Gen 2 on Microsoft Azure, for data storage.
      • CDP Public Cloud prefers to separate data storage and compute, whereas CDH and HDP performed these tasks on the same node.
  • Workload and Data Management
    • The Role of an Administrator
    • SDX
    • Managing Resources and Costs
    • Workload Isolation
    • Data Migration and Replication
    • Test Your Learning (3 Questions)
  • Data in Motion
    • The Role of a Data Engineer
    • Data in Motion Use Cases
    • Cloudera DataFlow
    • Test Your Learning (3 Questions)
  • Data Warehousing and Analytics
    • The Role of a Data Analyst
    • Data Warehouse and Analytics Use Cases

    • Operational and Analytic Database Capabilities

    • CDP Data Warehouse Experience
    • Test Your Learning (3 Questions)
  • Data Science and Machine Learning
    • The Role of a Data Scientist
    • Machine Learning Use Cases
    • Cloudera Data Science Workbench (CDSW)
    • Cloudera Machine Learning
    • Fast Forward Labs
    • Test Your Learning (3 Questions)

  • Security and Data Governance
    • Data governance refers to the policies and organizational structures that a company uses to secure its data and to identify where that data comes from, how it’s used, and where it’s used.
    • The Role of a Data Steward
    • Data Catalog
    • Controlling and Auditing Data Access
    • Test Your Learning (3 Questions)

  • Planning for Success
    • Challenges and Opportunities
    • How Cloudera Can Help

    • [x] Test Your Learning

[x] CDP Private Cloud Fundamentals

slides: file.pdf

  • The Enterprise Data Cloud Vision
    • The Enterprise Data Cloud Vision
    • Characteristics of the Enterprise Data Cloud
    • Data Lifecycle

  • Cloudera Data Platform Overview
    • Cloudera Data Platform: Recap
    • How to Eliminate Shadow IT
    • CDP Public Cloud
    • CDP Private Cloud Base
  • Introducing CDP Private Cloud
    • CDP Product Overview
    • CDP Private Cloud Plus
  • CDP Private Cloud Architecture
    • Important Trends
    • Traditional Cluster Architecture (Bare Metal)

    • Limitations of the Traditional Cluster Architecture

    • Key Aspects of the Cloud-Native Architecture
    • What is Kubernetes?
    • Kubernetes Overview
    • Comparing CDP Public and Private Cloud

    • CDP Private Cloud Architecture

  • Installation
    • Installation Requirements

    • Installation Demo
  • Conclusion
    • Essential Points

Introduction to Cloudera Machine Learning

CDP for HDP Users

  • CDP for HDP Users
    • Introduction
    • Cloudera Manager
    • CDSW
    • Hue
    • Apache Impala

    • Apache Kudu

CDP for CDH Users

  • CDP for CDH Users
    • Introduction
    • Apache NiFi

    • Apache Phoenix
    • Apache Ranger

    • Apache Hive and LLAP

    • Apache Atlas
    • Apache Zeppelin

Introduction to Cloudera Data Warehouse: Self-Service Analytics in the Cloud with CDP

  • Cloudera Data Warehouse (CDW) Overview
    • Introduction to the Course
    • What is CDW?
    • Benefits of CDW
    • Test Your Learning: Cloudera Data Warehouse (CDW) Overview
  • The CDW Web Interface
    • How to Access CDW
    • CDW Orientation
    • Test Your Learning: The CDW Web Interface
  • Choosing Hive LLAP or Impala
    • Choosing Impala or Hive LLAP
    • When to Use Impala
    • When to Use Hive LLAP
  • Creating Database Catalogs and Virtual Warehouses (Data Engineer Track)
    • Creating Database Catalogs
    • Creating Virtual Warehouses
    • Test Your Learning: Creating Database Catalogs and Virtual Warehouse
  • Managing Virtual Warehouses (Data Engineer Track)
    • Auto-Scaling
    • Additional Tuning for Hive Virtual Warehouses
    • Additional Tuning for Impala Virtual Warehouses
    • Test Your Learning: Managing Virtual Warehouses
  • Querying Data from the CDW Web Interface (Data Analyst Track)
    • Accessing Query Editors
    • Setting Workload Password
    • Querying with Hue
    • Querying with DAS
    • Test Your Learning: Querying Data from the CDW Web Interface
  • Querying Data Using CLI and Third-Party Integration
    • Using Impala Shell
    • Using Third-Party Tools
    • Test Your Learning: Querying Data Using CLI and Third-Party Integration

HDP Overview: Apache Hadoop Essentials

Deploying and Scaling Cloudera Enterprise on Microsoft Azure

Cloudera Director: An Introduction

AWS Fundamentals for CDP Public Cloud