info@caliberacademy.in
(+91) 7993030801

Hadoop Admin

img

Hadoop

Become an expert Apache Hadoop administrator, Master the elements of cluster monitoring, governance, security, and troubleshooting.

Learn Apache Hadoop from the leading hadoop expert. Sixty-five percent of the current Fortune 100 are using big data to drive their business.

You can too, by getting expert training through Skillathon integrated course —the industry's only truly dynamic Hadoop training curriculum that’s updated regularly to reflect the state-of-the-art in big data domain.

Core Java: Part 1

  • The Case for Apache Hadoop
    • Why Hadoop?
    • Fundamental Concepts
    • Core Hadoop Components
  • Hadoop Cluster Installation
    • Rationale for a Cluster Management Solution
    • Cloudera Manager Features
    • Cloudera Manager Installation
    • Hadoop (CDH) Installation
  • The Hadoop Distributed File System (HDFS)
    • HDFS Features
    • Writing and Reading Files
    • NameNode Memory Considerations
    • Overview of HDFS Security
    • Web UIs for HDFS
    • Using the Hadoop File Shell
  • MapReduce and Spark on YARN
    • The Role of Computational Frameworks
    • YARN: The Cluster Resource Manager
    • MapReduce Concepts
    • Apache Spark Concepts
    • Running Computational Frameworks on YARN
    • Exploring YARN Applications Through the Web UIs, & the Shell.
    • YARN Application Logs
  • Hadoop Configuration and Daemon Logs
    • Cloudera Manager Constructs for Managing Configurations
    • Locating Configurations and Applying Configuration Changes
    • Managing Role Instances and Adding Services
    • Configuring the HDFS Service
    • Configuring Hadoop Daemon Logs
    • Configuring the YARN Service
  • Getting Data Into HDFS
    • Ingesting Data From External Sources With Flume
    • Ingesting Data From Relational Databases With Sqoop
    • Best Practices for Importing Data.
  • Planning Your Hadoop Cluster
    • General Planning Considerations
    • Choosing the Right Hardware
    • Virtualization Options
    • Network Considerations
    • Configuring Nodes
  • Advanced Cluster Configuration
    • Advanced Configuration Parameters
    • Configuring Hadoop Ports
    • Configuring HDFS High Availability
    • Configuring the HDFS Service
  • Hadoop Security
    • Why Hadoop Security is Important
    • Hadoop’s Security System Concepts
    • What Kerberos is and how it Works
    • Securing a Hadoop Cluster With Kerberos
    • Other Security Concepts
  • Managing Resources
    • Configuring cgroups with Static Service Pools
    • The Fair Scheduler
    • Configuring Dynamic Resource Pools
    • YARN Memory and CPU Settings
  • Cluster Maintenance
    • Checking HDFS Status
    • Copying Data Between Clusters
    • Adding and Removing Cluster Nodes
    • Rebalancing the Cluster
    • Directory Snapshots
    • Cluster Upgrading
  • Cluster Monitoring and Troubleshooting
    • Cloudera Manager Monitoring Features
    • Monitoring Hadoop Clusters
    • Troubleshooting Hadoop Clusters
    • Common Misconfigurations