Big Data

Hadoop:

This course is designed to give you an understanding of key Big Data technologies. We begin by describing what Big Data is and why Hadoop is needed to process that data in a timely manner. We then describe the Hadoop architecture and how to work with the Hadoop Distributed File System (HDFS). The course also covers techniques for moving data into Hadoop, which range from simple Hadoop shell commands to more sophisticated processes.

Duration: 4 Days
Mode: Online, Instructor-Led

HBase:

This course provides you with a thorough understanding of the HBase data model and architecture, which you need before designing HBase schemas and developing HBase applications.
The goal of this course is to enable you to design HBase schemas based on design guidelines. You will learn about the various elements of schema design and how to design for data access patterns. The course offers an in-depth look at designing row keys, avoiding hotspotting, and designing column families. It discusses how to transition from a relational model to an HBase model. You will also learn the differences between tall tables and wide tables.
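
As a taste of the row key design topic, here is a minimal sketch of one common way to reduce hotspotting: prefixing the row key with a small "salt" bucket so that writes spread across regions. It is plain Python with no HBase client involved. The function name, the bucket count, the key layout, and the sample device ID are all hypothetical choices for illustration, not material taken from the course.

```python
import hashlib

NUM_BUCKETS = 4  # hypothetical number of salt buckets


def salted_row_key(device_id: str, timestamp_ms: int, buckets: int = NUM_BUCKETS) -> str:
    """Build a row key of the form '<salt>|<device_id>|<timestamp>'.

    The salt is derived from the device ID, so every row for one device
    shares the same prefix, while different devices are spread over
    several key ranges instead of piling onto one region.
    """
    digest = hashlib.md5(device_id.encode("utf-8")).digest()
    salt = digest[0] % buckets
    # Zero-pad the timestamp so keys sort chronologically as strings.
    return f"{salt:02d}|{device_id}|{timestamp_ms:013d}"


key = salted_row_key("sensor-1", 1700000000000)
print(key)
```

Because the salt here depends only on the device ID, reading one device's history is still a single prefix scan. A salt based on a rotating counter would spread writes more evenly but would require one scan per bucket on reads.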

Duration: 4 Days
Mode: Online, Instructor-Led

Apache Spark:

This course gets you started developing big data applications with Apache Spark. It describes the various modes for launching a Spark application, and you will build and launch a standalone Spark application. You will also learn to create and modify pair RDDs, perform aggregations, and control the layout of pair RDDs across nodes with data partitioning. The course introduces Spark SQL and DataFrames, the programming abstraction of Spark SQL, and describes the components of the Spark execution model, using the Spark Web UI to monitor Spark applications.
You will also cover the following Apache Spark libraries: Spark Streaming, Spark SQL, Spark MLlib, and Spark GraphX. The course describes the benefits of the Apache Spark unified platform and how to build a data pipeline application using Spark Streaming, Spark SQL, MLlib, and GraphX.
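
To give a feel for the pair RDD topics, the sketch below mimics two of the ideas in plain Python: hash partitioning of key-value pairs, followed by a per-key aggregation inside each partition. It deliberately does not use the Spark API; the function names and sample data are hypothetical, and real Spark distributes this work across nodes rather than running it in one process.

```python
def hash_partition(pairs, num_partitions):
    """Assign each (key, value) pair to a partition based on its key's hash."""
    partitions = [[] for _ in range(num_partitions)]
    for key, value in pairs:
        partitions[hash(key) % num_partitions].append((key, value))
    return partitions


def reduce_by_key(partition, func):
    """Combine all values that share a key within one partition."""
    result = {}
    for key, value in partition:
        result[key] = func(result[key], value) if key in result else value
    return result


pairs = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("b", 5)]
partitions = hash_partition(pairs, 2)

# Every occurrence of a key lands in the same partition, so the
# per-partition results can be merged without combining values again.
totals = {}
for part in partitions:
    totals.update(reduce_by_key(part, lambda x, y: x + y))

print(totals)
```

The design point this illustrates is that once data is partitioned by key, a per-key aggregation needs no further data movement between partitions.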

Duration: 4 Days
Mode: Online, Instructor-Led