Wed Jul 3, 9:00 AM - Thu Jul 4, 5:00 PM
Park Avenue, 39th Floor, New York, NY 10167, United States

Community: Midtown Manhattan

Description

Students will understand the overall big data space and the technologies involved, and will get a detailed overview of Apache Hadoop.

Event Details

Course Description: This big data training course provides a technical overview of Apache Hadoop for project managers, business managers and data analysts. Students will understand the overall big data space and the technologies involved, and will get a detailed overview of Apache Hadoop. The course uses real-world use cases to illustrate the capabilities of Apache Hadoop. Students will also learn about YARN and HDFS, and how to develop applications and analyze big data stored in Apache Hadoop using Apache Pig and Apache Hive. Each topic provides students with hands-on experience.

Introduction to Big Data

● Big Data – beyond the obvious trends

● Exponentially increasing data

● Big data sources

● Data warehousing, business intelligence, analytics, predictive statistics, data science

Survey of Big Data technologies

● First generation systems

● Second generation systems

● Enterprise search

● Visualizing and understanding data with processing

● NoSQL databases

● Apache Hadoop

Introduction to Hadoop

● What is Hadoop? Who are the major vendors?

● A dive into the Hadoop Ecosystem

● Benefits of using Hadoop

● How to use Hadoop within your infrastructure?

Introduction to MapReduce

● What is MapReduce?

● Why do you need MapReduce?

● Using MapReduce with Java and Ruby

Introduction to YARN

● What is YARN?

● What are the advantages of using YARN over classical MapReduce?

● Using YARN with Java and Ruby

Introduction to HDFS

● What is HDFS?

● Why do you need a distributed file system?

● How is a distributed file system different from a traditional file system?

● What is unique about HDFS when compared to other file systems?

● How reliable is HDFS?

● Does HDFS support compression, checksums and data integrity?

Data Transformation

● Why do you need to transform data?

● What is Pig?

● Use cases for Pig

Structured Data Analysis

● How do you handle structured data with Hadoop?

● What is Hive/HCatalog?

● Use cases for Hive/HCatalog

Loading data into Hadoop

● How do you move your existing data into Hadoop?

● What is Sqoop?

Automating workflows in Hadoop

● Benefits of A
