The document provides an overview of a Hadoop for Developers and Admins course, including the course content which teaches how to use Apache Hadoop and write MapReduce programs, prerequisites of having basic Linux and Java familiarity, and a course outline that covers topics such as understanding distributed systems and Hadoop, running MapReduce programs, Hive, Pig, and using Hadoop in cloud computing. The course is intended for programmers, architects, and project managers who need to process large amounts of data offline. Upon completion, students will be able to use Hadoop and write MapReduce programs, understand how Hadoop supports cloud computing, and explore examples using Amazon Web Services
1 of 5
Download to read offline
More Related Content
Ecaretechnologies hadoop training in bangalore
1. ECARE Technologiesisone of the pioneersinsoftware training.We deliveredthe trainingformany
individuals,groups,andcorporations.We prioritizetogive ourbestqualityof trainingandfurther
assistance.Youhave complete freedomtocustomize course topicsandtime.Onlinetrainingisagood
experience andagenerationahead..Registeryourselfforunbelievable prices.
HADOOPFOR DEVELOPERSANDADMIN COURSE CONTENT
You will learnhowtouse Apache Hadoopand write MapReduce programs.Youwill beginwithaquick
overviewof installingHadoop,settingitupina cluster,andthenproceedtowritingdataanalytic
programs.The course will presentthe basicconceptsof MapReduce applicationsdevelopedusing
Hadoop,includingaclose lookat frameworkcomponents,use of Hadoopfora varietyof data analysis
tasks,and numerousexamplesof Hadoopinaction.The course will furtherexaminerelated
technologiessuchasHive,Pig,andApache Accumulo.Apache Accumuloisahighlyscalable structured
store basedon Google'sBigTable,writteninJavaandoperatesoverthe HadoopDistributedFileSystem
(HDFS).Hive isdata warehouse softwareforqueryingandmanaginglarge datasets.Pigisaplatformto
take advantage of parallelizationwhenrunningdataanalysis.Finally,youwill observe how Hadoop
worksinand supportscloudcomputingandexplore exampleswithAmazonWebServicesandcase
studies.
Who ShouldAttend?
The course isintendedforprogrammers,architects,andprojectmanagerswhohave toprocesslarge
amountsof data offline.
Benefitsof Attendance:
Upon completionof thiscourse,studentswillbe able to:
• Understandbasicconceptsof MapReduce applicationsdevelopedusingHadoop
• UnderstandhowHadoopworksin and supportscloudcomputing andexplore exampleswith
AmazonWebServicesandcase studies
• Use Apache Hadoopand write MapReduce programs
Prerequisites:
2. Studentsshouldhave abasicfamiliaritywithLinuxadministrationandJava,asmost code exampleswill
be writteninJava.Familiaritywithbasicstatistical concepts(e.g.histogram, correlation) ishelpfulto
appreciate the more advanceddataprocessingexamples.
Course Outline:
What isHadoop?
UnderstandingdistributedsystemsandHadoop
ComparingSQL databasesandHadoop
UnderstandingMapReduce
CountingwordswithHadoop—runningyourfirstprogram
Historyof Hadoop
StartingHadoop
The buildingblocksof Hadoop
SettingupSSH for a Hadoopcluster
RunningHadoop
Web-basedclusterUI
Componentsof Hadoop
Workingwithfilesin HDFS
Anatomyof a MapReduce program
Readingandwriting
WritingbasicMapReduce programs
Constructingthe basictemplate of aMapReduce program
Countingthings