BIG DATA AND HADOOP - Inventateq

Transcription

InventateqADVANCEDBIG DATA AND HADOOPCOURSE CURRICULUMYOUR JOB HUNTING ENDS HEREStart your career with Big Data and Hadoop course that getsYour dream job!E-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR1

Become A Big Data & HadoopCertified ProfessionalIt is only skills and not degree that can help you grow. But if you are one of thoseindividuals who believe in getting certified along with skills then we have got youcovered. After completion of the training not only will you become an expert in BigData but you will also be a Big Data & Hadoop certified professional.One Training Program4 CertificationsInventateqCCA ADMINSTRATORCCA SPARK AND HADOOP DEVELOPERCCP DATA ENGINEERIndustry Recognized INVENTATEQ CertificateE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR2

GET TRAININDAND InventateqGE T EMPLOYEDCLASS ROOM TRAININGONLINE TRAININGCORPORATE TRAINING23,409 Trainees500 Batches4.9/5RatingsE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR3

23,409 Success StoriesHere is what they say about INVENTATEQ.Inventateq BTM, it's indeed a great platform to learn Big DataHadoop for fresher's n working professionals as well. Thetrainer, shivank. He been the best with excellent subjectknowledge and gr8 real time experience. His interactionamongst learners made the session even more effective. Hefocused more on practical sessions which really helped us a lot.Quality of learning is gr8 with good placement assistance.Highly recommended.DEEPANELIGIThank you, Shivank Sir, for all your help & support throughoutthe journey for Hadoop and Big Data technologies yourassignments and real time scenarios help me to crackinterviews. Got offer in hand.RAVINDRA RAGHUWANSHIInventateqA good institute to boost your Hadoop knowledge. Helped mea lot to gain a good knowhow of Hadoop ecosystem. Simplybest institute in BTM for big data.MANOHAR JISHUI have taken Hadoop Course with INVENTATEQ early thisyear where the Instructor was very knowledgeable withexcellent communication and presentation skills withawesome time management in wrapping up of each sessionwith real time scenarios. Now, I am with full confidence thatI can crack any interview with my enhanced skills in thistechnology and get placed in Infosys.NANI VINODE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR4

23,409 Success StoriesHere is what they say about INVENTATEQ.I joined to learn HADOOP and got good understanding ofthe subject. Classes are interactive and teachingmethodology is very good. Recommended for anyonewho's looking for this course.LAKSHIMI ROYInventateq marathahalli is one of the best software traininginstitute for fresher’s to start their journey in the field ofHadoop and big data. Here taught us with real timeexperience, hands on experience was really great. I suggestInventateq is the best platform for fresher’s to start theircareer.KOMARA SUMAInventateqAfter completing graduation I have joined for big data andHadoop course in Inventateq Marathahalli. It was amazingexperience with real time classes, and with one hands onproject. I suggest the Inventateq institute if you are afresher and want to become expert in Hadoop.VIJAYA LAKSHIMIBest instructor Best study material 100% support iswhat made my experience with Inventateq very amazingand satisfied too. Can't recommend a better institute. Notonly did i get to learn Data Analytics Hadoop but i got 3great job opportunities within no time and i chose the bestamongstthem.LAKSHMI TEJUE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR5

Training ICATIONATTEND INTERVIEWRESUME PREPARATTIONYOU GOT THE JOB!E-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR6

DETAILED SYLLUBUSTABLE OF CONTENT1Big Data2Hadoop Admin3Hadoop Developer4Python for Hadoop5Java for Hadoop6SQL for Hadoop7Big data with SparkInventateqE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR7

Module 1Big Data introductionand Hadoop Fundamental Data Storage and Analysis Comparison with RDBMSInventateqHDFS ARCHITECTURE Basic Terminologies HDFS Block Concepts Replication Concepts Basic reading & writing of files in HDFS Basic processing concepts in MapReduce Data Flow Anatomy of file READ and WRITEE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR8

Module 2HADOOP ADMINISTRATOR HADOOP GEN1 VS HADOOP GEN 2(YARN) Linux commands Single and Multinode cluster installation (HADOOP Gen 2) AWS (EC2, RDS, S3, IAM and Cloud formation) Cloudera and Hortonworks distribution installation on AWS Cloudera Manager and Ambari Hadoop Security and Commissioning and Decommissioning of nodes Sizing of Hadoop Cluster and Name Node High AvailabilityInventateqE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR9

Module 3DATA INGESTIONSqoop: Migration of data from MYSQL/ ORACLE to HDFS. Creating SQOOP job. Scheduling and Monitoring SQOOP job using OOZIE and Crontab. Incremental and Last modified mode in sqoop.Talend: Installation of Talend big data studio on windows server.Inventateq Creating and Scheduling talend Jobs. Components: tmap, tmssqlinput, tmssqloutput,tFileInputDelimited, tfileoutputdelimited, tmssqloutputbulkexec, tunique,tFlowToIterate,tIterateToFlow, tlogcatcher, tflowmetercatcher, tfilelist,taggregate, tsort, thdfsinput, thdfsoutput, tFilterRow, thiveload.Flume: Flume Architecture Data Ingest in HDFS with Flume Flume Sources Flume Sinks Topology Design ConsiderationsE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR10

Module 4DATA PROCESSINGMapReduce: Env Setup Tool and ToolRunner Mapper Reducer Driver program How to package the job? MapReduce WebUI How MapReduce Job run? Shuffle & Sort Speculative Execution InputFormats Input Splits and Record Reader Default Input Formats Implement Custom Input Format OutputFormats Default Output formats Output Record Reader Compression Map Output Final Output Data types – default Writable vs Writable Comparable Custom Data types – Custom Writable/ComparableInventateqE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR11

File Based Data structures Sequence file Reading and Writing into Sequence file Map File Tuning MapReduce Jobs Advanced MapReduce Sorting Partial Sort Total Sort Secondary Sort Joins Comparison with RDBMS HQL Data types Tables Importing and Exporting Partitioning and Bucketing – Advanced. Joins and Join Optimization. Functions- Built in & user defined Advanced Optimization of HQL Storage File Formats – Advanced Loading and Storing Data SerDes – Advanced Important basics Pig Latin Data types Functions – Built-in, User Defined Loading and Storing DataHive:InventateqPig:E-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR12

Spark: Spark introduction Spark vs MapReduce Intro to spark lib (SparkSql, SparkStreaming, Spark Core)Module 5PYTHON FOR HADOOP1: An Introduction to Python1.1 Brief about the courseInventateq1.2 History/timelines of python1.3 What is python ?1.4 What python can do?1.5 How the name was put up as python1.6 Why python?1.7 Who all are using python1.8 Features of python1.9 Python installation1.10. Hello world1. using cmd2. IDLE3. By py script4. python command lineE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR13

2: Beginning Python Basics2.1. The print statements2.2. Comments2.3. Python Data Structures2.4. variables & Data Types1. rules for variable2. declaring variables3. Assignment in variables4. operations with variables5. Reserved keyword2.5. Operators in Python2.6. Simple Input & Output2.7. Examples for variables , Data Types ,operators3: Python Program Flow3.1. IndentationInventateq3.2. The If statement and its' related statement3.3. An example with if and it's related statement3.4. The while loop3.5. The for loop3.6. The range statement3.7. Break3.8. Continue3.9. pass3.9. Examples for looping4: Functions & Modules4.1. system define function(number system and its sdf ,String and its sdf)4.2. Create your own functions (user define function)4.3. Functions Parameters4.4. Variable Arguments4.5. An Exercise with functionsE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR14

5: Exceptions5.1. Errors5.2. Exception Handling with try5.3. Handling Multiple Exceptions5.4. raise5.5. finally5.6. else6: File Handling6.1. File Handling Modes6.2. Reading Files6.3. Writing & Appending to Files6.4. Handling File Exceptions7: Data Structures and Data Structures functions7.1. List and its sdfInventateq7.2. tuple and its sdf7.3. Dictionary and its sdf7.4. set and its sdf7.5. use cases and practical examples8: casting8:1 intro to castingE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR15

Module 6NOSQLCassandra: Cassandra cluster installation Cassandra Architecture Cqlsh Replication strategy Tools: Opscenter, Nodetool and CCM Cassandra use casesLabs:Inventateq Real Time use cases and Data sets covered (10 Real Time datasets) Word count, Sensors (Weather Sensors) Dataset, Social Media data setslike YouTube, Twitter data analysisE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR16

Module 7Scala & Spark trainingCourse OutlineSpark Batch processing APIIntroduction Why Spark? Evolution of Distributed systems Challenges with existing distributed systems Need of new generation Hardware/software evolution in last decade Spark History Unification in Spark Spark ecosystem vs Hadoop Spark with Hadoop Who are using Spark?Scala Basics Required for Spark Spark Architecture RDD Immutability Laziness Type inference Cacheable Spark on cluster management frameworks Spark task distributionSpark installation Local Spark on YARN Stand alone Spark on MesosInventateqE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR17

Spark API Hands on RDD operations Key-value pair RDD Map Reduce Double RDDAdvanced operations Aggregate Fold mapPartitions glom BroadcastersIntegration with HDFS Introduction to HDFS HDFS architecture Using HDFSCaching and Lineage RDD caching Fault recoverySpark streaming API Introduction Spark streaming Architecture DStreams DStream vs RDD Receivers Batch vs StreamingInput Streams Socket HDFS Twitter KafkaInventateqE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR18

Spark API Hands on RDD operations Key-value pair RDD Map Reduce Double RDDAdvanced operations Aggregate Fold mapPartitions glom BroadcastersIntegration with HDFS Introduction to HDFS HDFS architecture Using HDFSCaching and Lineage RDD caching Fault recoverySpark streaming API Introduction Spark streaming Architecture DStreams DStream vs RDD Receivers Batch vs StreamingInput Streams Socket HDFS Twitter KafkaStreaming API Hands-on DStream creation Transformations Stateful operationsInventateqE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR19

Check pointing Recoverable computations Error handlingCombining batch and Streaming Foreach Transform JoinsPersist and Caching Saving DStream Caching DStreamWindow Operations window countByWindow reduceByWindowDeploying Spark Streaming Clustering Check pointing Driver fallbackSpark SQL IntroductionInventateqSpark SQL DDL Case classes Inferred schema Parquet files JSON Schema RDDSpark SQL DML Projection Condition groupBy joins partitioningE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR20

Extending Spark SQL User defined functions User defined aggregate functionSpark SQL in Streaming Querying DStreaming DStream joinsDemo Project Ecommerce Log Analytics using Kafka, Spark Streaming and Cassandra.Master Project:Inventateq Real-time Data Warehouse migration: Real-time concepts covered are Hive - Advanced topics Sqoop import/export Oozie Scheduling How Hadoop MR/Spark used in DW RDBMS concepts ETL tool concepts Integration with Reporting toolsE-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR21

OUR HIRING PARTNERED COMPANIES LISTInventateqPOPULAR COURSES FROMINVENTATEQE-MAIL: info@inventateq.com 100% CALL:JOB7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGARORIENTEDTRAININGCOURSESWE PROVIDE 22

Digital Marketing BigData Hadoop Course Machine Learning(SEO/Social Media/PPCGoogle Adwords) Best SEO TrainingCertification Training Spark and Scala Course Block Chain Training AWS Training DevOps Training Cloud Computing Angularjs and Node JSTraining Data Science CoursesTraining Weblogic TrainingCourses Artificial IntelligenceCourses Tally ERP & GSTAccounting classes Java course RPA Training .NET Technologies Software Testing Course, Internet of Things IoT SOA Suite 11gManual Testing, QTP, UFT,Loadrunner C C CourseTraining Microsoft Azure Training Oracle DBA Training Tableau Data Warehousing - Oracle SQL, PLSQL, PHP MYSQL, PythonInformatica Selenium TrainingDBA, D2k, Apps ETL Testing Course Human Resources Classes IBM Cognos 10 BI & PPC Training Institute Microstartegy CourseCognos TM1 Qlikview (Deisgner,Developer, Publisher,Server) IBM WebsphereInventateq Autodesk Revit Training Cisco CCNA Networking Autodesk CAD 2d and SAS Training Learn ODI 11g3d Course Catia Training Softskill Courses Python Training Wiring Harness Training ITIL Certi

¾ Basic reading & writing of files in HDFS ¾ Basic processing concepts in MapReduce ¾ Data Flow ¾ Anatomy of file READ and WRITE Inventateq. E-MAIL: info@inventateq.com CALL: 7676765421 BTM MARATHAHLLI JAYANAGAR RAJAJI NAGAR 9 Module 2 HADOOP ADMINISTRATOR ¾ HADOOP GEN1 VS HADOOP GEN 2(YARN) ¾ Linux commands ¾ Single and Multinode cluster installation (HADOOP