Big Data - .microsoft

Transcription

Big DataThe Next Generation Grit Suwa Betimes Solutions

WHAT IS BIG DATA?Title ofPresentation Speaker Name, Title Microsoft Corporation2

WHAT IS BIG DATA?Title of “techniques and technologiesthatmakehandlingdataatPresentation extreme scale economical.” Speaker Name, Title Microsoft Corporation3

DIGITAL DATA AGETitle ofPresentation Speaker Name, Title Microsoft Corporation4

DIGITAL DATA AGETitle ofPresentation Speaker Name, Title Microsoft Corporation5

WHERE DATA COME FROMTitle ofPresentation Speaker Name, Title Microsoft Corporation6

DIGITAL DATA FACTS AND FIGURESTitle ofPresentation Speaker Name, Title Microsoft Corporation7

Title ofPresentation Speaker Name, Title Microsoft Corporation8

Title ofPresentation Speaker Name, Title Microsoft Corporation9

Title ofPresentation Speaker Name, Title Microsoft Corporation10

STORAGE CAPACITY AND TRANSFER RATETitle ofPresentationIt takes approximately 714 s or 12 minutesto read whole disk.It takes approximately 7,600 s or 120 minutes Speaker Name, Titleto read whole disk. Microsoft Corporation11

TAKE 4 HOURSTitle ofPresentationTAKE30SECONDS Speaker Name, Title Microsoft Corporation12

PARALLEL DATA STORAGE ANDPROCESSINGTitle ofPresentation Speaker Name, Title Microsoft Corporation13

HADOOP FILE SYSTEMTitle ofPresentation Speaker Name, Title Microsoft Corporation14

HADOOP FILE SYSTEMTitle ofPresentation Speaker Name, Title Microsoft Corporation15

HADOOP DISTRIBUTED FILE SYSTEMTitle ofPresentation Speaker Name, Title Microsoft Corporation16

HADOOP ARCHITECTURETitle ofPresentation Speaker Name, Title Microsoft Corporation17

DATA WAS REPLICATED AND PROCESSED ACROSS THE CLUSTERTitle ofPresentationWHEN NODES FAILREBALANCES FILES ACROSS CLUSTERJUST BY ADDING NEW NODES Speaker Name, Title Microsoft Corporation18

HADOOP STACKTitle ofPresentation Speaker Name, Title Microsoft Corporation19

MAP/REDUCEAutomatically Parallelizes Map & Reduce Operations Supporting 1,000’s of Processors and Petabytes of DataTitle ofPresentationReplicated Data in HDFS Failed Jobs Automatically Restarted without Loss of the Rest of JobsDegree of Parallelism can be Determined at Runtime Flexible Data Model and Programing Speaker Name, Title Microsoft CorporationOpen Source and Designed to Work on Commodity Hardware Two Routines : Map & Reduce20

INTERNET SCALE ANALYTICSTitle ofPresentation Speaker Name, Title Microsoft Corporation21

MICROSOFT BIG DATA ANALYTICTitle ofPresentation Speaker Name, Title Microsoft Corporation22

Title ofPresentation Speaker Name, Title Microsoft Corporation23

Title ofPresentation Speaker Name, Title Microsoft Corporation24

Title ofPresentation Speaker Name, Title Microsoft Corporation25

Title ofPresentation Speaker Name, Title Microsoft Corporation26

Title ofPresentation Speaker Name, Title Microsoft Corporation27

Betimes Solutions. Title of Presentation Speaker Name, Title Microsoft Corporation 2 WHAT IS BIG DATA? Title of Presentation Speaker Name, Title Microsoft Corporation 3 WHAT IS BIG DATA? "techniques and technologies that make handling data at extreme scale economical." Title of Presentation Speaker Name, Title Microsoft Corporation 4 DIGITAL DATA AGE. Title of .