Getting Real About Data Virtualization - Informatica

Transcription

5/2/2011Getting Real About Data VirtualizationInformatica Data ServicesAsh ParikhMay 3, 20111Defining Data Virtualization Makes multiple heterogeneous datasources appear as one Federates data in real-time & alsosupports physical materialization to DW Abstracts data sources from consumersand insulates from change Hides & handles complexity (quality, richtransformations, bus-IT collaboration) Let’s the business own the data & definethe rules while IT retains controlDATA VIRTUALIZATIONHARDWAREVIRTUALIZATIONBIComposite AppsPortalDataOperatingConsumersSystemsLogical Data VOICE erpriseData SourcesEnterpriseStorageSource: VMWare, Inc.Logicalof UnderlyingDataLogicalViewViewof ComputingResourcesSimple DataFederation Does Not Cut It21

5/2/2011Here’s Why The TraditionalDI e/Test6.DeployBusinessTypical Value Stream Map - Too Much WAIT & WASTE The biz gets involved late &does not get what is needed3The Problem(s)42

5/2/2011It Takes too Long to Deliver Data theBusiness Needs!BusinessITChange Request Approve & Prioritize Analyze & Design Build Test DeployDWUnstructuredDataDMDMTrustApplications Mainframe5Data Is Everywhere & Growing!BusinessITDW/MDMHubUnstructuredData Spread MartsDMDMDMDMTrustApplications Mainframe63

5/2/2011TheImpact7Reports Take too LongReporting Scenario: On-going requests for data that is NOT in the DWBusinessChangeRequest? if?WhatITDeploy toProduction3-6 MonthsWeeks/DaysChange Request Approve & Prioritize Analyze & Design Build Test Deploy 66% 71% 36% 77%of BI requirements change on between a daily and monthly basisof the respondents said they have to ask data analysts to create custom reports for themof custom report requests require a custom cube or data mart to answer the requestof respondents cited that it takes between days and months to get their BI requests fulfilledSource: Forrester Research, “Agile BI: Best Practices for Breaking Through the BI Backlog,” 201084

5/2/2011HealthNow Case StudyBI(Cognos)Portal(WebSphere)BusinessWeb servicesNo ReuseSQLITXNo Reuse of Data Services for BI, MDM & SOAAgreement on MEMBER & Attributes Time-Consuming & PainfulDifferent Price Info in Each BU & 1700 Dev Hours to Add 1 Product16 Heterogeneous Enterprise Stores With Large Volumes of Data30,000 Data Marts(MS Access)Data Warehouse(DB2)Facets [Benefits, Products](Sybase ASE)Product Config Mgmt(MS SQL Server)9Lean Integration &Data Virtualization105

5/2/2011Data Virtualization Built on Lean IntegrationPrinciplesThe TraditionalDI ProcessThe New AgileDI Process1.Source1.Logical Data ow5.Execute/Test6.Deploy- Preview- Profile at any stage- Apply rich transformations- Apply DQ & masking rules on-the-fly- Federate data without data movementBusiness4.Comment/Tag- DebugBusinessOriginal Value Stream Map - Too Much WAIT & WASTE5.Deploy as Reusable Data Services- Web services or SQLOptimized Value Stream Map – Cut the WAIT & WASTE Early and on-going businessuser involvement for agile DI11Self Service – Analyst Empowerment &Business-IT CollaborationDI Analyst Easily map sources to physical &virtual targetsSQL or Web Service Quickly find data via integratedbusiness glossaryBI Report Specify transformationswith reusable expressions Include pre-built rules andmappings (e.g. ETL, DQ) Collaborate, test and validatespecification results Automatically generate ETLmappings and SQL viewsBatch ETLDI DeveloperData Warehouse Improved analyst &developer productivity126

5/2/20115x Faster Direct Data Access, IncreasedReuse, Improved Governance & AgilityBI(Cognos)Portal(WebSphere)REUSE BI Strategy MDM Strategy SOA Strategy1 week(vs. 3 months)“Virtual Table”MEMBERCLAIM30,000 Data Marts(MS Access)PRODUCTData Warehouse(DB2)Facets [Benefits, Products](Sybase ASE)ORDERProduct Config Mgmt(MS SQL Server)13Case for Making Data Virtualization as part ofYour Information Management Best ntsCRMAccess all data,accelerate profilingAbstract, find, governLogical Data ObjectsOptimizations& CachingCRMInformatica Data Services(Data Virtualization)AccountsLeverage a scalable& reliable onMetadataInvolve business earlyto define & validate rules465EIICall CenterETLAccountsSupport multiple stylesof data processingBatchQueryEngineWeb ServicesWSServerSingle-click reuseacross all applicationsDQTransformationRich transforms, DQ& masking rules in RT147

5/2/2011Next StepsLet’s Talk!Architect to Architect Webinar SeriesData Virtualization Architecture & BestPractices for Agile Data IntegrationMay 19, 2011“SOA Data Integration ArchitectureGroup”Forrester IaaS(Data Services) Wave15168

Getting Real About Data Virtualization Informatica Data Services Ash Parikh May 3, 2011 2 Makes multiple heterogeneous data sources appear as one Federates data in real-time & also supports physical materialization to DW Abstracts data sources from consumers and insulates fr