Advanced Scanners Session For EDC Customers - Informatica

Transcription

October 1, 2020Advanced Scanners Sessionfor EDC CustomersGaurav Pathak, Vice President, Product Management Louis-Noel, Trapadoux, Principal Product Manager

Housekeeping Tips Today’s Webinar is scheduled for 1 hour The session will include a webcast and then your questions will be answered live at the end of the presentation All dial-in participants will be muted to enable the speakers to present without interruption Questions can be submitted to “All Panelists" via the Q&A option and we will respond at the end of the presentation The webinar is being recorded and will be available to view on our INFASupport YouTube channel and Success Portal.The link will be emailed as well. Please take time to complete the post-webinar survey and provide your feedback and suggestions for upcoming topics.2 Informatica. Proprietary and Confidential.

Feature Rich Success PortalBootstrap trial andPOC CustomersEnriched CustomerOnboardingexperience Informatica. Proprietary and Confidential.Product LearningPaths and WeeklyExpert SessionsInformaticaConcierge withChatbot integrationsTailored training andcontentrecommendations

More InformationSuccess Portalhttps://success.informatica.com4Communities &Supporthttps://network.informatica.com Informatica. Proprietary and ces-and-training/informaticauniversity.html

Safe HarborThe information being provided today is for informational purposes only. Thedevelopment, release, and timing of any Informatica product or functionalitydescribed today remain at the sole discretion of Informatica and should not berelied upon in making a purchasing decision.Statements made today are based on currently available information, which issubject to change. Such statements should not be relied upon as arepresentation, warranty or commitment to deliver specific products orfunctionality in the future.5 Informatica. Proprietary and Confidential.

SpeakersGaurav PathakVice PresidentProduct ManagementMetadata and CLAIRELouis-Noel TrapadouxPrincipal Product ManagerEDC

Enterprise Data Catalog Powered byEnterprise Data CatalogBroadMetadataSources Operational Usage Glossary Policies ProcessWisdomof Crowd[Data Analysts, Data Scientists] TechnicalBusinessContextSelf Service AnalyticsAI Curated CatalogStructure Discovery, Profilingand Domain Discovery,Similarity Clustering,RecommendationsBusiness & CrowdSourced CurationBusiness GlossaryAssociations, BusinessClassifications, Annotations,Comments Behavior Informatica. Proprietary and Confidential.Data Governance[Data Stewards, Data Architects] Associate Business glossary totechnical objects Verify business to technical lineage Track key data elements complianceData Asset Management[Architects, Developers] Comments RatingsGoogle for enterprise data assetsData Lineage, holistic relationship viewTrust with data profileAccess to dataKnowledge Graph Analyze column-level Lineage &Change Impact View transformation Logic Data asset and BI usage

Technical ChallengesIt’s difficult and nearly impossible tocatalog all of our enterprise data includinglegacy on-premises systems and newerCloud enterprise and analytic applications8 Informatica. Proprietary and Confidential.The ability to understand data throughdata lineage is typically incomplete(there exist black boxes) especially forscripts, code, legacy mainframe systems,multi-vendor ETL tools, and BIapplications

Informatica Compact SolutionsInformatica Acquires Compact Solutions to Extend Industry Leading Enterprise Data CatalogIndustry’s first and only catalog of catalogs across all enterprise data with the broadest metadata connectivity Extends the industry’s most comprehensive and detailed data lineage (i.e. no“black boxes”) to understand the provenance of all enterprise data andimpact of changes as companies digitally transform and modernize. Expands the industry’s broadest metadata connectivity to catalog all types ofdata in support of analytics, data governance and privacy, customerexperience, and data warehouse modernization initiatives. Establishes the industry’s only single vendor metadata management solutionto simplify procurement, deployment, maintenance and support.9 Informatica. Proprietary and Confidential.

The Catalog of CatalogsDataGovernanceAnalyticsMaster ceDataIntegrationData QualityOpen APIs, Full Access DiscoveryProfilingLineageImpact Analysis Semantic SearchDomain DiscoverySimilarity ClusteringBusiness Term AssociationEnterprise Data Catalog RelationshipsBusiness ContextGlossary IntegrationCustom Annotations Reviews/RatingsQuestions/AnswersData CertificationsChange NotificationsKnowledge Graph AI/MLBreadth of Active MetadataOn-premDatabasesDataWarehousesData LakesFileSystemsCode andScriptingStatistical & BIToolsAnalyticsAppsOn-prem/SaaS AppsETLMainframesPlatform as aService

The Catalog of Catalogs: One Vendor, One SolutionDataGovernanceAnalyticsMaster ceDataIntegrationData QualityOpen APIs, Full Access Discovery Profiling Lineage Impact Analysis Semantic SearchDomain DiscoverySimilarity ClusteringBusiness Term AssociationEnterprise Data Catalog Compact Solutions RelationshipsBusiness ContextGlossary IntegrationCustom Annotations Reviews/RatingsQuestions/AnswersData CertificationsChange NotificationsKnowledge Graph AI/MLBreadth of Active MetadataOn-premDatabasesDataWarehousesData LakesFileSystemsCode andScriptingStatistical & BIToolsAnalyticsAppsOn-prem/SaaS AppsETLMainframesPlatform as aService

Broadest and Most Complete Metadata ConnectivityEDC Advanced ScannersCode and Scripting OracleSQL ServerTeradataNetezzaIBM DB2Sybase ASEETL Tools IBM DatastageMicrosoft SSISMainframes COBOLJCLStatistical and BI Tools SASMicrosoft SSASMicrosoft SSRSSAP BWSAP BW4HANA

Usecases

Data Lineage: A Business ImperativeRequirements:Regulatory ComplianceData QualityData GovernanceData Lineage traces data fromsource to destination, covering theentire lifecycle of data. It includesinformation about changes todata during its journey.Data AnalyticsData Privacy and Security

Data Lineage: The Foundational Use CaseIncreasingly “IT” use cases are coming to the forefront Dev Operations: Change Management & ImpactAnalysis - what-if analyses for changes Operational Efficiency: Eliminateproliferation, duplication, data silos, reducecosts DW/Apps Modernization: Completeunderstanding of the data landscape to enableapp modernization & cloud migration and AI use cases Explainable AI & AI Governance: Track andassess data used to train models, govern AIprojects. Support Explainable AI. Ensure trainingdata variety.15 Informatica. Proprietary and Confidential.

Enterprise Data Catalog Advanced ScannersExtract metadata and data lineage with in-depth details Parse code from various stored procedures and multi-vendor ETL tools Obtain automatic lineage and data relationships at scale Extract deep metadata from both static and dynamic code Obtain complete visibility into the procedure calls with parameter tracking, dynamic SQLgeneration from values based on parameters, database queries and more16 Informatica. Proprietary and Confidential.

Advanced scanner availabilityCategorySourceDatabaseOracle DBMS SQL ServerTeradataNetezzaSybase ASEIBM DB2 (LUW)ETLSSISReporting/Statistics Standard and Advancedscanner only available for selectdatabases- Standard scanners fetch simpleobject metadata and aremandatory.- Advanced Scanners are requiredfor extracting lineage metadata. Advanced scanners cover:DataStageSSAS- Selected list of reporting scannersSSRS- Mainframe scannersSAP BWSAP BW4HANA17Advanced scanner- Third Party ETL scanners (otherthan INFA)SASMainframeStandard scannerCobol Informatica. Proprietary and Confidential.JCL We will be releasing moreadvanced scanners overtime

Standard vs Advanced DB scannersStandard DB scannersAdvanced DB scannersObject metadata- Tables- Views- Materialized views- Synonym- Trigger definitions- Procedure definitions (no lineage)- Function definitions (no lineage)Lineage- Views and Synonyms to Tables Limited Lineage from database scripts (available onlyfor Oracle, Teradata, Hive, DB2) Summary level only Table level onlyProfiling- AvailableCode Lineage- Generated from Procedures/Functions SQL parsing- Detailed lineage for SQL statement attable/view/synonym and field level- Advanced visualization available for complex SQL- Support parsing SQL dynamically generated18 Informatica. Proprietary and Confidential. Lineage from database scripts (available for Oracle,SQL Server, Teradata, Netezza, DB2) Detailed lineage for SQL statementsat table/view/synonym and field level Support dynamic SQL Overcome “select *” limitations Support Loader/Export scriptsAdvanced Scanners are required for any customer interested in datalineage. Standard Scanners provide metadata extraction for simpleobjects – tables, columns, views. But Data Lineage requires metadatafrom parsing SQL Code, Stored Procedures, SQL Scripts that move data– this is where Advanced Scanners come in.

Dynamic SQL support with Advanced ScannersMost Real-life SQL code is dynamic – is heavily parameterized withvalues of the parameters determining the code path. None of the INFAcompetitors today can parse dynamic SQL, most cannot even parsestatic SQL code. With Advanced Scanners we support accurate datalineage extraction from all SQL code.19 Informatica. Proprietary and Confidential.

No Black Boxes – COBOL, JCL and SASManage and govern all yourenterprise data, improve changemanagement and minimize riskof changes with end-to-end anddetailed lineage and impactanalysis (no black boxes).20 Informatica. Proprietary and Confidential.

No Black Boxes – Microsoft SSIS, SSAS and SSRSManage and govern all yourenterprise data, improve changemanagement and minimize riskof changes with end-to-end anddetailed lineage and impactanalysis (no black boxes).21 Informatica. Proprietary and Confidential.

No Black Boxes – IBM DataStageManage and govern all yourenterprise data, improve changemanagement and minimize riskof changes with end-to-end anddetailed lineage and impactanalysis (no black boxes).22 Informatica. Proprietary and Confidential.

Deep Lineage Visualizations with Advanced Scanners Advanced lineage visualization with decomposition of SQL statements into individualtransformations allow users to analyze Stored Procedures Availability of mapping report including list of transformation applied23 Informatica. Proprietary and Confidential.Data Lineage is essential for regulatorycompliance, root cause analysis,impact analysis, data migrations tocloud and establishing trust in data.The first three use cases requiredecomposition of code intounderstandable chunks. AdvancedScanners break down large blobs ofSQL code into a data lineage subgraphfor deeper analysis.

EDC Advanced Custom Metadata Loader Load custom object and lineage metadata into the catalogthrough a business-friendly process Allow ingesting metadata without creating manual models- directly ingest metadata- Relational databases- Microsoft Excel spreadsheets- File formats such as XML, JSON and CSV No development required – repeatable after configurationand setup Obtain complete auditing and governance control over theentire metadata extraction and loading process24 Informatica. Proprietary and Confidential.CustomMetadataSources (Excel,CSV, JSON,XML, DB)AdvancedCustomMetadataLoaderEDC

Leave no metadata behind Breadth – scan everything you need including stored procedures, mainframe, ETL, BI,analytical applications, embedded SQL buried everywhere and more Depth – scan every single transformation and every piece of logic, including dynamic SQL,hand-written scripts, database specific load/unload utilities and more Trust – be sure that you get all the lineage and no surprises with clear information about everysituation when for some reason complete lineage could not be extracted Integration – lineage is a critical part of the data governance story, but there is more that EDCprovides to create a complete data governance platform like profiling or glossary25 Informatica. Proprietary and Confidential.

DEMO

Learn More Read the Informatica Enterprise Data Catalog Advanced Scanner datasheet Download a free copy of “Drive Your Business Forward With a Catalog of Catalogs” Watch on-demand customer, partner and Informatica expert presentations on CLAIREview(the Informatica virtual summit in 2020) Visit us at Informatica Enterprise Data Catalog Advanced Scanners27 Informatica. Proprietary and Confidential.

Questions?

Thank You

Questions/Answers Data Certifications Change Notifications. The Catalog of Catalogs: One Vendor, One Solution. On-prem Databases. File Systems. Statistical & BI Tools. On-prem/ SaaS Apps. ETL. Knowledge Graph AI/ML . Breadth of Active Metadata. Open APIs, Full Access. Enterprise Data Catalog. Platform as a Service. Data Warehouses. Analytics Apps. Code and Mainframes. Scripting .