Transcription
Application PerformanceManagement for CloudCMGBy Priyanka Aroraprarora803@gmail.com
Cloud Adoption Trends2 Spending on public cloud Infrastructure as a Servicehardware and software is forecast to reach 173B in 2026SaaS and PaaS portion of cloud hardware andinfrastructure software spending are projected to reach 55B in 2026
Agenda3 Understanding Cloud What is Cloud ComputingCloud Deployment Models Cloud Service Models Understanding APMConsiderations for Cloud ApplicationsMaking the right APM choice Obscurity of CloudSolution IaaS/PaaS/SaaSProblem Statement Public vs Private, Hybrid?APM for XaaSSynthetic Vs Real User MonitoringMagic Quadrant and Market SolutionsEvaluating APM SolutionsExamples
4Understanding Cloud
What is cloud computing?5Traditional IT Cost of Ownership andMaintenance Cost of Operation Total ControlCloud Computing Flexible availability Cost effective, Pay per use Unlimited Scalability Fit for Purpose
Cloud Deployment Models6 Private Cloud Privately shared virtualized resources Systems and Services operated and delivered exclusively for an organizationPublic cloud Enterprise taps into a public set of resources delivering standardized, highlyautomated offerings Compute resources, storage and networking capabilities, are owned by aservice provider and offered on demand with limited SLAs regarding tenancy,isolation, and performance. Shifts CapEx to OpEx – Pay as you goHybrid cloud Implementation of “Hybrid” includes a mix of internal (on-premises, oftenvirtualized) and external (cloud-hosted) solutions, with applications switchingbetween resources as needed Allows manipulation of CapEx and OpEx
Cloud Service orageServers
8Problem StatementObscurity of Cloud
Cloud Challenges9 Security and Privacy Compliance Obscurity Lackof Visibility Loss of Control Availability, Reliability, Service Quality PerformanceUnpredictabilityLack of Skills, Expertise Integration with Existing Infrastructure
Traditional Management Paradigm10 Traditional data center management and monitoring protocols focus on technology elementsin silosLack of visibility and control puts organization at the mercy of cloud vendor Vendor’s commitment and capability to meet SLAs
11Solution - CAPM
Application PerformanceMeasurement/Monitoring/Management?12UEM and Reporting Real-time end userexperience monitoring SLA Reporting Performance andAvailability of CriticalBusiness Functions Business Analytics Business TransactionVolumes Trend Reports Geographic LocationdistributionCode Optimization Method level code profiling - rootcause identification andoptimization Thread synchronization anddeadlock analysis Heap Object analysis andoptimization Response Time tier breakdownProduction Monitoring / Alerting CPU, Memory, Disk and Network Application availability/health Critical business functionperformance Garbage Collection Health Out of Memory, DeadlockConditionsApplication Monitoring CPU, Memory, Disk andNetwork monitoring Process Health Heap Usage, GarbageCollection Health Threads Count,Contention/Raceconditions DB Connection Health # of connections,use/wait time,deadlocks
Application op13FeedbackUEM and Reporting Real-time end userexperience monitoring SLA Reporting Performance andAvailability of CriticalBusiness Functions Business Analytics Business TransactionVolumes Trend Reports Geographic LocationdistributionMonitorCode Optimization Method level code profiling - rootcause identification andoptimization Thread synchronization anddeadlock analysis Heap Object analysis andoptimization Response Time tier breakdownProduction Monitoring / Alerting CPU, Memory, Disk and Network Application availability/health Critical business functionperformance Garbage Collection Health Out of Memory, DeadlockConditionsTestApplication Monitoring CPU, Memory, Disk andNetwork monitoring Process Health Heap Usage, GarbageCollection Health Threads Count,Contention/Raceconditions DB Connection Health # of connections,use/wait time,deadlocksDeploy
APM & Cloud: Why is it Important?14 Ensuring excellent end-user experience‘Right Sizing’, validation and forecasting futuregrowth, smart scaling Mitigating Risk of Unpredictable Performance Regaining Visibility and Control SLA Verification
15CAPM – Making the right choice
APM for XaaS16 Deep Dive Monitoring Real User Monitoring Script Injection Agent install at times Compatible with IaaS andPaaS Can also be implemented forSaaS if vendor permits To monitor container resources Java/J2EE and .Net Needs an Agent Compatible with IaaS and PaaSAgent BasedActive(IaaS/PaaS)Agent oring Synthetic Monitoring Robotic transactionssimulating User Behavior Availability focused Compatible with all Cloudofferings: IaaS, PaaS sive(Not CloudCompatible) Network Monitoring TCP/IP and SPAN Portmonitoring Not Cloud Compatible
Synthetic Vs RUM17 Synthetic ActiveMonitoring, special focus on Availability Based on ‘Scripted’ User Interactions Gauges possible user experience Executed from any location across the globe Is Agentless RUM Basedon ‘Real’ User Interactions A more definitive indicator of User Experience Also provides a global perspective Can be Agentless(JavaScript, Network tap) orAgent based
Some things to consider.18 Focus on Unified View across Hybrid EnvironmentsAPM Approach Business TransactionsTiersBaselinesEnd User ExperienceAgent Vs AgentlessSynthetic Vs Real User Monitoring (RUM)Development and Optimization considerations
Gartner’s Magic Quadrant for APM Suites19Dimensions of APM SuitesEnd-user experience monitoring (EUEM) The capture of data about how end-to end latency, executioncorrectness and quality appear to the real user of the application A secondary focus on application availability may be accomplished bysynthetic transactions emulating the end userApplication topology discovery and visualization The discovery of the software and hardware infrastructurecomponents involved in application execution, and the array ofpossible paths across which these components communicate to deliverthe applicationUser-defined transaction profiling The tracing of user-grouped events, which comprise a transaction asthey occur within the application as they interact with componentsdiscovered in the second dimension; this is generated in response to auser's request to the application.Application component deep dive The fine-grained monitoring of resources consumed and eventsoccurring within the components discovered in the applicationtopology discovery and visualization dimension This includes the server-side components of software being executed.IT operations analytics (ITOA) The combination or usage of the following techniques: complexoperations event processing, statistical pattern discovery andrecognition, unstructured text indexing, search and inference,topological analysis, and multidimensional database search andanalysis.
Dynatrace20
AppDynamics21
New Relic22
Monitoring Capabilities of Cloud Providers23 AWS- Amazon Cloud Watch and now X-Ray Chargeback/Cost Metrics, Status and Availability Monitoring Log Monitoring, Transaction Tracing, Service Mapping For AWS only
Monitoring Capabilities of Cloud Providers24 Azure - Azure Monitor Activity and Diagnostic Logs, Alerts and Automated actions Metrics – Resources, Application Performance, Storage, Service Bus etc For Azure only
Monitoring Capabilities of Cloud Providers25 Google - StackDriver Logging, event monitoring, diagnostics, Visibility into performance, uptime, and overall health of cloud applications For Google Cloud and AWS
26Evaluating APM Solutions
Defining APM Requirements27Real End User Monitoring Solution Must Have Real-time capture and reporting on User Activity, Response Timesand ErrorsPage Rendering and object level detailsBusiness Transaction and SLA monitoringBusiness Analytics and User Session InsightsOmni-channel visibility across Browsers and Mobile platformsGeographic perspectiveUser defined transaction profiling for critical business applications Integration with Runtime application performance ‘User-Complaint’ to ‘Code-Level’ root cause for faster triage of Production issuesGood to Have User defined transaction profiling for non critical applications
Defining APM Requirements28Synthetic Monitoring Solution for Active Availability coverage Must HavePage rendering and object level details for RootCause analysis for critical business applications Last Mile response time from across the geography Multi Browser support for playback Good to HaveThird part content analytics for critical businessapplications Competitive Benchmarking Integration with UEM, Runtime ApplicationPerformance
Defining APM Requirements29Runtime Application Performance Monitoring Solution Must Have Process health, Container resource, event monitoring (Heap, Thread, Connection pools, Servlets, Auto discovery of transaction topology for service dependency mappingTransaction tracking/stitching for response time tier breakdownCode level ‘deep-dive’DB Performance (SQL Captures, Pool usage, Deadlocks, Hotspots)WebService, Messaging and Remote calls PerformanceSupport for Java/J2EE, .Net, Microservices and other runtimesMonitoring of on premises and cloud hosted applications – seamless integration,presentationReal time high granularity collection and reportingNo impact to application performance with minimal resource overhead EJBs)Good to Have Transaction/Method level resource cost evaluation for code optimizationRuntime Thread Dump and Heap Dump capabilitiesApplication Events- Errors, Exceptions, logs and Stack TraceIntegration with DevOps, support development/test lifecycle for agility Integration with Load Generation tools (HP Performance Center) for monitoring of Performance testsIntegration with IDEs and CI/CD solutions for code optimization
Defining APM Requirements30Other Considerations Data analysis, presentation and reporting capabilitiesIntegration with MoM solutionsEase of deployment and UseArchitecture of solution (Aggregation vs Gapless design), Data model, Agentdesign (application resource reliant vs independent)Converged platforms providing one consolidated view – Single pane of glassOne unified solution for multi purpose APM APM-aaS availabilityVendor Evaluation On-premises and Cloud (IaaS/PaaS/SaaS) applicationsMonitoring capability for Containers and MicroservicesRoadmap and technology/trend adoption towards Digital Performance ManagementLicensing ModelCurrent Market ShareCustomer Service and Product SupportGartner’s EvaluationCost of Deployment, Maintainability and Scalability
31Examples
Synthetic Monitoring32
Synthetic Monitoring - Reports33
Real User Monitoring (JavaScript Injection)3441SaaSApplication243
RUM Reports35 Capability to track ‘critical’ transactions, visibility into top customer locations, branches and associated userexperiences
RUM Reports36
Agent Based Monitoring37IaaS/PaaSApplication
Agent Based Monitoring - Reports38
Agent Based Monitoring - Reports39 Garbage Collection health, Suspension time, Process throughput, Thread Count
Agent Based Monitoring - Reports40
Agent Based Monitoring - Reports41 Giving the Developers capability to drill in all the way to the source of the problem Deep dive into the code, identifying root cause at the method level Long running methods contributing to latency; Also identifying resource intensive methods
And that’s a wrap!42 We talked about Cloud Cloud deployment modelsService offeringsCloud ChallengesAPM SolutionsConsiderations for APM solutionsIrrespective ofWhat cloud solution you adopt Which applications you migrate to it What monitoring methodologies you choose What APM solutions you deploy .as an IT organization, you have to move from traditionalmonitoring/management paradigms to business service andend user focused, holistic solutions to survive this new age ofCloud
Questions?43
References44Gartner's Magic Quadrant for Application Performance Monitoring SuitesThings to Know About APM in the CloudManaging Performance of Cloud Based ApplicationsAPM for Cloud ComputingAnatomy of APMCloud PredictionsObscurity of CloudDuality of APMAzure MonitorAWS X-RayGoogle StackdriverDynatraceAppDynamicsNew Relic
45APPENDIX
RUM Reports (JavaScript Injection)46 Reportingon Average Application throughput during typical and peak timesresponse times for users with anomalies Response Time Breakdown between Server, Network and Rendering time AverageNo improvement in Response Time since July 1stEST Users come onlinePST Users come online
Agent Based Monitoring - Reports47
RUM Reports (JavaScript Injection)48 Understanding User Experience across the globe Application Workload Trending for forecasting growth
RUM Reports (Agent Based)49
SLA/SLM and Reporting50 Monitoring/Reporting Service levels for overallapplication as well as at ‘component level’Combining ‘Active’ and ‘Passive’ monitoringfor complete visibilityProactive alertingTrending Transaction Volumes and End UserExperienceUnderstanding ‘cost per transaction’ Correlatingbusiness throughput to systemresource for ‘right-sizing’
Optimization for Cloud Applications51 Content CachingCompression algorithms – data is expensive inthe cloudOptimized page rendering Parallel rendering of contentWAN Quality of ServiceChatty Applications ReducingLatencyacknowledgements to reduce WAN
Dynatrace – Instance Cost52
Dynatrace – Problem Evolution53
Dynatrace – Topology Visualization54
Dynatrace – Overview Dashboards55
AppDynamics – Topology Visualization56
AppDynamics – User Experience57
New Relic – Infrastructure Overview58
New Relic – Instance Cost Monitoring59
Application Performance Management for Cloud By Priyanka Arora prarora803@gmail.com CMG. Cloud Adoption Trends . Roadmap and technology/trend adoption towards Digital Performance Management Licensing Model Current Market Share Customer Service and Product Support Gartner's Evaluation Cost of Deployment, Maintainability and Scalability 30.