SAP High Availability In Azure Using SUSE Linux

Transcription

SAP High Availability in Azure Using SUSE Linux [BP-1404]

Agenda
- Basic SAP Architecture
- SAP HA Architecture in Azure
- Pacemaker
- Azure Load Balancer
- Demo of unplanned failover

SAP Architecture

Basic SAP
(Diagram; labels partly lost in extraction: Server, Shared Disk, Database, Storage)

S/4HANA High Availability Architecture in Azure

- Availability Set (99.95%)
- Availability Zones (99.99%)
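To put the two availability figures in perspective, the sketch below converts an availability percentage into the downtime it allows per year. The function name downtime_minutes_per_year is made up for illustration, and a 365-day year is assumed.

```shell
# Convert an availability percentage into allowed downtime per year.
# Assumption: a 365-day year (525600 minutes); the function name is illustrative.
downtime_minutes_per_year() {
    # $1: availability in percent, e.g. 99.95
    LC_ALL=C awk -v a="$1" 'BEGIN { printf "%.0f\n", (100 - a) / 100 * 365 * 24 * 60 }'
}

downtime_minutes_per_year 99.95   # Availability Set
downtime_minutes_per_year 99.99   # Availability Zones
```

Under these assumptions, 99.95% allows roughly 263 minutes (about 4.4 hours) of downtime per year, and 99.99% roughly 53 minutes.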

SUSE High Availability Extension

The Goal of HA
- MTTR

SUSE High Availability Overview
(Diagram of the cluster stack on each node: vIP, SAP, Fencing (stonith), Resource Agents (RAs), pacemaker (crm), corosync (cluster membership), Kernel, Storage (SBD))

Resource Agents
- Provide "intelligence" to Pacemaker
- A script used to start/stop/monitor a resource
- Ideally should be Open Cluster Framework (OCF) compliant
  - Well-defined return values
  - Mandatory operations
- Return value passed back to Pacemaker
- Many providers of RAs
- Ships with around 140 RAs out of the box
- Resource Agents for SAP HANA included in SLES for SAP Applications
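As a rough illustration of the start/stop/monitor contract, here is a minimal sketch of an OCF-style agent for a made-up resource whose only state is a marker file. The exit codes are the standard OCF values; the resource itself, the STATE_FILE path, and the function names are assumptions for the example. A real agent would also implement meta-data and validate-all and would normally source the OCF shell function library.

```shell
# Minimal OCF-style resource agent sketch for a hypothetical "marker file" resource.
# Standard OCF exit codes:
OCF_SUCCESS=0
OCF_ERR_GENERIC=1
OCF_ERR_UNIMPLEMENTED=3
OCF_NOT_RUNNING=7

# Hypothetical state location; override via the environment if desired.
STATE_FILE="${STATE_FILE:-${TMPDIR:-/tmp}/demo_ra.state}"

ra_monitor() {
    # "Running" simply means the marker file exists.
    if [ -f "$STATE_FILE" ]; then return $OCF_SUCCESS; fi
    return $OCF_NOT_RUNNING
}

ra_start() {
    # Start must be idempotent: starting a running resource succeeds.
    touch "$STATE_FILE" || return $OCF_ERR_GENERIC
    ra_monitor
}

ra_stop() {
    # Stop must also be idempotent: stopping a stopped resource succeeds.
    rm -f "$STATE_FILE" || return $OCF_ERR_GENERIC
    return $OCF_SUCCESS
}

ra_main() {
    # The cluster passes the requested action as the first argument.
    case "$1" in
        start)   ra_start ;;
        stop)    ra_stop ;;
        monitor) ra_monitor ;;
        *)       return $OCF_ERR_UNIMPLEMENTED ;;
    esac
}

# A real agent script would end with:  ra_main "$1"; exit $?
```

The return value of each action is what gets passed back to Pacemaker: monitor returning 7 (not running) on a node where the resource should be running is what triggers recovery.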

SAP HANA Resource Agents

Why Do We Need Fencing?
To a cluster node, loss of a peer node is indistinguishable from loss of communication with that node.
- In the former case, is it safe to fail over resources?
- And in the latter case?

Split Brain
- When a cluster partitions due to network failure
- Neither side knows if the other is still alive
- Worst case scenario: each side attempts to fail over the other's resources
- Better scenario: neither side does anything (but then, why do we have a cluster?)
- Best scenario: one side is able to guarantee that the other is down
- Fencing is about moving from an UNKNOWN state to a KNOWN state
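The scenarios above can be sketched as a small decision function. This is a toy model, not Pacemaker's actual logic: the function name and its two yes/no inputs are assumptions, chosen to show why a confirmed fence is the only case in which taking over resources is safe.

```shell
# Toy model of the fencing decision (not Pacemaker's real implementation).
# $1: can we still talk to the peer?   (yes/no)
# $2: did fencing the peer succeed?    (yes/no, only relevant when $1 = no)
decide_action() {
    peer_reachable="$1"
    fence_confirmed="$2"
    if [ "$peer_reachable" = "yes" ]; then
        echo "no-action"    # peer state is KNOWN: alive, nothing to take over
    elif [ "$fence_confirmed" = "yes" ]; then
        echo "takeover"     # peer state is KNOWN: down, safe to fail over
    else
        echo "wait"         # peer state is UNKNOWN: failing over risks split brain
    fi
}

decide_action no no     # prints "wait"
decide_action no yes    # prints "takeover"
```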

SUSE High Availability with SAP Central Services

Enqueue-Replication Versions
- ENSA1
- ENSA2

Central Services – Multi SID

Architecture options for SAP on Azure
- Filesystem: BYO SUSE cluster; ANF NFS (future)
- Az availability options: Av Set; Av Zone
- Fencing agent: SBD; Azure Fencing agent (future)

Floating IP: Two Basic Architectures Possible
(Two diagrams; labels: PAS, Azure Load Balancer, Floating IP, Hana1, Hana2)

Let's look at the first case
- The "floating" IP can be moved from one machine to another via API/CLI
- The IP move takes approximately 2 minutes
(Diagram: PAS, HANA System Replication, Floating IP, sr_takeover, Hana1, Hana2)

Load Balancer
(Diagram: Client, Azure Load Balancer, Backend Pool containing Hana1 and Hana2)

(Diagram: PAS, Azure Load Balancer, Health Probe, HANA System Replication, Floating IP, sr_takeover, Hana1, Hana2)

SOCAT & Virtual IP Network Resource

    sudo crm configure primitive rsc_ip_HN1_HDB03 ocf:heartbeat:IPaddr2 \
      meta target-role="Started" is-managed="true" \
      operations \$id="rsc_ip_HN1_HDB03-operations" \
      op monitor interval="10s" timeout="20s" \
      params ip="10.0.0.13"

    sudo crm configure primitive rsc_nc_HN1_HDB03 anything \
      params binfile="/usr/bin/socat" cmdline_options="-U TCP-LISTEN:62503,backlog=10,fork,reuseaddr /dev/null" \
      op monitor timeout=20s interval=10 depth=0

    sudo crm configure group g_ip_HN1_HDB03 rsc_ip_HN1_HDB03 rsc_nc_HN1_HDB03
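The socat listener port in the slide follows the instance number: HDB03 listens on 62503. A small helper can derive the port and the matching socat options for another instance. This is a sketch that assumes a "625 plus two-digit instance number" convention, which the slide shows only for instance 03; the function names are made up.

```shell
# Derive the load balancer probe port from the HANA instance number.
# Assumption: the port is "625" followed by the two-digit instance number,
# matching HDB03 -> 62503 in the slide. Function names are illustrative.
probe_port() {
    # Strip one leading zero so that 08 and 09 are not parsed as octal.
    printf '625%02d\n' "${1#0}"
}

socat_options() {
    # Same socat options as in the slide, with the port substituted.
    printf '%s TCP-LISTEN:%s,backlog=10,fork,reuseaddr /dev/null\n' '-U' "$(probe_port "$1")"
}

probe_port 03       # prints 62503
socat_options 03
```

The health probe configured on the Azure Load Balancer has to target the same port. Because the listener is grouped with the virtual IP, only the node that currently holds the IP answers the probe.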

Unplanned Failover
Several mechanisms for testing:
- Shut down machine from Azure portal
- ps aux | grep sbd, kill inquisitor
- service pacemaker stop

Takeaways
- Read the documentation
- Set up and test your configuration, and keep testing
- Understand the operations
- Monitoring & alerts

Resources
- Links to l/
- Links to deployments
- [TUT-1226] SAP HA on SUSE: All you need to know
- [TUT-1396] "Day 2" Operations of SAP HANA Cluster using SUSE High Availability on Public Cloud
- [HOL-1064] SAP HANA scale-out with high availability NFS using DRBD
- Training & p/
- [BP-1351] SUSE High Availability for SAP HANA: Tales from the real world, tips, tricks, & troubleshooting
- Azure training & certifications
- [HOL-1225] High Availability for SAP application servers using ENSA2 enqueue replication

