For your disaster recovery (DR) strategy using Azure Site Recovery (ASR) in a scenario where no virtual machines (VMs) are pre-provisioned in the DR region, here are some best practices and recommendations:
- ASR Recovery Plan: Yes, an ASR Recovery Plan is the recommended way to orchestrate the end-to-end failover. It covers the automatic creation of VMs in the DR region, the sequencing of their startup, and post-failover tasks such as load balancer and DNS updates, which run as scripted actions within the plan. You define the order in which VMs are brought online by placing them in ordered groups, so that dependencies (for example, domain controllers before SQL, SQL before application servers) are respected.
- Pre-Creation of Components: Microsoft supports having zero compute resources in the DR region. ASR can create all VMs automatically during a failover. However, consider pre-creating essential network components (like VNets, subnets, and NSGs) and potentially critical infrastructure components (like domain controllers) to ensure a smoother recovery process. Load balancers and SQL replicas can also be pre-configured if needed, but it is not mandatory.
- SQL on Azure VMs: While ASR replication can be relied upon for DR, implementing SQL Server Always On Availability Groups (AG) is recommended for improved recovery point objectives (RPO) and recovery time objectives (RTO). If you choose to use SQL AG, you can integrate automated failover into your ASR Recovery Plans by scripting the failover process within the Recovery Plan's automation tasks.
- Automation: Azure Automation Runbooks or Azure Functions are recommended for tasks performed during failover, such as DNS updates and application warm-up. Recovery Plans can invoke Azure Automation runbooks as pre- or post-actions on a group, which makes them a natural place to run these configuration steps.
- Official Samples: No specific repository of ASR Recovery Plan automation scripts is identified here. Official Microsoft documentation and sample scripts are typically available through the Azure documentation portal.
- Performance Considerations: ASR is designed to handle bulk failover, including environments of around 50 VMs. Still, review the documented limits and performance considerations that apply to your deployment, and run test failovers of the full Recovery Plan to confirm that recovery times meet your expectations under load.
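
To make the startup-ordering point concrete, here is a minimal planning sketch in Python. It does not call ASR or any Azure API. It only shows one way to derive ordered startup groups from a dependency map before you enter them into a Recovery Plan by hand or by script. The function name `plan_boot_groups` and the VM names are hypothetical and chosen purely for illustration.

```python
def plan_boot_groups(dependencies):
    """Return VMs as an ordered list of startup groups.

    `dependencies` maps each VM name to the set of VMs that must be
    running before it starts. Every VM in a group depends only on VMs
    placed in earlier groups.
    """
    remaining = {vm: set(deps) for vm, deps in dependencies.items()}
    # Include VMs that appear only as someone else's dependency.
    for deps in dependencies.values():
        for dep in deps:
            remaining.setdefault(dep, set())

    groups = []
    started = set()
    while remaining:
        # A VM is ready once all of its dependencies have started.
        ready = sorted(vm for vm, deps in remaining.items() if deps <= started)
        if not ready:
            raise ValueError(f"circular dependency among: {sorted(remaining)}")
        groups.append(ready)
        started.update(ready)
        for vm in ready:
            del remaining[vm]
    return groups


# Hypothetical three-tier workload (names are illustrative only).
example = {
    "dc01": set(),
    "sql01": {"dc01"},
    "sql02": {"dc01"},
    "app01": {"sql01", "sql02"},
    "web01": {"app01"},
}

groups = plan_boot_groups(example)
for number, vms in enumerate(groups, start=1):
    print(f"Group {number}: {', '.join(vms)}")
```

Each resulting group corresponds to one ordered group in the Recovery Plan. Tasks such as DNS updates would then be attached as actions after the group they depend on. A circular dependency raises an error instead of producing an order that cannot be honored.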
By following these guidelines, you can implement a cost-optimized DR solution that aligns with Microsoft’s best practices, ensuring that your ASR setup is effective and efficient.