Developed a full disaster recovery setup for a live production environment on AWS
Designed and implemented a full disaster recovery (DR) strategy for a threat exposure management company's live production system running on AWS. The solution combined multi-region failover, data replication, and automated recovery processes to ensure business continuity. Critical stateful services, including RabbitMQ (Amazon MQ), Redis (Amazon ElastiCache), and MongoDB Atlas, were integrated into the DR plan. Kubernetes workloads were made highly available with EFS-backed persistent volumes and GitOps-driven deployments via Helm and Argo CD. Security and reliability were strengthened with AWS WAFv2 and Datadog observability, while infrastructure provisioning was automated with Terraform and Pulumi.