Job Description:
We are looking for a Senior Cloud Infra / DevOps / SRE Engineer to take a leadership role in architecting, building, and operating our enterprise-grade cloud infrastructure. This role will serve as a key technical
authority across multi-cloud platforms, security, scalability, cost optimization, and system reliability.
You will collaborate closely with multiple teams to design solutions, implement complex systems, guide junior and mid-level engineers, and drive long-term platform reliability and efficiency. Strong technical leadership, strategic thinking, and hands-on capability are required.
Key Responsibilities:
● Lead the architecture, design, and implementation of scalable, secure, and cost-effective
multi-cloud infrastructure (AWS, GCP, Azure, DigitalOcean).
● Define and implement Infrastructure-as-Code standards using Terraform, Pulumi, and other
automation tools.
● Architect advanced observability platforms integrating metrics, logs, traces, and service health
monitoring (Prometheus, Grafana, Loki, Tempo, ELK, Datadog, etc).
● Design, build, and maintain CI/CD pipelines supporting complex microservice deployments,
canary releases, blue/green deployments, and progressive delivery patterns.
● Lead and coordinate complex incident response, root cause analysis (RCA), and post-incident reviews.
● Establish and enforce cloud security architecture, access control models, credential management, and compliance baselines.
● Drive company-wide FinOps strategy, cloud cost optimization initiatives, budgeting, and forecasting models.
● Design and maintain advanced Disaster Recovery (DR), High Availability (HA), Multi-Region architecture, and Business Continuity Plans (BCP).
● Conduct infrastructure performance analysis, capacity planning, scalability testing, and proactive optimization.
● Lead technical reviews, design discussions, infrastructure roadmaps, and cross-team collaboration.
● Mentor and coach junior/mid-level engineers to build strong internal technical capabilities.
● Produce and maintain high-quality documentation, architectural diagrams, compliance evidence, and governance artifacts.
● Actively engage in security audits, compliance certification processes (ISO27001, SOC2, etc), and risk assessments.
Qualifications:
● Bachelor’s or Master’s degree in Computer Science, Engineering, or equivalent hands-on expertise.
● 6+ years of professional experience in Cloud Infrastructure, DevOps, SRE, or Platform
Engineering.
● Expert-level Linux/Unix administration and complex troubleshooting capabilities.
● Deep expertise with at least one major cloud platform (AWS strongly preferred; GCP/Azure a
plus).
● Solid multi-cloud architecture experience, including hybrid cloud, VPC peering, private link, service mesh.
● Expert in scripting, system automation, and workflow orchestration (Python, Go, Bash mandatory).
● Mastery of Infrastructure-as-Code (Terraform is non-negotiable; Pulumi, Ansible highly valuable).
● Expert Kubernetes knowledge (architecture, multi-cluster, custom controllers, operators, scaling
models).
● Advanced observability stack design: metrics, logs, tracing, and full-stack telemetry architecture.
● Security architecture expertise: IAM design, Zero Trust, identity federation, data encryption
models.
● Solid FinOps knowledge, budget ownership experience, and cost architecture skills.
● Extensive experience designing and operating High-Availability, DR, and Multi-Region
infrastructures.
● Proficient in large-scale CI/CD design, pipeline optimization, release strategies, and deployment
workflows.
● Demonstrated ability to lead technical discussions, design reviews, and cross-team projects.
● Strong ownership, leadership, and long-term thinking mindset.
● Excellent communication skill for fully remote (WFH) cross-functional collaboration.
● Highly disciplined in technical writing, documentation, SOP, and architectural record keeping.
Collaboration with Other:
● Engineering Leadership
● Product & Technical Teams
● DevOps, SRE, and Platform Engineering Squads
● Security & Compliance Organization
● Finance & FinOps Team
● Architecture Board & Governance Committees
● Incident Response & Postmortem Team
● External Auditors & Certification Bodies
Application Confirmation
You're applying for the role below: