Design, develop, and implement secure and scalable cloud infrastructure solutions on Azure and AWS using industry best practices
Automate infrastructure provisioning, configuration management, and deployments via Infrastructure as Code (IaC) tools like Terraform, Terragrunt, CloudFormation, and Pulumi
Design, develop, and maintain efficient CI/CD pipelines using GitHub Actions for automated builds, testing, and deployments
Author and maintain transparent, well-structured GitHub workflows that automate key development processes
Leverage reusable GitHub actions, integrate with external tools as needed, and optimize workflows for speed and reliability, ensuring timely and successful release cycles
Troubleshoot CI/CD issues and make continuous improvements to enhance pipeline efficiency and reliability
Collaborate with developers to ensure effective use of GitHub Actions and promote best practices for CI/CD
Implement monitoring and alerting systems to identify and resolve infrastructure issues proactively
Manage and optimize Datadog for monitoring, logging, and observability across our cloud infrastructure and applications
Design and implement stable, business-oriented monitoring dashboards that provide real-time insights into infrastructure health and application performance
Proactively configure alerts to notify relevant stakeholders of potential issues, ensuring timely resolution and minimizing downtime
Continuously evaluate and improve Datadog configurations and integrations to enhance monitoring coverage and effectiveness
Collaborate with data scientists and engineers to implement DataOps best practices
Troubleshoot and debug complex infrastructure and application problems
Collaborate with engineering teams to ensure smooth deployments and application uptime
Stay up to date with the latest cloud technologies and best practices and share your expertise by mentoring junior team members
Qualifications
5+ years of experience as a Cloud Platform Engineer or similar role
Cloud Expertise: Proven experience designing, building, and managing cloud infrastructure on Azure
IaC Tools: Expertise in Infrastructure as Code (IaC) tools like Terraform, CloudFormation, ARM, AWS SDK, Pulumi, and Ansible
CI/CD Tools: Hands-on experience with multiple CI/CD tools like GitHub, Jenkins, Azure DevOps, or AWS CodeBuild/CodePipeline
Containerization: Experience with containerization using Docker and orchestration tools like ECS, AKS, EKS, and Kubernetes
DevOps Principles: Strong understanding of DevOps principles and practices