Job description
NovaTech Cloud is seeking a seasoned Senior DevOps Engineer to design, build, and scale secure, reliable cloud infrastructure for our rapidly growing platform. You will own the end-to-end lifecycle of CI/CD pipelines, infrastructure as code, and container orchestration, partnering with software engineers and security teams to deliver high-velocity, resilient software.
In this role, you’ll influence architectural decisions, mentor teammates, and help shape a culture of reliability, performance, and operational excellence. If you thrive in a fast-paced environment and are passionate about automation and cloud-native technologies, we want to hear from you.
What we offer: competitive salary, comprehensive benefits, generous stock options, flexible work arrangements, and ongoing learning opportunities.
Responsibility
- Design, implement, and maintain scalable CI/CD pipelines across multi-cloud environments (AWS, GCP, Azure) with a focus on stability, speed, and rollback capabilities.
- Champion infrastructure as code using Terraform, Pulumi, or CloudFormation to provision and manage cloud resources safely and versionably.
- Architect and administer container orchestration platforms (Kubernetes, Docker) ensuring high availability, security, and efficient resource utilization.
- Develop robust monitoring, logging, and alerting with Prometheus, Grafana, and ELK/EFK stacks to achieve observability and rapid incident response.
- Collaborate with development teams to optimize build times, release processes, and security controls across the software delivery lifecycle.
- Lead on-call rotations, incident management, and post-mortem analysis to drive continuous improvement.
- Identify cost optimization opportunities and implement governance for cloud spend and resource usage.
- Mentor and coach junior engineers, promoting DevOps best practices and knowledge sharing across the organization.
Qualification
- 5+ years in DevOps/SRE roles with hands-on cloud experience (AWS preferred; multi-cloud a plus).
- Strong expertise in Kubernetes cluster design, deployment, and operations; experience with Helm and service mesh is a plus.
- Proficiency with CI/CD tooling (GitHub Actions, GitLab CI, Jenkins) and scripting (Python, Bash, or similar).
- Extensive experience with Infrastructure as Code (Terraform, Pulumi, CloudFormation).
- Solid understanding of monitoring, logging, and incident response (Prometheus, Grafana, ELK/EFK, and on-call readiness).
- Security-minded approach: IAM, secret management, encryption, network segmentation, and compliance awareness.
- Bachelor's degree in Computer Science, Engineering, or related field, or equivalent practical experience.
- Strong communication and collaboration skills with a proactive, results-driven mindset.