Job description
NovaCloud Systems is seeking a Senior DevOps Engineer to join our Cloud Platform team. You will design, implement, and operate scalable infrastructure that powers global services. This role emphasizes automation, reliability, and collaboration with development teams to accelerate delivery while ensuring security and compliance.
As part of our mission to enable fast, safe software delivery, you will own CI/CD pipelines, manage containerized workloads, and drive improvements across multi-cloud environments. You will collaborate with SREs, software engineers, and security teams to build highly available, observable, and cost-efficient systems.
Responsibilities
- Design, implement, and maintain scalable CI/CD pipelines using GitHub Actions and Jenkins to accelerate secure software delivery.
- Architect and operate containerized platforms with Docker and Kubernetes across AWS, GCP, and Azure.
- Manage infrastructure as code with Terraform, Ansible, and CloudFormation to enable repeatable, auditable deployments.
- Ensure reliability, observability, and incident readiness with monitoring and logging stacks (Prometheus, Grafana, ELK) and on-call practices.
- Collaborate with development teams to optimize deployment strategies, release management, and rollback plans.
- Implement security best practices, including IAM, secrets management, and compliance controls, across all environments.
- Continuously optimize cost, performance, and scalability through automation and architectural improvements.
Qualifications
- 3+ years of DevOps experience in cloud-based environments (AWS, GCP, Azure) with hands-on production ownership.
- Strong Linux administration skills (Ubuntu/CentOS) and experience automating system administration tasks.
- Proficiency with container orchestration (Kubernetes) and container runtimes (Docker).
- Experience with infrastructure as code tools (Terraform, Ansible, CloudFormation).
- Familiarity with monitoring, logging, and tracing tools (Prometheus, Grafana, ELK, OpenTelemetry).
- Programming/scripting experience (Python, Bash, or similar) for automation.
- Excellent collaboration, communication, and problem-solving skills; ability to work across teams and time zones.