Job description
Join NexaCloud Solutions as a Senior DevOps Engineer and help us build a world-class cloud-native platform. You will own the automation, reliability, and security that power our customer-facing services.
We’re a fast-growing, product-led company that values engineering excellence, collaboration, and continuous improvement. This role offers a hybrid work model with a strong presence in our Austin, TX hub and generous remote options.
As a member of our Platform team, you’ll champion scalable architectures, implement best practices, and mentor junior engineers as we scale.
Responsibility
- Design, implement, and maintain scalable CI/CD pipelines across multi-cloud environments (AWS and GCP) to accelerate software delivery with quality and security baked in.
- Architect and manage infrastructure as code using Terraform, CloudFormation, or equivalent tooling to ensure repeatable, auditable deployments.
- Build and operate containerized environments with Kubernetes, Docker, and related tooling; optimize clusters for performance, reliability, and cost.
- Implement robust monitoring, logging, and tracing stacks (Prometheus, Grafana, ELK/OpenTelemetry) and define SLOs/SLIs with incident postmortems.
- Collaborate with software engineering, security, and SRE teams to drive reliability, security, and deployment velocity.
- Own incident response, runbooks, disaster recovery planning, and continuous improvement of on-call practices.
- Identify optimization opportunities to reduce toil, automate repeatable tasks, and improve cloud cost efficiency.
Qualification
- 5+ years of DevOps or platform engineering experience with cloud providers (AWS, GCP, or Azure).
- Hands-on expertise with Kubernetes, Docker, and container orchestration patterns.
- Deep knowledge of IaC tools such as Terraform, CloudFormation, or similar frameworks.
- Proficiency with CI/CD tooling (Jenkins, GitLab CI, GitHub Actions, or CircleCI) and modern automation practices.
- Strong foundation in monitoring and incident management stacks (Prometheus, Grafana, ELK/OpenTelemetry) and SRE fundamentals.
- Scripting and automation skills (Python, Bash, or Go); ability to develop reliable tooling to reduce manual toil.
- Excellent communication and collaboration skills with a customer-centric mindset and the ability to mentor peers.
- Bachelor’s degree in Computer Science, Engineering, or a related field or equivalent practical experience.