
Senior Cloud Operations Engineer
Navteca, Houston, TX, United States
Senior Cloud Operations Engineer
The Senior Cloud Operations Engineer to support a NASA program as part of a contractor. Responsible for designing, implementing, and maintaining scalable, secure, and highly available cloud infrastructure across enterprise environments. This role leads operational excellence by leveraging DevSecOps practices, automation, and monitoring to ensure optimal system performance, reliability, and cost efficiency. The engineer collaborates with cross-functional teams to support software development, data platforms, and business-critical applications in fast-paced, complex environments.
Key Responsibilities
Design, deploy, and manage cloud infrastructure across AWS, Azure, and/or GCP environments
Ensure high availability, scalability, and reliability of cloud-based systems and services
Implement and maintain Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or Bicep
Build and optimize CI/CD pipelines to support automated deployments and releases
Monitor system performance and availability using observability tools; proactively resolve incidents and bottlenecks
Lead incident response, root cause analysis, and post-incident reviews
Enforce security best practices, including identity and access management (IAM), encryption, and compliance standards
Optimize cloud costs through resource management, rightsizing, and usage analysis
Collaborate with software engineering, DevOps, and data teams to support application and platform needs
Drive continuous improvement initiatives for operational efficiency and system resilience
Mentor junior engineers and provide technical leadership across teams
Required Qualifications
Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience)
5-10+ years of experience in cloud operations, DevOps, or infrastructure engineering
Hands-on experience with at least one major cloud provider (AWS, Azure, or GCP)
Strong knowledge of networking concepts (VPCs, subnets, load balancing, DNS)
Experience with containerization and orchestration (Docker, Kubernetes)
Proficiency in scripting or programming (Python, Bash, PowerShell, etc.)
Experience with CI/CD tools (GitHub Actions, Jenkins, Azure DevOps, etc.)
Familiarity with monitoring/logging tools (CloudWatch, Azure Monitor, Stackdriver, Datadog, etc.)
Strong understanding of security and compliance in cloud environments
Preferred Qualifications
Multi-cloud experience (AWS + Azure + GCP)
Certifications (e.g., AWS Certified Solutions Architect, Azure Administrator, Google Cloud Professional Engineer)
Experience supporting data science or big data platforms
Knowledge of Site Reliability Engineering (SRE) principles
Experience in highly regulated environments (finance, healthcare, government)
Core Competencies
Technical Leadership
Cloud Architecture & Operations
DevSecOps & Automation
Strategic Planning
Problem Solving & Incident Management
Agile & Release Management
Performance Optimization
Mentoring & Team Development
Benefits
Navteca offers a comprehensive benefits package, including:
Medical Insurance
Dental Insurance
Life and AD&D Insurance
Short-Term and Long-Term Disability (STD/LTD)
401(k) Retirement Plan
Paid Vacation
Paid Holidays
Paid Sick Leave
Comp/Flex Time
The Senior Cloud Operations Engineer to support a NASA program as part of a contractor. Responsible for designing, implementing, and maintaining scalable, secure, and highly available cloud infrastructure across enterprise environments. This role leads operational excellence by leveraging DevSecOps practices, automation, and monitoring to ensure optimal system performance, reliability, and cost efficiency. The engineer collaborates with cross-functional teams to support software development, data platforms, and business-critical applications in fast-paced, complex environments.
Key Responsibilities
Design, deploy, and manage cloud infrastructure across AWS, Azure, and/or GCP environments
Ensure high availability, scalability, and reliability of cloud-based systems and services
Implement and maintain Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or Bicep
Build and optimize CI/CD pipelines to support automated deployments and releases
Monitor system performance and availability using observability tools; proactively resolve incidents and bottlenecks
Lead incident response, root cause analysis, and post-incident reviews
Enforce security best practices, including identity and access management (IAM), encryption, and compliance standards
Optimize cloud costs through resource management, rightsizing, and usage analysis
Collaborate with software engineering, DevOps, and data teams to support application and platform needs
Drive continuous improvement initiatives for operational efficiency and system resilience
Mentor junior engineers and provide technical leadership across teams
Required Qualifications
Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience)
5-10+ years of experience in cloud operations, DevOps, or infrastructure engineering
Hands-on experience with at least one major cloud provider (AWS, Azure, or GCP)
Strong knowledge of networking concepts (VPCs, subnets, load balancing, DNS)
Experience with containerization and orchestration (Docker, Kubernetes)
Proficiency in scripting or programming (Python, Bash, PowerShell, etc.)
Experience with CI/CD tools (GitHub Actions, Jenkins, Azure DevOps, etc.)
Familiarity with monitoring/logging tools (CloudWatch, Azure Monitor, Stackdriver, Datadog, etc.)
Strong understanding of security and compliance in cloud environments
Preferred Qualifications
Multi-cloud experience (AWS + Azure + GCP)
Certifications (e.g., AWS Certified Solutions Architect, Azure Administrator, Google Cloud Professional Engineer)
Experience supporting data science or big data platforms
Knowledge of Site Reliability Engineering (SRE) principles
Experience in highly regulated environments (finance, healthcare, government)
Core Competencies
Technical Leadership
Cloud Architecture & Operations
DevSecOps & Automation
Strategic Planning
Problem Solving & Incident Management
Agile & Release Management
Performance Optimization
Mentoring & Team Development
Benefits
Navteca offers a comprehensive benefits package, including:
Medical Insurance
Dental Insurance
Life and AD&D Insurance
Short-Term and Long-Term Disability (STD/LTD)
401(k) Retirement Plan
Paid Vacation
Paid Holidays
Paid Sick Leave
Comp/Flex Time