
Data Center Operations Cluster Manager, Amazon Dedicated Cloud
Navstar, Culpeper, VA, United States
Do you like helping U.S. government agencies implement innovative cloud computing solutions and solve technical problems? Are you committed to fundamentally transforming the way national security and defense mission agencies partner with industry to meet mission requirements? Do you have experience managing large, complex programs? Amazon Web Services (AWS) is seeking a Data Center Operations (DCO) Cluster Manager to join the AWS Infrastructure Services organization and serve as a technical resource and leader within data centers.
The DCO Cluster Manager is the senior leadership role for our compute operations teams within an AWS region that operates 24/7. You will have managerial responsibility for safety, security, availability, scaling, costs, and efficiency for your department. You will lead the team that installs, maintains, and decommissions network and server equipment in a safe, secure, and cost-effective manner across the region. The DCO Cluster Manager must manage across each function but also be able to dive deep into any given function as needed. The DCO Cluster Manager must be physically located near the region they are responsible for and be able to respond to any high-severity event and be on site within an hour.
The successful candidate will be a highly driven, self-managed individual who demonstrates initiative and proactively seeks solutions to problems. They will have a strong track record of developing talent and managing the performance of their direct reports and organization, including supporting a high cadence of developing people into new roles outside of the organization. Ideally, they have worked with ticketing systems and been involved in responses to high-severity operational events. In addition to having strong knowledge of data centers and a broad technical understanding of how networks and cloud architecture work, the candidate will create documentation, drive continuous improvement, participate in Inclusion and Diversity initiatives, and fix complex problems with simple solutions across multiple AWS regions. While not required, an understanding of critical electrical and HVAC systems will enhance a candidate's ability to be successful. This team works in an environment that operates 24/7.
This position requires that the candidate selected be a US citizen and currently possess and maintain an active Top Secret security clearance with SCI eligibility. The position further requires that, after start, the selected candidate obtain and maintain an active TS/SCI security clearance with polygraph and satisfy other security-related requirements.
Key job responsibilities include:
Hiring, managing, and developing the operations management team including DCO site managers, Decom managers, DCO technicians, and Decom technicians.
Overseeing the safety, security, availability, quality, and performance of the team, while driving a positive customer experience across a 24/7 shift schedule.
Prioritizing projects assigned to DCO teams and sites.
Routinely reviewing the ticket queue for large events and addressing problems accordingly.
Coordinating change management resources.
Guiding, training, and educating data center staff on the best practices related to all service owner issues.
Managing frontline managers. This includes mentoring, training, and developing career progression for both direct reports and members of the organization.
About the team:
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain - and we're looking for talented people who want to help. You'll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Basic qualifications:
4+ years of management experience
Knowledge of information technology infrastructure domains such as compute server platforms, storage server platforms, server components, network devices, technologies and architectures, IT service delivery principles and best practices
Experience hiring, developing, and managing high-performing technical teams
Current, active US Government Security Clearance of Top Secret with SCI eligibility or above
Preferred qualifications:
Knowledge of building codes and regulations including Life Safety, BOCA, NFPA, NEC, and OSHA
Experience owning the operation of a mission-critical team or product
Experience with large-scale technical operations or large-scale compute farms
Experience with process improvement techniques such as Kaizen, Lean Manufacturing or Six Sigma
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.