
Fidelity Investments is hiring: Director, Observability Platform Engineering Tec
Fidelity Investments, Durham, CO, United States
Job Description: Director, Observability Platform Engineering Technical Lead The Role We are seeking a highly experienced, hands-on Technical Lead and Build Owner to lead a dedicated team of software engineers responsible for delivering core platform capabilities within Fidelity's Enterprise Observability Platform. This is not an SRE role; instead, you will focus on building and evolving the observability platform that Site Reliability Engineering and development teams depend on. In this role, you will define and drive the Observability Integrations roadmap, emphasizing scalable automation, security-by-design, and enterprise readiness across a complex hybrid-multi-cloud environment. You will lead the design, development, and support of enterprise-grade observability integrations with SaaS solutions such as Datadog, as well as open-source frameworks including OpenTelemetry (OTel) and Prometheus. The Expertise and Skills You Bring Technical Expertise
- Bachelor's degree in a technology-related field (Computer Science, Engineering, etc.) or equivalent experience.
- Extensive hands-on engineering experience with Java, Go, and/or Python.
- Deep engineering experience with commercial and open-source observability platforms, including:
- Agent lifecycle management
- Agent release processes
- Platform governance
- FinOps best practices
- Experience designing, enabling, and managing observability capabilities across diverse technology stacks at enterprise scale.
- Strong understanding of observability patterns and practices, including:
- Distributed tracing
- Metrics and logs pipelines
- Synthetics
- Real User Monitoring (browser and mobile)
- Demonstrated ability to coach, mentor, and lead engineering teams to build scalable, resilient platform solutions.
- Strategic, forward-thinking mindset with a strong ability to identify patterns, simplify architectures, and create long-term platform value.
- Ability to define platform roadmap that blends automation, usability, security, and cost efficiency.
- Deep expertise in building and integrating security controls in public cloud environments.
- Strong understanding of modern IT service management practices and enterprise technology landscapes, including:
- Cloud delivery models: IaaS, PaaS, SaaS
- Automation frameworks
- Container platforms
- Auto-scaling and compute orchestration
- Networking, storage, and identity/access management
- Configuration, incident, problem, and asset management
- Logging, auditing, and compliance frameworks
- Passion for technology and for delivering platform solutions that solve real business problems using cloud‑native architectures.
- Ability to work across organizational boundaries and communicate effectively with technical and non-technical stakeholders.
- Experience with Agentic AI a plus