
Observability Engineer
Booz Allen Hamilton, Mc Lean, VA, United States
Your growth matters to us - explore our career development opportunities.
BE EMPOWERED TO SUCCEED
Connect with others in our people-first culture and enhance our collective ingenuity.
SUPPORT YOUR WELLBEING
Learn how we’ll support you as you pursue a balanced, fulfilling life.
YOUR CANDIDATE JOURNEY
Discover what to expect during your journey as a candidate with us.
So met hing breaks at 2 AM. Today, a human gets paged. Tomorrow, an AI agent detects the anomaly, correlates the root cause, triggers the remediation, and closes the ticket, all before the first cup of coffee. You are the engineer who builds that tomorrow.
We are seeking a senior Observability Engineer with expertise in both AI technologies and enterprise performance monitoring. This role combines hands‑on engineering with AIOps implementation to deliver full‑stack visibility across 250+ services. You will lead efforts to implement predictive monitoring and self‑healing capabilities that drive down operational costs while increasing system availability by leveraging AI to triage and resolve incidents. You will mentor and supervise engineers, own technical quality, and push the program to warm AI‑driven observability with opportunities to build new observability platforms from the ground up as we expand into new environments. Due to the nature of work performed within this facility, U.S. citizenship is required.
Join us. The world can’t wait.
YOU HAVE:
5+ years of experience in enterprise observability, monitoring, and site reliability engineering
Experience architecting and operating Dynatrace for full-stack observability, including agent deployment, distributed tracing, log management, synthetic monitoring, and digital experience monitoring
Experience implementing AIOps workflows, including predictive alerting, anomaly detection, automated remediation, and incident automation
Experience building operational and executive dashboards and implementing SLOs and SLAs
Experience working in Agile environments with sprint-based delivery
Knowledge of network monitoring protocols, including SNMP, SNMP traps, NetFlow, and Syslog
Ability to mentor engineers, conduct code reviews, and take accountability for technical delivery and quality
Bachelor's degree in a Computer Science or Information Technology field
Nice If You Have:
Experience with advanced Dynatrace platform capabilities, including Grail, Smartscape, Davis AI, OpenPipeline, DQL, Workflow Automation, Platform API, AppSec, Session Replay, Grail-powered RUM, AI Observability, and Grail log management
Experience with Dynatrace Intelligence, including Dynatrace Assist, Intelligence Agents, MCP Server integration, and Dynatrace Apps development using the App Toolkit
Experience deploying and building observability platforms from scratch in government cloud environments such as AWS GovCloud, Azure Government, or IL4/IL5, including air‑gapped, restricted network, and STIG-hardened deployments
Experience building self‑service onboarding portals for application team observability adoption
Experience with open‑source observability tooling, including OpenTelemetry, Prometheus, Grafana, and ELK/EFK
Experience with FinOps practices, containerization, and cloud platforms such as AWS, Azure, or GCP
Experience operating Splunk and Splunk Enterprise Security (SIEM), Cribl, and SolarWinds at enterprise scale
Dynatrace Professional or Master certification or ServiceNow Certified Implementation Specialist - Event Management (CIS‑EM) Certification
Compensation
Salary range $86,800.00 to $198,000.00 (annualized USD).
Commitment to Non-Discrimination
All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, local, or international law.
#J-18808-Ljbffr
BE EMPOWERED TO SUCCEED
Connect with others in our people-first culture and enhance our collective ingenuity.
SUPPORT YOUR WELLBEING
Learn how we’ll support you as you pursue a balanced, fulfilling life.
YOUR CANDIDATE JOURNEY
Discover what to expect during your journey as a candidate with us.
So met hing breaks at 2 AM. Today, a human gets paged. Tomorrow, an AI agent detects the anomaly, correlates the root cause, triggers the remediation, and closes the ticket, all before the first cup of coffee. You are the engineer who builds that tomorrow.
We are seeking a senior Observability Engineer with expertise in both AI technologies and enterprise performance monitoring. This role combines hands‑on engineering with AIOps implementation to deliver full‑stack visibility across 250+ services. You will lead efforts to implement predictive monitoring and self‑healing capabilities that drive down operational costs while increasing system availability by leveraging AI to triage and resolve incidents. You will mentor and supervise engineers, own technical quality, and push the program to warm AI‑driven observability with opportunities to build new observability platforms from the ground up as we expand into new environments. Due to the nature of work performed within this facility, U.S. citizenship is required.
Join us. The world can’t wait.
YOU HAVE:
5+ years of experience in enterprise observability, monitoring, and site reliability engineering
Experience architecting and operating Dynatrace for full-stack observability, including agent deployment, distributed tracing, log management, synthetic monitoring, and digital experience monitoring
Experience implementing AIOps workflows, including predictive alerting, anomaly detection, automated remediation, and incident automation
Experience building operational and executive dashboards and implementing SLOs and SLAs
Experience working in Agile environments with sprint-based delivery
Knowledge of network monitoring protocols, including SNMP, SNMP traps, NetFlow, and Syslog
Ability to mentor engineers, conduct code reviews, and take accountability for technical delivery and quality
Bachelor's degree in a Computer Science or Information Technology field
Nice If You Have:
Experience with advanced Dynatrace platform capabilities, including Grail, Smartscape, Davis AI, OpenPipeline, DQL, Workflow Automation, Platform API, AppSec, Session Replay, Grail-powered RUM, AI Observability, and Grail log management
Experience with Dynatrace Intelligence, including Dynatrace Assist, Intelligence Agents, MCP Server integration, and Dynatrace Apps development using the App Toolkit
Experience deploying and building observability platforms from scratch in government cloud environments such as AWS GovCloud, Azure Government, or IL4/IL5, including air‑gapped, restricted network, and STIG-hardened deployments
Experience building self‑service onboarding portals for application team observability adoption
Experience with open‑source observability tooling, including OpenTelemetry, Prometheus, Grafana, and ELK/EFK
Experience with FinOps practices, containerization, and cloud platforms such as AWS, Azure, or GCP
Experience operating Splunk and Splunk Enterprise Security (SIEM), Cribl, and SolarWinds at enterprise scale
Dynatrace Professional or Master certification or ServiceNow Certified Implementation Specialist - Event Management (CIS‑EM) Certification
Compensation
Salary range $86,800.00 to $198,000.00 (annualized USD).
Commitment to Non-Discrimination
All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, local, or international law.
#J-18808-Ljbffr