
Monitoring and Incident Response Specialist
The One 23 Group, Herndon, VA, United States
Overview
The Monitoring and Incident Response Specialist provides real-time monitoring, incident response, and operational support for an enterprise network environment. This role supports the Monitoring and Incident Response Team (MIRT), which operates in a 24x7x365 environment to ensure network availability, performance, and security. The specialist monitors network infrastructure and services, investigates alerts and incidents, performs initial troubleshooting and root cause analysis, and escalates issues to appropriate engineering teams.
Responsibilities
Network and Service Monitoring: continuously monitor network infrastructure, applications, and services to ensure system availability and performance.
Monitor alerts generated by enterprise monitoring platforms and respond to operational events.
Track network performance metrics and identify anomalies or potential service disruptions.
Monitor enterprise infrastructure including routers, switches, firewalls, load balancers, and WAN circuits.
Incident Response and Troubleshooting: investigate alerts related to network outages, service degradation, and security events; perform initial triage and root cause analysis; troubleshoot connectivity issues and coordinate resolution with network engineering, security, and application teams; escalate critical incidents to appropriate support teams based on severity and impact.
Network Infrastructure Support: diagnose issues related to enterprise networking equipment including routers, switches, firewalls, and load balancers; assist with configuration updates and operational changes under established change management processes; utilize packet capture and network diagnostic tools to troubleshoot network anomalies.
Incident Documentation and Reporting: document incidents, troubleshooting actions, and resolution steps within the IT service management (ITSM) system; maintain detailed incident logs and operational reports for network and infrastructure events; provide updates to stakeholders regarding incident status, impact, and resolution timelines.
Operational Monitoring and Alert Management: monitor enterprise systems for health metrics including network availability, CPU utilization, memory usage, interface performance, and system alerts; investigate monitoring alerts and perform operational response procedures.
Requirements
Public Trust
Minimum 3–5 years of experience supporting network operations, IT infrastructure monitoring, or incident response
Experience working in enterprise IT environments supporting network or infrastructure operations
Equal opportunity employer, including disability/vets.
#J-18808-Ljbffr
The Monitoring and Incident Response Specialist provides real-time monitoring, incident response, and operational support for an enterprise network environment. This role supports the Monitoring and Incident Response Team (MIRT), which operates in a 24x7x365 environment to ensure network availability, performance, and security. The specialist monitors network infrastructure and services, investigates alerts and incidents, performs initial troubleshooting and root cause analysis, and escalates issues to appropriate engineering teams.
Responsibilities
Network and Service Monitoring: continuously monitor network infrastructure, applications, and services to ensure system availability and performance.
Monitor alerts generated by enterprise monitoring platforms and respond to operational events.
Track network performance metrics and identify anomalies or potential service disruptions.
Monitor enterprise infrastructure including routers, switches, firewalls, load balancers, and WAN circuits.
Incident Response and Troubleshooting: investigate alerts related to network outages, service degradation, and security events; perform initial triage and root cause analysis; troubleshoot connectivity issues and coordinate resolution with network engineering, security, and application teams; escalate critical incidents to appropriate support teams based on severity and impact.
Network Infrastructure Support: diagnose issues related to enterprise networking equipment including routers, switches, firewalls, and load balancers; assist with configuration updates and operational changes under established change management processes; utilize packet capture and network diagnostic tools to troubleshoot network anomalies.
Incident Documentation and Reporting: document incidents, troubleshooting actions, and resolution steps within the IT service management (ITSM) system; maintain detailed incident logs and operational reports for network and infrastructure events; provide updates to stakeholders regarding incident status, impact, and resolution timelines.
Operational Monitoring and Alert Management: monitor enterprise systems for health metrics including network availability, CPU utilization, memory usage, interface performance, and system alerts; investigate monitoring alerts and perform operational response procedures.
Requirements
Public Trust
Minimum 3–5 years of experience supporting network operations, IT infrastructure monitoring, or incident response
Experience working in enterprise IT environments supporting network or infrastructure operations
Equal opportunity employer, including disability/vets.
#J-18808-Ljbffr