Video Manager, NOC Video Manager
Neptune Holdings US Corp, Bethpage, NY, US, 11714
Duration: Full Time
Video Manager, NOC Video Manager
The SRE Manager Video Platform leads a high-performing team responsible for ensuring the reliability, scalability, and performance of video delivery infrastructure across consumer-facing services. This role blends deep technical expertise with leadership, driving continuous improvement in automation, monitoring, and fault-tolerant design. The ideal candidate will have experience managing systems at scale, working across DevOps and video engineering disciplines, and aligning platform resilience strategies with business objectives.
As a key stakeholder in service availability, this manager will guide teams in adopting SRE best practices (SLAs, SLOs, error budgets), championing automation to reduce operational toil, and overseeing incident response for video-related services. They will also collaborate with product, network, and application engineering groups to define system architecture that meets evolving content delivery demands.
Responsibilities
- Lead and mentor a team of SREs supporting large-scale video delivery platforms across live, linear, and on-demand services.
- Oversee observability, monitoring, alerting, and capacity planning for video pipeline components including:
- Video ingest (satellite, IP)
- Encoding/transcoding
- DRM packaging
- CDN edge delivery
- Playback monitoring
- Ensure performance, scalability, and fault tolerance of services like origin servers, manifest manipulation, and just-in-time packaging.
- Build and optimize CI/CD pipelines for deploying changes to video platform components and configurations.
- Define and manage SLOs, SLAs, and error budgets for video services; lead RCA and postmortem processes.
- Work closely with video operations, development, CDN, and infrastructure teams to resolve incidents and drive long-term fixes.
- Automate deployment, failover, and remediation procedures across distributed systems and data centers.
- Partner with product and engineering teams to validate service changes and improvements before rollout.
- Represent the team in strategic planning, architecture reviews, and executive updates.
- Manage performance reviews, hiring, and career growth for SRE team members.
- Ensure compliance with operational and security standards across services.
Qualifications
Education & Experience
- Bachelor's degree in Computer Science, Electrical Engineering, or related technical field (Master's preferred).
- 7+ years of experience in site reliability, systems, or software engineering roles.
- 3+ years of people leadership in technical teams, preferably within media delivery or video platforms.
- Proven experience supporting high-availability video systems with millions of concurrent users.
- Strong background in video quality monitoring, QoE metrics, and analytics.
Technical Expertise
- In-depth knowledge of video encoding/transcoding (H.264, H.265/HEVC), ABR ladder design, and packaging standards.
- Familiarity with video delivery technologies and platforms (e.g., Harmonic, MediaKind, AWS Elemental, Wowza, Broadpeak, Red5, or custom FFmpeg-based pipelines).
- Proficiency in Linux environments and programming/scripting (Python, Go, Bash).
- Experience with monitoring and telemetry systems: Prometheus, Grafana, ELK stack, DataDog, or similar.
- Working knowledge of CDN management (e.g., Akamai, Cloudflare, Fastly) and edge caching strategies.
- Infrastructure-as-Code experience (Terraform, Ansible) and CI/CD tooling (Jenkins, GitLab, ArgoCD)
Leadership & Communication
- Ability to lead distributed teams and drive complex network reliability programs.
- Excellent communication, collaboration, and cross-functional influence skills.
- Strong problem-solving mindset with a focus on reducing MTTR and improving operational KPIs.
- Experience managing on-call rotations and conducting operational readiness reviews.
Location: Bethpage, NY, US, 11714
Brand: Optimum
Nearest Major Market: Long Island
Nearest Secondary Market: New York City