Mediabistro logo
job logo

Infrastructure Vendor Ops Manager

Together AI, San Francisco, CA, USA

Pay: $170,000-$200,000/yr

Job type: Contract


Together AI is scaling its GPU infrastructure rapidly, working with a growing network of compute suppliers. As we expand, we need someone who owns the operational and financial accountability layer of our vendor relationships: tracking SLA compliance, managing credits, auditing invoices, and ensuring every dollar we spend on compute is accurate and accounted for.

This role sits within the Infrastructure Strategy team and is highly cross‑functional, working with infrastructure engineering, finance, and go‑to‑market teams. When incidents happen, our engineering team produces root‑cause analyses; your job is to take that technical detail, build an airtight case for credit claims, and negotiate directly with providers until credits are recovered. You will also partner with GTM and finance to assess the downstream impact of service disruptions and inform how we handle customer‑facing commitments. This requires someone with sharp attention to detail, comfort navigating technical documentation, and the persistence to hold vendors accountable.

Responsibilities

SLA tracking and credit recovery across all GPU compute and data center suppliers, including monitoring uptime and performance commitments, documenting violations, and driving credit claims to resolution

Invoice review and validation for compute infrastructure contracts, flagging discrepancies and resolving billing issues directly with vendors.

Regular audits of vendor contracts and SLA performance to verify accuracy of charges and identify cost recovery opportunities

Using root‑cause analyses prepared by the infrastructure engineering team to build the case for SLA credits, then negotiating directly with providers to recover them

Partnering with GTM and finance to assess the downstream impact of supplier service disruptions and provide the data needed to inform customer‑facing remediation decisions

Building tracking systems and dashboards for vendor financial data, SLA metrics, and credit status across the supplier portfolio, using modern tooling and AI‑assisted workflows where possible

Cross‑functional coordination with procurement, legal, and finance to ensure contract terms are properly reflected in billing and that SLA remedies are enforced

Historical spend analysis and cost forecasting to support operating plan development and infrastructure budget planning

Process development for invoice review, SLA monitoring, and vendor financial operations as the function scales

Requirements

4+ years of experience in vendor operations, technical program management, or contract compliance in a technology infrastructure, cloud, or data center environment

Direct experience managing SLA credit processes, invoice reconciliation, and vendor performance tracking with infrastructure or cloud providers

Extreme attention to detail. You catch discrepancies others miss, whether in an invoice, a vendor SLA report, or a contract clause

Enough technical fluency to read post‑mortems and incident reports, understand the engineering context, and translate that into a compelling case for credit recovery.

Strong negotiation skills and persistence in vendor‑facing conversations, especially when disputing charges or arguing for SLA credits

Proficiency with project management and financial tracking tools (e.g., Linear, JIRA, NetSuite, or similar). Comfort using AI tools to accelerate workflows

Nice to Have

Experience with GPU compute or cloud infrastructure vendors specifically (colocation providers, cloud service providers, or hardware OEMs)

Background in building vendor operations processes from scratch at a fast‑growing company

Familiarity with data center contract structures, including power and cooling pass‑throughs, metered billing, and committed‑use pricing

About Together AI
Together AI is an AI‑native cloud company building the infrastructure to make AI faster, cheaper, and more accessible. We’re rapidly scaling our GPU footprint: signing our own data center leases, building large‑scale clusters, and expanding toward a global owned‑infrastructure presence. Our research team has contributed to breakthroughs like FlashAttention, Hyena, and RedPajama, and we co‑design across software, hardware, and algorithms to push the frontier of AI efficiency.

Compensation
We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full‑time position is: $170-200K + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job‑related knowledge.

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our Privacy Policy at https://www.together.ai/privacy

#J-18808-Ljbffr