
Infrastructure Vendor Ops Manager
Job Description
About The Role
Together AI is scaling its GPU infrastructure rapidly, working with a growing network of compute suppliers. As we expand, we need someone who owns the operational and financial accountability layer of our vendor relationships: tracking SLA compliance, managing credits, auditing invoices, and ensuring every dollar we spend on compute is accurate and accounted for.
This role sits within the Infrastructure Strategy team and is highly cross-functional, working with infrastructure engineering, finance, and go-to-market teams. When incidents happen, our engineering team produces root-cause analyses; your job is to take that technical detail, build an airtight case for credit claims, and negotiate directly with providers until credits are recovered. You will also partner with GTM and finance to assess the downstream impact of service disruptions and inform how we handle customer-facing commitments. This requires someone with sharp attention to detail, comfort navigating technical documentation, and the persistence to hold vendors accountable.
Responsibilities
- SLA tracking and credit recovery across all GPU compute and data center suppliers, including monitoring uptime and performance commitments, documenting violations, and driving credit claims to resolution
- Invoice review and validation for compute infrastructure contracts, flagging discrepancies and resolving billing issues directly with vendors.
- Regular audits of vendor contracts and SLA performance to verify accuracy of charges and identify cost recovery opportunities
- Using root-cause analyses prepared by the infrastructure engineering team to build the case for SLA credits, then negotiating directly with providers to recover them
- Partnering with GTM and finance to assess the downstream impact of supplier service disruptions and provide the data needed to inform customer-facing remediation decisions
- Building tracking systems and dashboards for vendor financial data, SLA metrics, and credit status across the supplier portfolio, using modern tooling and AI-assisted workflows where possible
- Cross-functional coordination with procurement, legal, and finance to ensure contract terms are properly reflected in billing and that SLA remedies are enforced
- Historical spend analysis and cost forecasting to support operating plan development and infrastructure budget planning
- Process development for invoice review, SLA monitoring, and vendor financial operations as the function scales
Requirements
- 4+ years of experience in vendor operations, technical program management, or contract compliance in a technology infrastructure, cloud, or data center environment
- Direct experience managing SLA credit processes, invoice reconciliation, and vendor performance tracking with infrastructure or cloud providers
- Extreme attention to detail. You catch discrepancies others miss, whether in an invoice, a vendor SLA report, or a contract clause
- Enough technical fluency to read postmortems and incident reports, understand the engineering context, and translate that into a compelling case for credit recovery.
- Strong negotiation skills and persistence in vendor-facing conversations, especially when disputing charges or arguing for SLA credits
- Proficiency with project management and financial tracking tools (e.g., Linear, JIRA, NetSuite, or similar). Comfort using AI tools to accelerate workflows
Nice to Have
- Experience with GPU compute or cloud infrastructure vendors specifically (colocation providers, cloud service providers, or hardware OEMs)
- Background in building vendor operations processes from scratch at a fast-growing company
- Familiarity with data center contract structures, including power and cooling pass-throughs, metered billing, and committed-use pricing
About Together AI
Together AI is an AI-native cloud company building the infrastructure to make AI faster, cheaper, and more accessible. We’re rapidly scaling our GPU footprint: signing our own data center leases, building large-scale clusters, and expanding toward a global owned-infrastructure presence. Our research team has contributed to breakthroughs like FlashAttention, Hyena, and RedPajama, and we co-design across software, hardware, and algorithms to push the frontier of AI efficiency.
Compensation
We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $170-200K + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our Privacy Policy at https://www.together.ai/privacy
Optimize Your Resume for This Job
Get a match score and see exactly which keywords you're missing
Job Details
- Department
- Operations
- Category
- Operations
- Employment Type
- Contract
- Location
- San Francisco, California, United States
- Posted
- May 13, 2026, 06:30 PM
About Together AI
Part of the growing frontier tech ecosystem pushing the edges of what's possible.
More Roles at Together AI





Similar Operations Roles



Found this role interesting?