Job Description

About TrueFoundry

Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to manage tools and integrate them securely. A way to orchestrate agents and enforce governance. A unified compute layer to run it all.

That infrastructure layer is being built right now.

We're TrueFoundry, and we're building it. We're looking for a Senior/Staff Applied GenAI Researcher – Enterprise Outcome Team to join the team.

The Problem We're Solving

Companies are moving beyond simple chatbots to production agentic systems. These systems route between OpenAI, Anthropic, Google, and self-hosted models. They integrate dozens of tools via protocols like MCP. They orchestrate multi-agent workflows where agents coordinate with other agents.

The infrastructure to support this doesn't exist yet. You can't just duct-tape together a few API calls and call it production-ready.

You need a control plane that handles:

Intelligent routing with observability, cost policies, and fallback logic
Centralized tool and MCP server management with security and lifecycle controls
Agent orchestration with governance and guardrails
A unified compute layer to run self-hosted models, custom tools, and agents

We've built two products to solve this:

AI Gateway is the control plane, five composable components (Prompts, LLM Gateway, MCP Gateway, Guardrails, Agent Gateway) that handle routing, orchestration, and governance.

AI Deploy is the compute layer, a Kubernetes-based platform that abstracts ML workloads as standard software primitives, so everything runs on unified infrastructure.

We're Series A, backed by Intel Capital and Sequoia. Companies like CVS, Mastercard, Siemens, Paytm, Synopsys, and Zscaler run production AI workloads on our platform.

What You’ll Do:

Build and productionize LLM-based and ML-based solutions, utilizing both open-source and proprietary models
Integrate TrueFoundry’s platform seamlessly into customer environments and leverage it to expedite the time to value of developing these applications
Build agents, write prompts, eval sets, optimize inference time and response quality for applications
Write maintainable production-quality high-performance code frequently in Python
Build and optimize REST APIs, gRPC services, and data pipelines
Drive rapid feedback loops from customer deployments into continuous improvements for product and platform
Participate in solution architecture design, code reviews, and engineering best practices adoption

Who You Are:

4+ years experience building and deploying ML applications in production.
4+ years experience writing production code in python
2+ years working in deep learning and Natural language processing
1+ year experience building Agentic applications and GenAI Apps
Experience building REST APIs, working with Docker, and setting up CI/CD pipelines

Deep familiarity with Pytorch, HuggingFace libraries

Working knowledge of model servers like vLLM, Triton, TensorRT is preferred
Understanding of Kubernetes, distributed systems architecture, and cloud-native technologies is preferred
Strong system design abilities, with a focus on modular, reliable, and scalable architecture
Passionate about applying AI to solve real-world, cross-industry problems
Familiarity with LLM fine-tuning, RAG (Retrieval-Augmented Generation), prompt engineering, or evaluation frameworks

Why Join TrueFoundry

Build foundational Applied GenAI solutions alongside world-class engineers (ex-Facebook Infrastructure leaders)
Work on real-world, high-impact problems across multiple industries
Collaborate directly with founders and early leadership on shaping company and product direction
Enjoy a flexible, ownership-driven work environment with rapid career growth
Weekly learning sessions, team-building activities, and startup mentorship opportunities
Learning credits and resources to help you grow your technical and professional skills

Optimize Your Resume for This Job

Get a match score and see exactly which keywords you're missing

Optimize Resume

Ready to Apply?

This will take you to the TrueFoundry application page

Apply on TrueFoundry

About TrueFoundry

TrueFoundry provides an enterprise-grade AI Gateway that encompasses an LLM Gateway, MCP Gateway, and Agent Gateway, enabling enterprises to securely connect, observe, and govern access to models, tools, guardrails, and agents from a single control plane. Beyond the gateway layer, TrueFoundry enables organizations to deploy and train custom LLMs on GPUs, host MCP servers, and run custom agents—all through a Kubernetes-native interface. It supports on-premise and VPC installations for both AI Gateway and deployment environments. TrueFoundry ensures enterprise-grade compliance with SOC 2, HIPAA, and ITAR standards. With built-in autoscaling, caching, and resource optimization, TrueFoundry empowers organizations to build, deploy, and govern AI systems securely, efficiently, and on a future-safe stack. Leading Fortune 1000 companies like Resmed, Siemens Healthineers, Automation Anywhere, Zscaler, Nvidia and others trust TrueFoundry to accelerate innovation and deliver AI at scale. To learn more about TrueFoundry, visit truefoundry.com.

Senior/Staff Applied GenAI Researcher – Enterprise Outcome Team

Job Description

The Problem We're Solving

What You’ll Do:

Who You Are:

Deep familiarity with Pytorch, HuggingFace libraries

Why Join TrueFoundry

Optimize Your Resume for This Job

Ready to Apply?

Job Details

About TrueFoundry

More Roles at TrueFoundry

Similar Aerospace Engineering Roles

See full Aerospace Engineering compensation at TrueFoundry

Career Guides