
Senior/Staff Applied GenAI Researcher – Enterprise Outcome Team
Job Description
About TrueFoundry
Every production AI system, whether it's powering customer support, writing code, analyzing financial data, or diagnosing medical conditions, needs the same foundational infrastructure. A way to route between models. A way to manage tools and integrate them securely. A way to orchestrate agents and enforce governance. A unified compute layer to run it all.
That infrastructure layer is being built right now.
We're TrueFoundry, and we're building it. We're looking for a Senior/Staff Applied GenAI Researcher to join our Enterprise Outcome Team.
The Problem We're Solving
Companies are moving beyond simple chatbots to production agentic systems. These systems route between OpenAI, Anthropic, Google, and self-hosted models. They integrate dozens of tools via protocols like MCP. They orchestrate multi-agent workflows where agents coordinate with other agents.
The infrastructure to support this doesn't exist yet. You can't just duct-tape together a few API calls and call it production-ready.
You need a control plane that handles:
- Intelligent routing with observability, cost policies, and fallback logic
- Centralized tool and MCP server management with security and lifecycle controls
- Agent orchestration with governance and guardrails
- A unified compute layer to run self-hosted models, custom tools, and agents
We've built two products to solve this:
AI Gateway is the control plane: five composable components (Prompts, LLM Gateway, MCP Gateway, Guardrails, Agent Gateway) that handle routing, orchestration, and governance.
AI Deploy is the compute layer: a Kubernetes-based platform that abstracts ML workloads as standard software primitives, so everything runs on unified infrastructure.
We're Series A, backed by Intel Capital and Sequoia. Companies like CVS, Mastercard, Siemens, Paytm, Synopsys, and Zscaler run production AI workloads on our platform.
What You’ll Do:
- Build and productionize LLM-based and ML-based solutions, utilizing both open-source and proprietary models
- Integrate TrueFoundry’s platform seamlessly into customer environments and use it to shorten the time to value for these applications
- Build agents, write prompts and eval sets, and optimize inference time and response quality for applications
- Write maintainable, production-quality, high-performance code, most often in Python
- Build and optimize REST APIs, gRPC services, and data pipelines
- Drive rapid feedback loops from customer deployments into continuous improvements for product and platform
- Participate in solution architecture design, code reviews, and engineering best practices adoption
Who You Are:
- 4+ years of experience building and deploying ML applications in production
- 4+ years of experience writing production code in Python
- 2+ years working in deep learning and natural language processing
- 1+ years of experience building agentic applications and GenAI apps
- Experience building REST APIs, working with Docker, and setting up CI/CD pipelines
- Deep familiarity with PyTorch and Hugging Face libraries
- Working knowledge of model servers such as vLLM, Triton, and TensorRT is preferred
- Understanding of Kubernetes, distributed systems architecture, and cloud-native technologies is preferred
- Strong system design abilities, with a focus on modular, reliable, and scalable architecture
- Passionate about applying AI to solve real-world, cross-industry problems
- Familiarity with LLM fine-tuning, RAG (Retrieval-Augmented Generation), prompt engineering, or evaluation frameworks
Why Join TrueFoundry
- Build foundational Applied GenAI solutions alongside world-class engineers (ex-Facebook Infrastructure leaders)
- Work on real-world, high-impact problems across multiple industries
- Collaborate directly with founders and early leadership on shaping company and product direction
- Enjoy a flexible, ownership-driven work environment with rapid career growth
- Weekly learning sessions, team-building activities, and startup mentorship opportunities
- Learning credits and resources to help you grow your technical and professional skills
Job Details
- Category: Aerospace Engineering
- Employment Type: Full Time
- Location: San Mateo
About TrueFoundry
TrueFoundry provides an enterprise-grade AI Gateway that encompasses an LLM Gateway, MCP Gateway, and Agent Gateway, enabling enterprises to securely connect, observe, and govern access to models, tools, guardrails, and agents from a single control plane. Beyond the gateway layer, TrueFoundry enables organizations to deploy and train custom LLMs on GPUs, host MCP servers, and run custom agents, all through a Kubernetes-native interface. It supports on-premises and VPC installations for both AI Gateway and deployment environments. TrueFoundry ensures enterprise-grade compliance with SOC 2, HIPAA, and ITAR standards. With built-in autoscaling, caching, and resource optimization, TrueFoundry empowers organizations to build, deploy, and govern AI systems securely, efficiently, and on a future-safe stack. Leading Fortune 1000 companies such as ResMed, Siemens Healthineers, Automation Anywhere, Zscaler, and NVIDIA trust TrueFoundry to accelerate innovation and deliver AI at scale. To learn more about TrueFoundry, visit truefoundry.com.