
Senior AI Engineer
Job Description
Zep is the memory and context layer for AI agents. As a Senior AI Engineer, you'll build low-latency backend systems, operate them in production on AWS, and ship LLM-powered capabilities our customers depend on.
You'll have the opportunity to work on Graphiti (25K+ GitHub stars), Zep’s popular open-source context graph framework.
This is a senior backend role centered on running LLM workloads at significant scale. We're not hiring ML researchers or data scientists. We're hiring engineers who have already lived through the messy reality of taking an LLM application from demo to production.
What you'll do
- Ship product features end-to-end across backend services, APIs, data flows, and the supporting UI where it makes sense.
- Build and operate LLM-powered systems: extraction pipelines, evaluation harnesses, and reliability improvements running at scale.
- Contribute to system design for new components. Write the code, document the decisions, iterate.
- Improve production quality across performance, observability, and operational runbooks on AWS.
What we're looking for
- 6+ years of production engineering with a strong backend systems background. You've shipped services with real throughput and latency requirements.
- Master's in Computer Science or equivalent.
- Go and Python experience in real systems. You're comfortable working in critical-path code and tuning performance.
- Hands-on AI agent and LLM application experience. You've shipped a non-trivial agentic system to production. Not a prototype, not a thin wrapper over a chat-completion API. We expect concrete examples: multi-turn agent loops with tool calling, retrieval and context pipelines you tuned against real failures, eval harnesses you built to catch regressions, or production memory and state systems for agents.
- Working familiarity with the agent ecosystem: at least one agent SDK (LangGraph, Google ADK, Mastra, or similar), plus vector stores and eval tooling.
- Extremely comfortable with spec-driven agent coding, coding harnesses, and guiding agents to build complex products.
- Hands-on AWS experience in production: deployments, monitoring, scaling, cost and reliability tradeoffs.
Nice to have
- TypeScript experience for frontend or SDK work.
Tech stack: Go, Python, TypeScript, AWS.
This role is probably NOT a fit if:
- Your LLM experience is single-turn chat completions or RAG-as-a-feature.
- Your background is primarily in ML research or model training rather than shipping agent systems in production.
- You haven't operated production backend systems with real latency or throughput requirements.
Interview Process
We respect your time and keep our interview process tight and focused.
Screening Call (w/ Daniel, our Founder) → Team Calls (2-3 hours back-to-back, may include a presentation) → Decision Call (Daniel, again)
Job Details
- Category: Aerospace Engineering
- Employment Type: Full Time
- Location: San Francisco, United States / Remote (US) (Remote Available)
- Posted: May 7, 2026, 08:40 PM
- Listed: May 7, 2026, 07:40 PM
- Compensation: $180,000 - $250,000 per year