Skip to main content
artificial intelligence

Temporal's Job Posting Bans Data Scientists. Senior Engineers Report $340K Median.

By Sarah Mitchell

#Temporal's $300M Series D and the Agentic AI Infrastructure Hiring Blitz Are Quietly Building Enterprise AI's First Production-Grade Orchestration Workforce — and the $5B Valuation Signal Changes Everything

Valuation Inflection

Temporal closed a $300 million Series D, Temporal's blog reported, in February at a $5 billion post-money valuation, Temporal's figures put the valuation at $5 billion, double the $2.5 billion mark from four months earlier. Andreessen Horowitz led with Lightspeed and Sapphire; Sequoia, Index, Tiger Global, GIC, Madrona, and Amplify re-upped. Total raised: $650 million, GeekWire's data shows.

Revenue grew 380% year over year, GeekWire found. Weekly active usage climbed 350%. Installations rose 500% to 20 million monthly. The cloud platform has logged 9.1 trillion lifetime action executions, 1.86 trillion from AI-native companies, with spikes exceeding 150,000 actions per second absorbed without warning.

Investors aren't betting on the model layer. They're betting on the execution layer beneath it. "For long-running agents operating over extended horizons, the durability that Temporal provides is the difference between a compelling demo and a production system," a16z wrote. OpenAI's VP of App Infrastructure, Venkat Venkataramani, put it more bluntly: durability matters as much as performance.

The capital targets three things: expanding the open-source community, scaling Temporal Cloud, and deepening integrations with OpenAI, Vercel, Pydantic, and Braintrust. Raghu Raghuram, former VMware CEO and a16z general partner, joins as board observer. Headcount sits at 375 across Bellevue and remote.

The model wars have a winner. The orchestration war is just starting.

Why Orchestration Became the Bottleneck

A LangChain demo runs clean for five minutes. The first production outage rewrites the architecture. The gap between demo and production isn't the model, prompt, or tools. It's the orchestration layer. LLM calls are expensive (a single GPT-4o call with large context can exceed $0.10), non-deterministic, slow (30 seconds to two minutes), and fail in novel ways: rate limits, content filters, malformed JSON, hallucinated tool calls. A multi-step research agent that searches, synthesizes, seeks human approval, and publishes involves five discrete steps, each capable of failing independently. Without orchestration, you have a fragile chain of API calls wrapped in try/except blocks, with no state persistence, no crash recovery, no ability to pause for human input.

Airflow was built for batch ETL, not agent reasoning that branches on non-deterministic outputs. LangChain and LlamaIndex provide agent frameworks but don't persist event history, replay deterministically after crashes, or apply bounded retries and timeouts to each step. Custom queues and homegrown state machines force teams to rebuild durability, compensation, and observability from scratch every time.

Production workflows need five capabilities existing tools don't deliver together: state persistence so a crash at step four doesn't lose work from steps one through three; configurable retry logic that respects rate limits and budget; compensation (rollback) that undoes side effects when step four fails; human-in-the-loop pauses that can last hours or days and resume exactly where they left off; and observability that traces every LLM call, tool invocation, and state transition.

A two-layer architecture is emerging. Temporal provides the outer orchestration layer: durable execution, fault tolerance, distributed parallelism, long-running lifecycle management, and cross-service coordination with Saga compensation. LangGraph provides the inner agent logic layer: graph-based state management, conditional routing, checkpointing, interrupts for human review, and fan-out parallelism via the Send API. Each Temporal activity can contain a complete LangGraph StateGraph handling nuanced agent reasoning, while Temporal guarantees the overall pipeline completes despite worker restarts and network failures.

Thresholds are concrete. Use LangGraph alone for single-step, read-only tasks finishing under 30 seconds with low failure cost. Add Temporal once a workflow exceeds 30 seconds, touches three or more external systems, pauses for multi-hour or multi-day approvals, or performs production side effects — deploys, deletes, payments, external messaging — where automatic retries would cause damage. Below those thresholds, the operational overhead of a second runtime isn't justified. Above them, it's the difference between a system that recovers and one that loses data at 3 AM.

Hiring Blitz: Four Pillars

Temporal's Greenhouse board lists 54 open roles, a pace matching the Series D's implicit mandate to scale the team behind the platform. Hiring splits across four pillars mirroring the product's surface area: core distributed-systems engineering, SDK and framework development, solutions architecture, and developer relations.

Core Engineering: Distributed Systems, Not "AI Engineering"

The AI SDK team — advertising a Staff Software Engineer, AI Foundations (US remote) and a Software Engineer, AI SDK at $160K–$200K (US only) — sits inside Engineering, not a separate AI org. The job description makes the distinction explicit: "What You Won't Do: Work as a Data Scientist, Data Analyst, DevOps SWE, or SRE." The role demands 3+ years production-grade software experience, deep concurrency fluency (mutexes, atomics), and a track record of open-source API design; Python and TypeScript are primary languages. A parallel Staff Software Engineer, Infrastructure Foundations role and two Open Source Server positions signal continued investment in the control plane and persistence layer underpinning durable execution.

Observability and compute get dedicated tracks: Senior Engineering Manager, Compute; Senior and Staff Observability engineers; Senior Staff Engineer, Infrastructure. Cloud Data Storage and Security (Application Security, GRC, Cloud Security) round out platform hardening hires.

Solutions Architecture: Enterprise Pull

Six Solutions Architect roles span geography and segment: Senior Solutions Architect, Commercial (SF); Senior Solutions Architect, Growth (West); Senior Solutions Architect, New Logo (West); Staff Solutions Architect, Enterprise (East); Staff Solutions Architect, Growth (Central); plus Senior Manager, Solutions Architecture, Growth. Titles reveal the motion: "Growth" and "Growth" and "New Logo" map to expansion and acquisition; "Enterprise" and "Commercial" to account depth. All are US-remote except where noted. A Director of Global Developer Success and three Developer Success Engineers close the post-sale loop.

Developer Relations: Geographic Coverage as Strategy

Four Staff-or-above Developer Advocate roles stake out regions: Principal Developer Advocate, AI (San Francisco); Staff Developer Advocate, Enterprise (US remote); Staff Developer Advocate, Enterprise (APJ, Singapore); Staff Developer Advocate, Enterprise (EMEA, Berlin). The AI-specific advocate in San Francisco sits near the densest concentration of LLM application builders; regional enterprise advocates mirror the sales footprint (Account Executives in Paris, Berlin, London, Singapore, Austin, Chicago). This isn't community marketing — it's technical field enablement for accounts running Temporal in production.

Location Reality: Remote-First, Seattle-Anchored

Temporal calls itself fully remote. The data bears it out: every engineering and solutions role lists "United States - Remote Opportunity" or "US/Canada Remote." But the headcount cluster is real: 62 to 65 of ~375 employees sit in the Seattle/Bellevue corridor. The Staff Developer Success Engineer role explicitly requires Seattle presence. The Series D hasn't changed policy; it's funded the travel budget (one to two company offsites per year).

The Hybrid Profile Emerging

Job specs converge on a profile that doesn't exist in standard taxonomies: engineers who model long-running, stateful workflows and integrate LLM tool-calling, memory, and human-in-the-loop patterns and reason about distributed-systems correctness (replay determinism, saga compensation, versioning). The AI SDK posting asks for "deep experience with concurrent programming" and "passion for harnessing generative AI, particularly for agents and coding" and "experience designing APIs and writing documentation for publicly-available libraries." That triple intersection — distributed systems, developer-facing API craft, agentic application patterns — is what the Series D is buying.

Technical Stack: Workflows, Activities, SDK Surface

Temporal's programming model centers on a single guarantee: code that looks synchronous executes with durable, fault-tolerant semantics. A Workflow is a function that can sleep for months, survive process crashes, and resume exactly where it left off. Activities are the fallible, retryable side effects (API calls, database writes, LLM invocations) that the runtime replays until they succeed or exhaust policy.

Signals and Queries let external callers inject events or inspect state without blocking the Workflow. Updates, the newest primitive, combine both: a client sends a command, the Workflow validates and mutates state atomically, and the caller receives a result, all inside the same durability boundary. This replaces ad-hoc webhook handlers and polling loops plaguing agentic systems built on stateless functions.

SDKs expose these primitives in TypeScript, Go, Java, and Python. Each binds to the same gRPC protocol and server, so a Workflow written in Go can signal a TypeScript Workflow that queries a Java Activity. For LLM application builders, TypeScript and Python SDKs see heaviest use, as they map naturally to async/await patterns common in agent frameworks. Go remains the choice for high-throughput control planes where cold-start latency and memory profile matter.

Engineers moving into this stack spend their first weeks unlearning distributed-systems habits: no idempotency keys, no outbox tables, no manual checkpointing. The runtime handles replay determinism but demands pure functions: no random UUIDs, no wall-clock time, no unsandboxed I/O inside Workflow code. Violations surface as non-determinism errors during replay, not in production. That feedback loop is the real onboarding signal.

Enterprise Pull: Stripe, Datadog, Coinbase

Temporal crossed 1,000 customers in April 2024 and hit 1,500 two months later, 50% growth in eight weeks. The adoption curve isn't theoretical; it's measured in production traffic at companies that lose money when workflows stall.

Coinbase moved 300-plus namespaces across 300-plus services from a custom self-hosted Temporal cluster to Temporal Cloud with zero planned downtime on critical trading and payment paths. The migration, executed with Bitovi over a year, touched compliance (GDPR deletion, travel-rule checks, KYC refresh), settlements and payouts running hours to days, ETL and backfills spanning weeks, and infrastructure provisioning. Coinbase engineers had run their own Temporal cluster since the Cadence days; scaling, securing, and upgrading custom components was on track to become "its own Coinbase product." They migrated in three phases: custom self-hosted → Aurora-backed staging clusters → Temporal Cloud. Two strategies carried the load. Drain-and-switch worked for short-lived workflows: stop new starts, drain in-flight executions, flip traffic. For long-running, business-critical flows that couldn't pause, they ran dual Workers, a multi-client manager routing a configurable percentage of new workflow executions to Cloud while the legacy cluster finished the rest. At cutover, routing hit 72% with zero errors and zero stranded workflows. Security wasn't an afterthought: private links and mTLS between Coinbase VPCs and Temporal Cloud, certificate filters restricting namespace access to approved services, LDAP-gated namespace provisioning, consensus-based admin service requiring peer approval for any workflow operation, payload encryption via centralized codec server (with in-pod option for latency-sensitive teams). Every service team got an embedded migration engineer, shared runbooks, dashboards, and reconciliation scripts before cutover. "Observability equals transparency, and transparency builds trust in a migration process," the team wrote.

Netflix tells the other side. Before 2021, its Clouddriver system for cloud operations failed roughly 4% of the time due to transient failures. Complex deployments took days; a mid-pipeline failure meant re-running everything. By 2025, deployment failure rate dropped to 0.0001%, a four-and-a-half-order-of-magnitude improvement. The engineering team described it as "virtually eliminating this failure mode" and "a huge win for developer productivity, especially for teams with long and complex deployment pipelines." Temporal let them remove "lots of homegrown orchestration and retry logic built up over years." Netflix now runs millions of deployments across global infrastructure on Temporal, from Open Connect CDN operators to Live reliability teams.

Stripe built a platform on Temporal, forcing developers to version gRPC fields when adding them to interfaces to prevent history-replay errors. Their approach centers on reliability, safety, and ease of use for internal developers. Datadog runs internal workflows on Temporal. HashiCorp uses it for infrastructure orchestration. Alaska Airlines for operations systems. Twilio and Snap process billions of daily operations on it.

A company hits scale, its homegrown orchestration (Celery + Kafka, custom SAGA implementations, AWS Step Functions with JSON definitions, hand-rolled retry logic) becomes a maintenance liability, they evaluate commercial options, and choose Temporal. Attentive spent eight engineering-months last year maintaining their self-hosted Temporal deployment before moving to Cloud. Vodafone reported better performance and lower costs after the same switch. The managed service charges $25 per million actions and $1 per GB-day of storage, consumption-based with no infrastructure over-provisioning. Self-hosting requires running a minimum of seven components and over-provisioning for spikes.

Developers call it "brutal" and a "complete mental model shift" from task-based queues. But the companies absorbing that curve run the workflows that can't afford to fail.

Competitive Landscape

The term "workflow orchestration" obscures more than it clarifies. Airflow schedules DAGs. Step Functions executes state machines. LangGraph chains LLM calls. Temporal does something different: it guarantees that a program — arbitrary, imperative code — makes forward progress even when underlying compute fails. That guarantee is the category boundary for durable execution.

Durable execution means your code makes progress through process death. The engine lifts state out of memory and into a persistent log. Replaying the log rebuilds state.

Airflow: Scheduled Pipeline Default

Airflow owns the scheduled data-pipeline niche. Its DAG model, cron-first semantics, and operator ecosystem (hundreds of pre-built integrations for BigQuery, Snowflake, Postgres, Kubernetes) make it the right tool for "run this ETL every night at 2 AM." Airflow wins when work is a fixed graph of tasks on a schedule.

It loses when work is interactive, long-running, or stateful in ways a DAG cannot express. Airflow's scheduler is a single point of failure (HA exists but is newer). Task instances are separate processes with no shared memory; passing data means XCom (small values) or external storage. Conditional branches, loops over dynamic item counts, and human-in-the-loop signals are awkward or require workarounds. Teams building user-facing async APIs on Airflow usually migrate to Temporal within six months.

LangChain / LlamaIndex / LangGraph: Agent Frameworks, Not Infrastructure

The agent-framework layer sits above durable execution. They provide prompt templates, retrieval pipelines, tool-calling abstractions, and graph-based control flow for LLM reasoning. But they don't solve the durability guarantees that define the anti-pattern discussed later. LangGraph's checkpointing is a step toward durability, yet remains tied to the framework's own runtime and lacks the operational maturity of a dedicated engine.

The production pattern uses LangGraph for agent logic and Temporal for execution guarantee, with the agent becoming an activity (or child workflow) inside a durable execution boundary. This separation lets teams swap agent frameworks without rewriting the reliability layer.

AWS Step Functions: Managed, AWS-Native, State-Machine-Limited

Step Functions wins for small-to-medium orchestration entirely inside AWS. Zero infrastructure, tight integration with Lambda, ECS, SageMaker, DynamoDB, SNS. The console's execution graph is operationally excellent. Two modes — Standard (full history, up to one year) and Express (high throughput, no full history) — cover different cost/latency profiles.

Trade-offs are structural:

  • Lock-in: Amazon States Language (JSON DSL) is verbose for complex logic. CDK constructs and Workflow Studio help but don't change the model.
  • No signals: SendTaskSuccess/SendTaskFailure handles human-in-the-loop; general "signal a running workflow from outside" is clumsy.
  • Per-transition cost: Standard workflows charge per state transition. At high volume this gets expensive fast.
  • Expressiveness ceiling: No arbitrary code, no loops with dynamic bounds, no recursion. If your workflow wants to be code, not configuration, Step Functions fights you.
Queue-Based Patterns: Not Durable

Cloud Tasks, Cloud Run Jobs, SQS + Lambda, and similar queue-worker patterns handle fire-and-forget or short-lived retriable tasks. They don't provide:

  • Workflow-level state surviving worker crashes mid-execution
  • Deterministic replay with local variable restoration
  • First-class signals, queries, or human-in-the-loop
  • Visibility into multi-step execution history

They're building blocks. Teams needing durable execution end up rebuilding a poor-man's Temporal on top — exactly the "homegrown" anti-pattern research warns against.

Homegrown Solutions: Hidden Cost Center

Every senior engineer has built (or inherited) a custom queue with a Postgres state table, a retry loop, and a cron job to clean up stuck rows. It works for the first workflow. The second adds a signal. The third needs a timeout surviving deployments. The fourth needs visibility. By the fifth, the team maintains a half-baked workflow engine instead of their product.

This is the anti-pattern worth naming: using a Redis queue or custom state machine when you need crash-safe state, retries across process death, or signals. The workflow engine pays rent when you need those guarantees. If you don't, don't pay the complexity cost.

Comparative Table
Dimension Temporal Airflow Step Functions LangGraph / Agent Frameworks Homegrown (Queue + DB)
Core Model Imperative code (multi-SDK) DAG of tasks (Python) JSON state machine Graph / chain abstractions Ad-hoc
Durability Event log, full deterministic replay Task DB, no replay Managed state log Checkpointing (framework-specific) Manual / none
Long-Running Days, weeks, forever Minutes to hours typical Up to 1 year (Standard) Session-bound Fragile
Signals / Human-in-Loop First-class Via sensors, awkward Task tokens, limited Framework-dependent Manual
Language Go, Java, Python, TypeScript, .NET, PHP, Ruby Python (DAG definition) JSON / CDK Python, TypeScript Any
Self-Host Yes (complex) Yes (moderate) No N/A Yes
Managed Option Temporal Cloud Astronomer, MWAA, Cloud Composer Native AWS LangGraph Cloud, hosted offerings N/A
Scale (workflows/day) Millions Tens of thousands Millions Thousands Low
Dev Loop Fast (run locally) Slow (DB + scheduler) Slow (deploy-test) Fast (local) Fast initially
Dynamic Fan-Out Natural (loops, recursion) Dynamic task mapping (2.3+) Map state Natural Manual
Cost Model Infra + Cloud tier Infra Per-state transition Framework + infra Infra + engineering time
Typical Use General durable execution Scheduled data pipelines Small AWS-native orchestration Agent reasoning / LLM chains Early-stage / simple async
Where Each Wins

Temporal wins when you need arbitrary code, long-running workflows, signals, human-in-the-loop, and can run infrastructure (or pay for Temporal Cloud). This is the general case for most ML-platform async work: document processing pipelines, fine-tuning jobs, evaluation runs, deep-research workflows, order fulfillment, customer onboarding.

Airflow wins for that classic case. Classic ETL. Nightly model training. Weekly report generation. The DAG model fits perfectly and the operator ecosystem saves months of integration work.

Step Functions wins when you're all-in on AWS, workflow is small-to-medium (<100 states), you want zero infrastructure, and the definition expresses easily as a state machine.

Agent frameworks win for the LLM reasoning layer: prompt orchestration, tool calling, retrieval, memory. They lose for the execution guarantee layer.

Homegrown wins only for trivial async work fitting in a single retriable task. Anything beyond that is technical debt compounding.

Many mature ML platforms run both Temporal and Airflow; Temporal for interactive async work, Airflow for scheduled pipelines. They're complementary, not competitive. The category error is treating them as interchangeable.

Talent Market: Rise of the Agent Orchestration Engineer

Temporal's Series D and production deployments at Stripe, Datadog, and Coinbase have forced a new hiring category into existence. Companies building agentic systems no longer need just distributed-systems engineers or just ML engineers — they need engineers with that profile. The market calls this profile an "Agent Orchestration Engineer." Compensation data says it's scarce.

Levels.fyi data from July 2026 puts Temporal's median software engineer total compensation at $268K. Senior engineers report $340K (base $228K, stock $112K). A Seattle-based senior engineer with seven years experience logged $400K (base $240K, stock $160K). Software engineering managers top out at $620K. Vesting is standard four-year, 25% annually, with a ten-year post-termination exercise window.

Role Median Total Comp Base Stock (annual) Source
Software Engineer (median) $268K Levels.fyi
Senior Software Engineer $340K $228K $112K Levels.fyi
Senior Software Engineer (Seattle, 7 yrs) $400K $240K $160K Levels.fyi
Software Engineer II (Denver, 6 yrs) $240K $185K $55K Levels.fyi
Software Engineering Manager $620K Levels.fyi
Product Manager $251K Levels.fyi
Senior Solutions Architect (commercial) $200K–$250K Zero G Talent board

External postings for "Agent Orchestration Engineer" roles cite $200K–$340K in 2026, with demand accelerating faster than supply. Zero G Talent's board shows Temporal added six roles in the past week (primarily sales and solutions architecture), but the engineering hiring signal is visible in the senior solutions architect band at $200K–$250K.

The skill intersection is narrow. Candidates must understand workflow-as-code semantics (signals, queries, updates, retries, timeouts), operate comfortably in Go, TypeScript, Java, or Python SDKs, and design for non-determinism: idempotency keys, saga patterns, human-task queues. They also need to integrate LLM calls as activities that can be replayed, versioned, and compensated. Most distributed-systems engineers have never modeled a prompt chain. Most ML engineers have never debugged a workflow history replay.

The talent pipeline is thin. Universities don't teach durable execution. Bootcamps don't cover it. The only reliable source is engineers who have already run Temporal (or Cadence) in production. That pool is small, concentrated in Seattle, San Francisco, New York, and remote-first distributed teams. As more enterprises migrate off custom queues and Airflow DAGs, the bidding war for this profile will widen.


Working in AI? Zero G Talent tracks the openings: browse AI jobs, openings at Stripe and Temporal Technologies, and the people building the field.

Ready to Start Your Space Career?

Browse artificial intelligence jobs and find your next opportunity.

View artificial intelligence Jobs