Luminal

Infrastructure B2B Software and Services2 open

3+ employeesSeed raised

“Making AI run fast on any hardware.”

Making AI run fast on any hardware.

YC S25

About the Company

Luminal (YC S25) is focused on optimizing AI models to accelerate and simplify model deployment. We are building an AI compiler that enhances model speeds by 10x and streamlines deployment to production with just a single line of code. Our mission is to turn AI research code into production code, seamlessly.

Role Description

This is a full-time on-site role for a Founding Engineer located in downtown San Francisco. The Founding Engineer will be responsible for assisting the design of the core compiler. Day-to-day tasks will include writing CUDA kernels, conducting model performance reviews, and shitposting on social media.

Tech Stack

Luminal uses a search based approach to generate, tune, and verify GPU kernels so engineers do not have to hand write CUDA.

Search based approach

Express computations in a small IR, then generate candidate kernels via equality saturation rewrite rules (tiling, unrolling, vectorization, memory layout).
Guide exploration with cost models and bandit style search to find the fastest valid kernels for a target GPU.
Compile and benchmark candidates on real hardware, enforce correctness with property tests and equivalence checks, and keep strict shape and dtype constraints.
Cache, version, and reuse the best kernels across models and deployments with full reproducibility.

Tech stack

Compiler and runtime: Rust and egglog based compiler generating GPU kernels. Using a lightweight IR with e-graph style rewrites to search and benchmark kernels.
Backends: CUDA and Metal in production today. Other backends in progress.

Founders

Joe Fioti

Generating GPU kernels automatically to speed up ML models. Ex-Intel, worked on CPU microcode and ML accelerators.

Matthew Gunton

Co-founder at Luminal AI. Ex-Amazon engineer, with globally deployed projects automatically finding issues in the Amazon fulfillment network and cost effectively fixing them

Jake Stevens

Cofounder at Luminal: generating GPU kernels automatically to speed up ML models. Ex-Apple. Talk to me about donuts or compilers or both :)

Open Positions at Luminal (2 Jobs)

Luminal

Compiler Engineer

San Francisco, CA, US$150K - $350K FULL_TIME

19 hours ago

Luminal

Cloud Inference Engineer

San Francisco, CA, US$150K - $350K FULL_TIME

19 hours ago

Ready to start your space career at Luminal?

View Luminal Jobs Browse All Space Jobs