Site Reliability Engineer

Compensation
$150,000–$200,000/year

Job Description

OneChronos is seeking a Site Reliability Engineer (SRE) to own the reliability, performance, and operational integrity of our trading platform across US, UK, and European market hours.

This role exists because our platform matters. It processes billions of dollars in securities transactions daily, and when something degrades or fails, the problem needs to be understood and corrected quickly, then prevented from recurring.

This is a senior, high-autonomy role for someone who treats production as a first-class system. You will operate at the intersection of software engineering and real-time systems, combining deep debugging ability with a bias toward automation, observability, and systemic reliability improvements. Over time, you will reduce operational burden on the engineering team by building the systems and practices that make the platform inherently more resilient.

About OneChronos (www.OneChronos.com)

OneChronos is a technology company of diverse thinkers innovating at the intersection of capital markets, mechanism design, and operations research. We work to grow global GDP by designing and operating matching markets that leverage advances in auction theory and artificial intelligence. Our software and systems are developed in-house and match billions of dollars of securities transactions per day.

What You'll Do

  • Own the reliability and performance of the trading platform across US, UK, and EU market hours
  • Design, build, and maintain observability systems (metrics, logging, tracing, alerting) that provide clear, actionable insight into system behavior
  • Triage, diagnose, and resolve production issues end-to-end across infrastructure, services, trade lifecycle, and market data systems
  • Lead incident response, including mitigation, root cause analysis, and post-mortems with a focus on systemic fixes
  • Build automation and tooling to eliminate manual operational work and reduce mean time to detect and resolve issues
  • Define and improve SLOs, alerting strategies, and reliability standards across the platform
  • Partner closely with software engineers to debug complex issues and drive reliability improvements back into system design
  • Improve deployment and release processes, including validation, rollback strategies, and production safety mechanisms
  • Expand and maintain runbooks and operational playbooks for critical systems
  • Serve as a real-time operational point of contact during market hours, providing clear and accurate communication during incidents
  • Build and maintain infrastructure and deployment automation using Ansible, Terraform, and custom scripts to ensure consistent, reliable releases

Who You Are

  • 5+ years of experience in Site Reliability Engineering, production engineering, or high-performance technical operations in trading or similarly demanding systems
  • Strong systems-level debugging skills across distributed services, networking, and Linux-based environments
  • Experience building and operating observability stacks (e.g., metrics, logs, tracing) and designing effective alerting
  • Proficient in Python or a similar language for automation and tooling; you build systems, not just respond to them
  • Solid SQL skills with the ability to investigate production datasets directly
  • Deep familiarity with FIX protocol and the realities of operating live trading systems
  • Strong understanding of trade lifecycle, market data flows, and equities/FX market structure
  • Comfortable owning production during high-pressure market hours and making sound, independent decisions
  • A track record of turning recurring issues into engineered solutions that eliminate entire classes of problems
  • Motivated by building reliable systems, not just reacting to failures

Compensation

$150,000 to $200,000, plus potential bonus and ownership interest

Job Details

Category: Software
Employment Type: Full Time
Location: New York, NY, US / London, England, GB / Amsterdam, North Holland, NL / Remote (US)
Posted: Apr 6, 2026, 05:40 PM
Listed: Apr 6, 2026, 05:40 PM
Compensation: $150,000 - $200,000 per year
