
Freya
Voice AI for Enterprises
About the Company
At Freya, we build voice AI call centers for regulated industries: banks, insurance, and enterprise call centers. We are backed by Y Combinator (S25) and have raised $3.5M from including investors like Y Combinator, our agents answer real customer calls in on-prem environments inside banks.
Tech Stack
About the Role You will own the backbone that runs thousands of concurrent AI-powered phone calls inside bank-grade on-premise environments. Not just keeping pods alive, you architect distributed systems that handle real-time voice, scale STT / LLM / TTS inference across customer GPU clusters, integrate with enterprise telephony (Cisco CUBE, Genesys, Asterisk), and deploy behind the firewalls of largest financial institutions. Your work decides whether our platform answers a bank's rush-hour traffic or leaves customers on dead air.
What You'll Do • Own on-prem deployments into OpenShift clusters inside banks. Helm charts, image registries, GPU allocation, CyberArk integration, SAML 2.0 / OIDC SSO. • Scale GPU inference infrastructure for our STT, TTS, and LLM models across multiple customer environments (H100 / H200, NVLink, Triton or vLLM). • Integrate with telephony: Asterisk, SIP trunks, Cisco CUBE, Genesys, WebRTC. SIP header parsing (X-Genesys-*), direction routing, warm transfers, DTMF. • Own reliability: Splunk SIEM forwarding, Langfuse and Grafana observability, incident playbooks for bank-grade 24/7 SLAs. • Security and compliance: RBAC, pentest remediation, KVKK and BDDK compliance patterns, pod security policies. • Scale with growth: we onboard a new bank or insurer every quarter. Each is a new on-prem environment with its own constraints. • Spot flaws early. We are building new architecture for a regulated industry. You help us see what needs to be solved next.
Interesting Problems to Own • On-prem meets streaming. Most voice AI stacks assume cloud. We run the same stack inside banks with zero internet egress. Novel problems in image delivery, model updates, secrets rotation. • Bank-scale concurrency. A single campaign can put millions of customers on the line the same afternoon. Queueing, graceful degradation, GPU-aware autoscaling are yours to design. • Legacy-meets-new telephony. Cisco CUBE, Asterisk, Genesys, SIP, WebRTC. You wrangle old-school protocols alongside modern streaming stacks.
What Makes You a Great Fit • 3+ years building and scaling distributed systems. Deep Kubernetes / OpenShift knowledge. AWS / GCP helpful. • Fundamentals plus. You can sketch how a SIP INVITE flows through a proxy, explain a K8s GPU scheduler, or tell us the obscure thing you fell asleep reading last night. • Real-time systems experience. Low-latency streaming or inference. Voice / video is a big plus. • On-prem mentality. You have shipped software into environments you did not fully control: Turkish bank, European healthcare, US regulated finance, anything similar. • Startup hats. You have worked where problems find you before process does. • Opinionated without alienating. Opinions drive progress, but you find compromises with customers and teammates. • Familiar with: Kubernetes / OpenShift, Helm, Docker, Terraform, NVIDIA GPU stack, Asterisk, SIP, Cisco CUBE, Genesys, Splunk, Grafana, CyberArk, Python or Go.
Bonus Points • Telephony systems (SIP, VoIP, WebRTC). • ML infrastructure, model serving, or GPU computing (Triton, vLLM, TensorRT). • Real-time audio processing. • Banking / fintech / BDDK and KVKK compliance familiarity. • Fluent Turkish or comfortable working with Turkish customer teams daily.
How to Apply Email [email protected] with a short note about the hardest infra problem you have shipped and your CV. We move fast, expect a reply within 48 hours.
Don't worry about the checklist. We hire for how you think, not how many boxes you tick. The work is hard, the hours are long, and most of what we ship nobody has built before. If that sounds good instead of scary, apply.
Founders
CEO and Co-founder of Freya. Previously, he built VLMs for geometric reasoning and worked on Audio Transformers as an AI research engineer. As a Math Olympian, ranked 1st among 250,000 students at the Urfodu Math Olympiad and later 203rd Turkey’s national exams out of 3.5M+. At 16, he wrote his first paper on Quantum Machine Learning. Before dropping out of UPenn, he was a Teaching Assistant in the Engineering School and part of an AI Research team at Penn.
Tomas is the Co-Founder and COO of Freya, where he is building human-like Voice AI agents for the financial services industry. A former Wharton student, he gained experience across multiple financial services firms, including Gallagher, the world’s second-largest insurance broker. Now on his second startup, Tomas is focused on transforming customer support by bringing it into the era of AI.
Open Positions at Freya (1 Jobs)
Ready to start your space career at Freya?