
Job Description
Languages: Python (PyTorch), JavaScript (TypeScript)
Listen and try Sonauto: https://sonauto.ai/
About the role:
We're training state of the art generative models in a completely new modality where the standard of quality is significantly higher than those that came before (hands with three fingers is fine in images, but miss a beat in a song and it's ruined). Accordingly, we are consistently implementing new generative architectures and improving current ones.
Right now we're doing distributed training at the scale of hundreds of H100s on diffusion models, GANs, language models, and more written in PyTorch.
You will lead research efforts for improving the song and audio quality of our generative music models, along with productizing the results.
We’re looking for people with:
- Extensive experience with PyTorch training generative models like diffusion models, GANs, and language models. Work with open-source orgs/projects counts too.
- Experience with JavaScript (our frontend is React/NextJS) and backend development are also important to us.
- Experience with audio models is a huge plus.
- Many of our biggest improvements wouldn’t have been possible without extremely close listening, so being an audiophile and/or having music production experience are also huge pluses.
Optimize Your Resume for This Job
Get a match score and see exactly which keywords you're missing
Job Details
- Location
- San Francisco, CA, US
- Posted
- Mar 24, 2026, 04:29 PM
- Listed
- Mar 24, 2026, 04:29 PM
- Compensation
- $120,000 - $230,000 per year
About Sonauto
Part of the growing space & AI ecosystem pushing the frontiers of technology.
More Roles at Sonauto
Found this role interesting?