
Compensation
$6,000–$10,000/month
Job Description
Join our team to build the future of inference, GPU optimization and AI infrastructure. You'll work directly with the team to define our technical direction and build the core systems that power our GPU optimization platform.
What You'll Do
- Build scalable infrastructure for AI model training and inference
- Lead technical decisions and architecture choices
What We Look For
Core Technical Expertise
- GPU Fundamentals: Deep understanding of GPU architectures, CUDA programming, and parallel computing patterns.
- Deep Learning Frameworks: Proficiency in PyTorch, TensorFlow, or JAX, particularly for GPU-accelerated workloads.
- LLM/AI Knowledge: Strong grounding in large language models (training, fine-tuning, prompting, evaluation).
- Systems Engineering: Proficiency in C++, Python, and possibly Rust/Go for building tooling around CUDA.
Ideal Background
- Publications or open-source contributions in inference GPU computing or ML/AI for code are a plus.
- Hands-on experience with large-scale experiments, benchmarking, and performance tuning.
Optimize Your Resume for This Job
Get a match score and see exactly which keywords you're missing
Job Details
- Category
- Mechanical
- Employment Type
- Internship
- Location
- San Francisco, CA
- Posted
- Compensation
- $6,000 - $10,000 per month
About Wafer
Wafer provides product design, cloud, IT infrastructure, web development, and software consulting services.
More Roles at Wafer
Similar Mechanical Roles

OpenAI
Mechanical
Technical Deployment Lead - Sydney
Sydney, Australia Full Time
4 hours ago
Mechanical
4 hours ago
Isar Aerospace
MechanicalStructures
Working Student, LV Structures (System Integration)
Ottobrunn, Germany Full Time
14 hours ago
MechanicalStructures
14 hours ago
Thales Alenia Space
Mechanical
Digital Learning Manager Graphiste Radar Academy F/H
Limours, France Full Time
1 day ago
Mechanical
1 day agoFound this role interesting?