What You’ll Do
- Build and maintain Codex, our internal research and experimentation framework
- Design abstractions for testing GPU scheduling, orchestration, and optimization ideas
- Implement simulation and replay environments for AI workloads
- Enable rapid experimentation across hardware, software, and workload configurations
- Work with research scientists to operationalize experimental models
- Ensure Codex outputs are reproducible, measurable, and scalable
- Integrate Codex insights into production systems and customer-facing platforms
- Maintain performance, reliability, and extensibility of the Codex platform
You’ll Thrive Here if You
- Have 5+ years of experience in systems engineering or platform development
- Possess strong software design skills and systems thinking
- Have experience building internal tooling or experimentation frameworks
- Are comfortable owning complex, foundational infrastructure
- Demonstrate strong collaboration skills across research and engineering teams
Bonus Qualifications
- Experience with simulation, emulation, or workload replay systems
- Familiarity with GPU scheduling or orchestration
- Experience designing APIs for internal platforms
- Background in performance modeling or systems research
Why This Role is Unique
You will build the core engine that turns ideas into measurable insights, acting as the backbone of Zettabyte’s GPU research and optimization capabilities.
Details
- Competitive salary and equity based on experience and skill set
- Flexible work environment
- Applicants must be authorized to work in their respective location