Location: Toronto, Canada   |   Full-Time
Tags: C++, C, CUDA, Python, GPU Optimization, Performance, HPC, AI Engineer, Back End Engineer
Company: We are a Toronto-based GPU optimization startup operating in stealth mode. We are building the foundational frameworks and tools needed to unlock the full potential of the massive GPU compute power being deployed globally.

Role: We are expanding our core engineering team and seeking experienced GPU Optimization Engineers. You will play a key role in designing, developing, and optimizing high-performance software that leverages GPU architectures.

Responsibilities:
- Design and implement cutting-edge GPU optimization techniques.
- Develop and maintain frameworks and tools for GPU performance analysis and enhancement.
- Write highly optimized C/C++, CUDA, and Python code.
- Collaborate with the team to push the boundaries of GPU utilization.

Technical Skills Required:
- Strong proficiency in C/C++ and CUDA programming.
- Experience with Python for scripting and tooling.
- Solid understanding of modern GPU architectures (NVIDIA, AMD, etc.).
- Experience with performance analysis and debugging tools for GPUs.

Ideal Candidate:
- Passionate about low-level optimization and high-performance computing.
- Proven experience in delivering optimized GPU code.
- Ability to work effectively in a fast-paced startup environment.

Interview Process:
Our interview process is straightforward and respects your time. Expect an initial ~30-minute chat about your relevant experience. We might ask to see code samples you've written previously. There are no take-home assignments. Promising candidates may be invited for a paid 1-2 week trial period to work directly with the team.

Location & Type: Full-time role based in Toronto, Canada, with hybrid or onsite work options.
Post Date: April 21, 2025