Senior Technical Engineer, AI SW Development (ROCm), Advanced Micro Devices

Apply for this job

Email *
Executive Name *

Job Description

Advanced Micro Devices is searching for a Senior Technical Engineer AI SW Development (ROCm) who will create and improve system-level software and runtime software that supports Physical AI workloads. The position requires research development of ROCm and HIP runtime systems plus optimization work on AI frameworks and robotics support development and performance testing across AMD GPU and NPU devices.

Qualification: Bachelor’s or Master’s Degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Apply: Apply Now

Main Duties

  • Create and develop new ROCm/HIP runtime features to support advanced AI and Physical AI workloads across AMD accelerators.
  • Optimize AI software stacks including PyTorch, ONNX Runtime, vLLM, and other open-source frameworks for improved performance.
  • Enable AI acceleration on GPUs and NPUs through system bring-up and platform optimization.
  • Develop high-performance kernels and contribute to compiler and runtime enhancements.
  • Design and implement system software supporting robotics, perception, navigation, and embodied AI models.
  • Optimize real-time inference pipelines, sensor fusion systems, and control loops running on AMD hardware.
  • Conduct profiling, tuning, and bottleneck analysis for AI workloads to improve runtime efficiency.
  • Develop debugging tools and performance instrumentation across AI system software layers.
  • Collaborate with hardware, compiler, ML framework, robotics, and embedded Linux teams to drive technical solutions.

Essential Qualifications:

  • Bachelor’s degree or Master’s degree in Computer Science or Engineering or a related field. 
  • Demonstrate strong C/C++ skills together with experience in developing system-level software. 
  • The candidate must have experience in GPU/NPU compute programming using either HIP, CUDA or OpenCL. 
  • Deep knowledge of AI/ML inference together with quantization, kernel optimization and runtime framework systems. 
  • Demonstrates extensive understanding of Linux systems together with debugging tools and profilers and Git. 
  • Experience using AI software stacks within real-world production environments.

Preferred Qualifications:

  • Knowledge of the ROCm ecosystem together with HIP runtime internals and MLIR and compilers and driver-level development.
  • Understand robotics, perception systems, multimodal AI models, and physical AI workloads.
  • Experience with embedded Linux systems and containers and edge AI inference platforms.
  • Strong problem-solving skills which enable him/her to solve complex technical challenges without assistance.
  • Possesses excellent skills for both communication and collaboration with different teams.