Strong C/C++ programming skills
Experience with compiler internals (LLVM, GCC, or any other)
Basic Python programming skills
Experience in performance analysis
Basic understanding of ML technologies
Experience with GPGPU (general-purpose GPU) computing (HIP, CUDA, OpenCL, etc.)
Experience with LLVM and MLIR compiler infrastructure, including implementing analyses or optimizations
Knowledge of ROCm infrastructure
Experience with CMake and make/Ninja build systems
Understanding of GEMM performance fundamentals