Our SCALE compiler allows unmodified CUDA code to run natively on AMD GPUs.
So far, I've worked on:
• New GPU compiler optimisations
• Parser fixes to support quirks of NVIDIA's compiler, nvcc
• Better instruction selection for the AMDGPU backend to fully utilise the hardware's capabilities
• Maths library: expanding API coverage, improving accuracy, and fixing bugs