• Scaling and Parallelizing Applications across multi-core CPUs, multi-SP GPUs, and special accelerators utilizing OneAPI.
• LLVM/ICC Compiler Analysis and Performance Optimization:
Loop Optimizations (LoopOpt), Inter-procedural optimizations (IPO), Vectorization, etc.
• Intel Architecture (IA) Performance Leadership
• Parallel Programming and Application Acceleration across homogeneous and heterogeneous
architectures
• Path-finding Activities