# Siavash Norouzi

> HPC & ML at Roche Sequencing

Location: San Francisco Bay Area, United States
Profile: https://flows.cv/siavash

I’m a software engineer with a deep focus on CUDA development and CUDA kernel optimization for deep learning inference, ML infra performance optimization, ML backbone, high-performance computing, embedded platforms, and GPU-accelerated pipelines. My work spans engineering in computer vision, robotics, scientific imaging, and automotive, sitting at the intersection of hardware, high-performance code, and product design.

I enjoy working where performance, hardware constraints, and practical product needs converge, and where fast feedback and full-stack responsibility drive the work forward.

## Work Experience

### Principal GPU Software Engineer @ Roche
Jan 2026 – Present | Santa Clara, California, United States

### Founder & Consulting Engineer @ Ronais Software LLC
Jan 2023 – Present

- Re-architected a legacy C++/Qt application into a scalable MVVM-based design with a full QML frontend, enabling advanced UI logic, modularity, and long-term extensibility
- Designed and implemented real-time telemetry and image publishing interfaces for seamless integration with a Python-based scientific alignment and control system
- Developed high-performance centroiding and filtering algorithms, including gamma correction and iteratively-weighted center of mass (CoM) refinement for precise spot tracking
- Delivered tools such as real-time histogramming, CSV-based logging, and robust configuration preset management
- Enabled long-term automation by decoupling GUI from backend logic and preparing the pipeline for remote command/control
- Provided production-grade code and documentation to support critical alignment milestones and prepare the software stack for future multi-camera scalability

### Senior Software Engineer @ Vayu Robotics
Jan 2024 – Jan 2026 | Mountain View, California, United States

- Architected and developed a modular perception Software Development Kit (SDK) for autonomous systems deployed on Jetson Orin, integrating 8-camera capture (V4L2) with TensorRT inference pipelines for real-time computer vision (depth, disparity, polarization).
- Wrote and optimized CUDA kernels for depth processing, spatial filtering (with shared memory), and tensor layout conversion as part of a real-time inference pipeline on Jetson Orin; combined with stream-level parallelism and memory overlap to reduce preprocessing latency by 47% and boost throughput by 85%.
- Improved backend throughput and stability by overlapping data capture, preprocessing, and TensorRT inference using CUDA APIs.
- Built a real-time point cloud visualizer using Qt and OpenGL, transforming a minimal graphics example into a fully live 3D viewer synchronized with RGB and depth maps.
- Implemented an interactive pixel-level depth prober with <1-frame latency, allowing users to hover over RGB or depth views to query accurate depth in real time.
- Delivered robust, real-time solutions under high-reliability constraints for clients in various industries.

### Software Engineer @ Lucid Motors
Jan 2021 – Jan 2024 | Newark, California, United States

#### Signal Stack & System Boot Optimization

- Sole maintainer of the infotainment Qt signal stack
- Co-designed and co-implemented the Suspend-to-RAM (STR) signal-stack initiative:
  - Developed persistence and re-initialization logic for infotainment signals
  - Coordinated across platform, signal, and UI layers to reduce boot time from ≈40 s to <3 s fleet-wide
- Implemented multiplex PDU support for Lucid Air and Lucid Gravity

#### Qt/QML Cluster Development

- Developed cross-platform (Mac/QNX) C++/Qt software for Lucid Air instrument cluster
- Diagnosed critical Qt3D rendering bugs on QNX

#### Unreal Engine Integration

- First engineer to onboard Unreal Engine on infotainment ECU (QNX)
- Built UE plugin to visualize ADAS perception data via RTI DDS
- Integrated IPC for dynamic visibility control in the Drive Assist 3D application
- Contributed to onboarding documentation used by other developers
- Investigated QNX High Availability Manager (HAM) to improve runtime stability of the UE-based visualization system

### Software Engineer, Magdalena Ridge Observatory Interferometer (MROI) @ New Mexico Tech
Jan 2019 – Jan 2021 | Socorro, New Mexico

In this role, I was responsible for the design and development of control software for distributed systems, focusing on the Automated Alignment System for the interferometer. I assessed needs and formulated the system requirements with the system designer and principal software engineer. My work spanned data collection methods (temperature sensors, cameras, quad cells), the development of efficient data processing algorithms, and the software for system control using a variety of communication protocols. Notable achievements include:

- I conceptualized and created a proof of concept for the integration of Qt/C++ applications with the central Interferometer Supervisory System.
- I developed a C++/Qt application for a back-end stability camera, interfaced via CameraLink over the network, with a GUI that uses OpenCV and the Eigen matrix library to calibrate the beam relay system. This tool is critical in addressing the stability of the system. As part of the same project, I also trained and implemented two deep convolutional neural networks, DnCNN and U-Net, in C++ using PyTorch to assist with wavefront reconstruction and measurement within the software, allowing for more accurate measurements in the presence of downscaling and atmospheric turbulence.
- I delivered a Java application to operate the LabJack units controlling the power input for laser and filament light injection systems.

### Report Developer, Information Technology and Communications Department (ITC) @ New Mexico Tech
Jan 2017 – Jan 2021

In this role, I supported internal and external projects with reporting tool development, requirement elicitation, and other tasks as assigned. My work involved creating reporting tools in SQL, delivered with user-friendly dashboards in Argos, for various administrative departments including the Admissions Office, Residential Life, Graduate Office, and Business Office. Notable achievements include:

- I migrated key reports from Microsoft Access to SQL on the Argos platform, reducing processing time from 10 minutes to 5 seconds in most cases.
- I supported a team development initiative for an SQL-parsing C/C# Windows application for managing an extensive database of object-level permissions.
- I was a key contributor to a team effort to develop an SQL-based application and comparison tool in Python to synchronize the Institute's database with the learning management system to allow for better management of student online evaluations.

## Education

### Master of Science in Computer Science
Thesis focused on Machine Learning
New Mexico Institute of Mining and Technology

### Master of Science in Mineral Engineering
Thesis focused on Numerical Modeling
New Mexico Institute of Mining and Technology

### Bachelor of Science in Mining Engineering
Project focused on Numerical Modeling
Isfahan University of Technology

## Contact & Social

- LinkedIn: https://linkedin.com/in/siavash-norouzi
- GitHub: https://github.com/siavashnorouzi

---

Source: https://flows.cv/siavash
JSON Resume: https://flows.cv/siavash/resume.json
Last updated: 2026-03-29