# Tianchen Cai > Senior Software Engineer at OneTrust Location: South San Francisco, California, United States Profile: https://flows.cv/tianchen Experienced Graduate Teaching Assistant with a demonstrated history of working in the higher education industry. Skilled in ICEM CFD, Numerical Simulation, Microsoft Word, Computer-Aided Design (CAD), and Paraview, Python, Matlab, and Machine Learning. Strong education professional with a Master's degree focused in Computational Science and Engineering from Georgia Institute of Technology. ## Work Experience ### Senior Software Engineer @ OneTrust Jan 2022 – Present | San Francisco Bay Area Core contributor to OneTrust's market-leading Data Discovery classification engine, building large-scale systems that power enterprise data discovery and compliance workflows. Led the implementation of Classification V2 using Java and Rust, significantly improving entity detection accuracy, throughput, and system robustness in production.Designed and productionized high-accuracy classification models through deep research, large-scale experimentation, and continuous validation. Selected to join OneTrust Labs, a CIO-sponsored AI Innovation team, to deliver the company's first LLM-powered AI agents for PIA and TPRM products.Played a key role in upgrading and extending the Assessment Auto-Completion RAG pipeline, substantially improving accuracy, reducing manual effort, and expanding the range and complexity of questions the AI agents can reliably answer. Evaluated and prototyped advanced retrieval and reasoning approaches, including early Graph-based RAG designs, to inform long-term AI platform strategy. Currently transitioning to the AI Platform team, contributing to the design of a foundational AI agent platform intended to support scalable, secure, and extensible AI-driven products across the company. Named inventor on two pending patents related to data discovery and classification technologies: US20240070319A1 - Methods and Systems for Detecting Entities Using Custom Classification Techniques US20250061154A1 - Generating probabilistic data structures for lookup tables in computer memory for multi-token searching ### Software Engineer @ OneTrust Jan 2022 – Jan 2022 | San Francisco Bay Area Work with PB level data and discovery meaningful terms --------------------------------- Maintain library and micro-services for Data Discovery core engine Design and implement many new and cool features to extend product functionality Maintain and develop high performance NLP models and services Keep enhancing the accuracy and speed of our product Continuously contribute to product support needs to make customers being happy Onboard and mentor new team members ### Senior Software Engineer/Data Engineer @ FIS Jan 2021 – Jan 2022 | San Francisco Bay Area Maintain and enhance our infrastructure of file ingestion system on AWS. Design and develop API endpoints monitors to provide abilities of alarming APIs' failures at runtime. Enhance and automate code testing strategies on Jenkins, Design and develop new data-pipeline with AWS, DynamoDB, Data brick Spark and so on. Design and develop RESTful APIs. Refactor codes to decouple complexity and improve readability and reusability. ### Software Engineer @ OneTrust Jan 2020 – Jan 2021 | Sandy Springs, Georgia, United States Worked for Data Discovery (Python project) as scanner team : 1. Develop and maintain data source connector micro-services by Flask. 2. Optimized the scanner to be able to handle large scale problem by using the coroutine, multi-thread and multi-processing. 3. Research, design and develop the OCR. 4. Designed load balancer for Kafka payload distributor to optimize the Kafka performance. • Worked for Integration Team. Seed and manage workflows for internal and external data source. • Working as terminator for product supporting. Diagnose technical issues for Kafka, RESTful API, DBs and so on. • Working for new Data Discovery Project (Integri's acquisition) as classification core team: 1. Develop , optimize and maintain Classification Core library in Java to allow classification happens for TBs data in short time. 2. Work with data science team to develop and deploy ML and deterministic models, in both Java and Python, to enhance our classification coverage. 3. Develop and maintain Restful APIs for multiple micro-services, with frameworks both Spring-boot(Java) and Flask(Python), to support operation of data discovery. 4. Design data models and data pipelines for new features. 6. Onboarding new team members. 5. Product support as technical export (sometimes I even perform as data analyst). ### Student of Computational Statistics @ Georgia Institute of Technology Jan 2019 – Jan 2020 | Atlanta • Optimization methods: linear regression and logistic regression using gradient descent and Newton’s Method • Parametric approaches: EM algorithm, GMM, HMM, linear model and generalized linear models, model selection and cross validation • Nonparametric approaches: PCA, Splines and approximation of functions, Bootstrap, Monte Carlo methods ### Graduate Teaching Assistant @ Georgia Institute of Technology Jan 2018 – Jan 2020 | Altanta 2018 Spring for AE 3125 Structure Analysis 2018 Fall For AE 4220 Aeroelasticity 2019 Spring for COE 2001 Statics 2019 Fall for AE 3610 Experiment of Fluid and Structural Mechanics ### Student of Machine Learning for Trading @ Georgia Institute of Technology Jan 2018 – Jan 2018 • Apply supervised learning algorithms to produce models that correctly classify future instances of the dataset • Optimize stock portfolio and created market simulator to test the performance • Develop Markov Decision Processes and applied q learning algorithms to develop best possible policies/solutions ### Student of Computational Problem Solving @ Georgia Institute of Technology Jan 2018 – Jan 2018 • Implementation of data structures and algorithms in C • Design the programs to solve physical problems such as Discrete Event simulator, Sorting and Searching in Graph and parallel programming. ### Student of Intro to High Performance Computing @ Georgia Institute of Technology Jan 2018 – Jan 2018 • Design and implement algorithms for parallel program with mpi • Estimate and optimize the efficiency of parallel program ### Graduate Research Assistant @ Georgia Institute of Technology Jan 2016 – Jan 2018 Focus on the computational combustion with large-eddy simulation Provide the product code to the sponsor ### Software Engineer Internship @ Agilysys Jan 2019 – Jan 2019 | Alpharetta, Georgia Chatbot for SMSService Integrated Azure Bot service into our SMS service. NLP model training. Data mining and processing. Designed RESTful API to handle the routing logic among Database, SMS Gateway, and Chatbot. Database management and stored procedures. Designed efficient algorithms. Asynchronous multithreading programming in C#. ### Graduate Research Assistant @ Computational combustion lab Jan 2016 – Jan 2018 | Georgia Tech • Numerical Modeling of Flame Acceleration in Hydrogen Exhaust (Solar Turbines Inc) • Model the flame acceleration cases in simple and complex domain by LES method • Model and generate the grid for simple and complex domain in ICEM • Researched and selected the chemistry mechanism for different conditions of combustion • Design a low NOx emission three stages jet-in cross flow (JICF) combustor • Model and validate the design via chemical reactor network (CRN) model ### Student Tutor @ Georgia Institute of Technology Jan 2014 – Jan 2014 Worked as certified undergrad tutor for most of Aerospace junior and senior undergrad courses. ## Education ### Master's degree in Computational Science and Engineering Georgia Institute of Technology Jan 2018 – Jan 2020 ### Master's degree in Combustion, CFD, Computational Combustion, Fluid Mechanics Georgia Institute of Technology Jan 2016 – Jan 2018 ### Bachelor's degree in Aerospace, Aeronautical and Astronautical Engineering Georgia Institute of Technology Jan 2011 – Jan 2015 ## Contact & Social - LinkedIn: https://linkedin.com/in/tianchen-cai-b7647590 --- Source: https://flows.cv/tianchen JSON Resume: https://flows.cv/tianchen/resume.json Last updated: 2026-03-23