• Building Uber's globally distributed transactional datastore. This was Uber's home-grown Spanner like db using open source technology like RAFT consensus and RocksDb
• Big data and storage infrastructure for self driving cars
Self driving cars rely on large scale ML training based on log data, which tend to be an extremely large number of very small sized files. Traditional storage systems like PG SQL or on-prem tech like parallel file systems have notoriously problematic scaling and perf challenges. I helped build a cloud-native parallel file systems that was built on commodity object storage targeted for this use case. This kind of tech has since then become a staple in the cloud provider itself (see AWS EFS offering)