# Chetas Joshi

> Data & AI @Robinhood

Location: San Francisco, California, United States

Profile: https://flows.cv/chetas

■ Areas of interest: Distributed Systems, Big Data, Machine Learning and Pattern Recognition, Statistical Analysis, Signal and Image Processing

■ Data Engineer at Oracle (Bluekai)

- Developed a near-real-time data processing pipeline for the Oracle Data Cloud using open-source technologies such as Apache Spark, Solr, Parquet, and Hadoop MapReduce.
- Wrote Chef cookbooks for setting up big data processing and storage environments.
- Led the "logging and monitoring" project: built the ELK (Elasticsearch, Logstash, Kibana) stack, wrote the logging and metrics collection library, and deployed an alerting mechanism (ElastAlert) on top of Elasticsearch.

■ Purdue University alum (Graduate school)

- Degree: Master of Science (Thesis) in Electrical and Computer Engineering.
- Lab: Purdue Neurotrauma Group
- Thesis title: Detection of mTBI (mild Traumatic Brain Injury) by observing Cerebrovascular Reactivity Alterations in Asymptomatic High-School Football Players
- Outcome: Observed changes in brain function due to repetitive hit exposures by processing and analyzing fMRI data acquired during the breath-hold task

■ Indian Institute of Technology alum (Undergraduate degree)

- Bachelor of Technology in Electrical Engineering with a minor in Computer Science

## Work Experience

### Software Engineer @ Robinhood

Jan 2024 – Present

### Founding Engineer @ Stealth Startup

Jan 2024 – Jan 2024 | San Francisco, California, United States

### Software Engineer @ Airtable

Jan 2021 – Jan 2024 | San Francisco, California, United States

As part of the Data infra team, played a key role in democratizing the development of Big Data applications through a self-serve low-code framework.

AWS stack: S3, Kinesis, Redshift, Glue, EMR, ECS, DynamoDB, EKS, CodeBuild

### Software Engineer @ Rubrik, Inc.
Jan 2018 – Jan 2021 | Palo Alto, California

Data infra & Observability

* ETL, data warehousing, data modeling, data analytics
* Led the development of a self-serve ETL framework that imports data from various data stores (SQL, NoSQL, S3) into the data warehouse (Snowflake) and democratized data analytics
* Drove Rubrik's top- and bottom-line growth by providing product insights
* Played a key role in setting up the real-time ingestion pipelines for telemetry data (logs, metrics/stats, traces); helped build an alerts framework
* Proactive support, anomaly detection

Cloud Services: AWS S3, Lambda, SQS, EMR, ECS, Redshift, Snowflake, Elasticsearch, MSK (Kafka)

Programming languages: Python, Scala, Golang

### Senior Software Engineer (Oracle Data Cloud) @ Oracle

Jan 2017 – Jan 2017 | Cupertino, California

Technologies: Apache Spark, Spark Streaming + Kafka, Solr, Hadoop, MapReduce, Pig, ELK (Elasticsearch, Logstash, Kibana) stack, ElastAlert, Chef

Programming languages: Scala, Java

### Software Engineer (Oracle Data Cloud) @ Oracle

Jan 2015 – Jan 2017 | Cupertino, California

Technologies: Apache Spark, Spark Streaming + Kafka, Solr, Hadoop, MapReduce, Pig, ELK (Elasticsearch, Logstash, Kibana) stack, ElastAlert, Chef

Programming languages: Scala, Java

### Graduate Teaching Assistant @ Purdue University

Jan 2014 – Jan 2015 | Electrical and Computer Engineering Department

Course ECE255: Electronic Design and Analysis

### Researcher @ Purdue University

Jan 2014 – Jan 2015 | Purdue Neurotrauma Group

Cerebrovascular reactivity alterations due to repetitive sub-concussive head trauma in asymptomatic high school football players

### Graduate Teaching Assistant @ Purdue University

Jan 2014 – Jan 2014 | Computer Science department

Course CS252: Systems Programming

### Tutor @ Purdue University

Jan 2013 – Jan 2013 | Science Bound

Tutored undergraduate students at Purdue University in three courses: (1) Linear Circuit Analysis, (2) Python Programming, and (3) Electronic Systems

### Software Engineer @ Sears Holdings Corporation

Jan 2014 – Jan 2014 | Hoffman Estates, Illinois

Project: Predictive Analytics using Machine Learning

Goal: Develop an automated process to predict the daily call volume at the call centers

Usage: Workforce Management and Staff Training

1. Wrote JavaScript to mine data from MongoDB (a NoSQL database) using MapReduce and aggregation techniques
2. Identified parameters that affect the call volume
3. Given the past information available in the database, decided to go with two parameters: Reason Code (people call the call centers for different reasons) and Season
4. Used R (a free statistical computing tool) to perform time series analysis (regression) for each Reason Code
5. Wrote Java code to run the R script so that it could be deployed to production

Result:

1. Achieved good prediction accuracy (% variance: less than 5%) except on dates that had promotional events associated with them
2. Suggested storing the data related to promotional events
3. Suggested storing data at the item level so that all the parameters affecting the call volume can be used and predictions can be made more accurate

### Research Assistant @ Singapore Institute for Neurotechnology (SINAPSE)

Jan 2012 – Jan 2012 | Singapore

Project: Determining the optimal window length for pattern recognition (PR)-based myoelectric control: balancing the competing effects of classification error and controller delay

Goal: Identify the highest classification accuracy achievable within the controller delay limit

Usage: Prosthetic hand

Used a PR technique (LDA) in MATLAB to identify upper limb movements from EMG data for a set of movements, keeping the controller delay in mind

Result: Found that higher accuracy can be achieved even at low latency (controller delay) if fewer upper limb movements are considered

### Trainee @ Essar Steel

Jan 2011 – Jan 2011 | Surat

Project: "Study of Power distribution at the CSP (Compact Strip Production) Plant"

## Education

### Master of Science (MS) in Electrical and Computer Engineering

Purdue University

### Bachelor of Technology (B.Tech.) in Electrical, Electronics and Communications Engineering

Indian Institute of Technology Gandhinagar

### SSC (Secondary School Certificate)

Smt. V.D. Desai secondary school

## Contact & Social

- LinkedIn: https://linkedin.com/in/chetas-joshi

---

Source: https://flows.cv/chetas

JSON Resume: https://flows.cv/chetas/resume.json

Last updated: 2026-04-11