•Implemented a microservice to automatically create, segment, clean and organize data from multiple client facing sources and collate into an organized database structure.
•Wrote backend scripts to pipeline the database data using Apache Spark into machine learning prediction models for different types of sources directly affecting client facing services.
•Formulated prediction models for different metrics such as Ecommerce sales, cashflow, etc. using XGBoost, skLearn and RandomForestRegressor libraries.
•Used Docker containers to deploy the prediction models onto real-time client services that improved user experience and retention inside the product.