•Designed and implemented cloud architecture that supports running Stratifyd’s production web application, machine learning micro-services and system monitoring components.
•Designed and templated cloud infrastructure from scratch using Terraform and Packer. Reduced manual effort for cluster preparation time from 7 days to hours.
•Designed and implemented monitoring framework for data analytics system. Framework is used to diagnosed system error, traced service request latency and ML training progress.
•Developed and maintained tools and workflows to automate cluster scaling, server-patching, secrets management.
•Designed and customized on-premise cloud infrastructure for client with different use cases.
•Data Integration and ETL Service
•Developed web service based on Tornado to retrieve and process multi-schema data from multiple sources concurrently, including AWS S3, SFTP and 3rd party APIs.
•Designed and implemented Speech to Text audio data transcription parallel processing pipeline using SQS queue with micro-services of machine learning models.
•Redesigned and implemented the CI-CD pipeline responsible for building and delivering all agile development code 24/7 to test and UAT environments. Facilitated 50% in end-to-end product development lifecycle.
•Improved performance of container build process by parallel processing jobs, adding compliance security scanning and runtime-only application certificate retrieval.
•Database Configuration and Migration
•Configured PostgreSQL, MongoDB to support multi-tier environments with high availability. Reduced 30% connection loss and ensured zero downtime. Performed database scale-out from single point server to replica set while ensuring data accessibility and consistency.
•Migrated terabyte size production database cross region from US to Europa for existing customers.