Pioneered a scalable AWS serverless platform, enhancing data scientists' capabilities to build, train, and deploy machine learning models with ease.
Experience
2020 — Now
United States
Designed and implemented a scalable serverless platform on AWS Lambda using Node.js/TypeScript and MongoDB to help researchers generate reports automatically.
Responsible for architecture design, product demos, and mentoring.
Implemented backend APIs with AWS services. (AWS Lambda, DocumentDB, Elasticsearch, SQS, Cognito)
Built a scalable machine learning platform on AWS SageMaker that lets the data science team easily build, train, and deploy ML models. Analyzed report data using Python and Jupyter. (Python, Jupyter Notebooks)
Performed full DevOps duties: built a CI/CD pipeline from scratch and integrated it with automated tests.
Implemented a microservice cache system with Kubernetes on AWS to reduce deep API calls. (Docker/EKS/TypeScript/DynamoDB/Kafka)
2018 — 2019
Developed software for an internal agriculture planning and management platform that generated reports to help growers maximize the yield of every square inch of their fields. Used Node.js, React, Kafka, Docker, AWS EKS, RESTful APIs, AWS Lambda functions, and PostgreSQL/MongoDB.
Determined the architecture and technical stack for the platform.
Decreased expenses by 30% by building multiple workflows for different users in one application.
Reduced the time required to collect data and generate reports from 2 weeks to less than 15 minutes.
Handled over 7,000 fields in 11 countries around the world and managed over 200K rows of data. (Java, React, AWS)
Gathered requirements and interfaced directly with internal teams and users. Collected data from 6 upstream apps and passed data between apps using RESTful APIs and Kafka messages. Implemented microservices and serverless functions on AWS using EC2, S3, PostgreSQL, and Lambda. (REST API, Kafka, AWS)
Implemented the UI using React.js, Redux, and Material UI with JavaScript, handling tables of over 15,000 rows on the frontend.
Performed full DevOps duties, deploying changes to production every week. (Jenkins, Cloud Foundry)
2017 — 2018
Greater St. Louis Area
Developed WordPress theme and plugins for multiple sites, including domino.com, fashionbeans.com, and healthyway.com.
Worked on the product management suite and the SEO management suite.
Imported data from a legacy CMS into WordPress for domino.com.
Worked on Healthyway Collective features
2017 — 2017
Greater St. Louis Area
Independently developed the Arch application to provide inbound/outbound and ICQA data for Amazon Fulfillment Centers STL6/7. Arch was built on the Ruby on Rails MVC web framework and implemented a RESTful API. (Ruby on Rails)
Arch has multiple data sources. Used the Rails ‘oci8’ gem to read data from multiple Amazon databases and stored the data in AWS. Used a Rails web scraper to collect data from 8 different internal webpages, and VBA to automatically collect data from Amazon Excel tables. (Rails, MySQL)
Used Rails to update the database every 15 minutes and Ajax and JavaScript to refresh the webpages automatically. The application is used by over 20 managers and has greatly improved their working efficiency.
2016 — 2016
Greater St. Louis Area
Responsible for implementing a machine learning algorithm to analyze patient location data on EMR Spark.
Scrubbed semi-structured and unstructured data into a standardized format and stored each patient as an object in a Spark RDD.
Implemented K-Means clustering to group the closest patients together and determined the best number of clusters. Optimized the algorithm with simulated annealing, improving the model’s accuracy from 65% to 90%. (Python)
Stored clusters in a priority queue and used Spark RDD persistence, reducing running time by 25%.
Analyzed 4G data and implemented a visualization that converts text data to points on a map using Spark GraphX.
Education
Washington University in St. Louis
Master's degree
太原理工大学 Taiyuan University of Technology