# Shang Liu > Software Development Engineer at Workday Location: Greater Boston, United States Profile: https://flows.cv/shang Experienced Software Development Engineer with 8 years in building scalable, high-performance systems, specializing in observability and reliability. Proven leader in designing complex platform components, driving cross-functional initiatives, and mentoring teams. Skilled at architecting solutions that improve system health monitoring, streamline onboarding, and enhance global service observability across cloud- native environments. ## Work Experience ### Software Development Engineer, distributed system, Observability @ Workday Jan 2024 – Present | 美国 马萨诸塞州 波士顿 • Architected and delivered a scalable dynamic endpoint-discovery component for the ping monitoring platform, enabling robust, high-availability health checks across all Workday customer tenants endpoints. • Drove the platform’s multi-region rollout, coordinating deployment, configuration, and validation to establish a second AWS region and improve global observability accuracy by removing cross- region network noises. • Architected and built Workday’s internal service-ping infrastructure, establishing a Tekton-based self-service onboarding pipeline that streamlined adoption across teams. • Led planning, PoC, and delivery of the Fed IL4 of the ping service. Collaborated cross- functionally with Security, Compliance, and Federal teams to meet federal requirements and elevate platform security posture. • Designed, implemented, and deployed a Fluent Bit–based logging pipeline to ingest service logs into Kibana, establishing the standard logging pattern and delivering cross-team knowledge transfer adopted across the Observability organization. • Resolved core conflicts between Watchdog and the customized Prometheus Operator, enabling reliable platform-level health monitoring for the ping service and improving system stability. • Designed and delivered a new onboarding workflow for newly acquired Workday services, enabling them to integrate with the ping platform for tenant-level SLA calculation. • Refined SRE alerting logic to eliminate false signals caused by manual operations during major incidents, improving alert accuracy and ensuring more reliable MTTR pipeline calculations. • On-call rotation responsibility for diagnosing false alerts and maintaining real-time service health dashboard accuracy. ### Software Development Engineer III @ TraceLink Jan 2022 – Jan 2023 | 美国 • Initiated and built new user admin authorization authentication RESTful microservice application with use of bearer token OAuth2.0 and OPA policy. • Initiated and built new Enterprise and multi-Enterprise access privilege Administrator RESTful microservice application based on Node.js, SSO, NoSQL and Kafka. • Built across team functionalities including streaming user notification, workflow transactions, file upload bulk data operations. • Built, deployed, and maintained Administration services CI/CD pipline during the early stage of the new Kubernetes based Tracelink Opus platform. • Providing both data driven and customized GraphQL derived from REST APIs and collaborated with UI/UX team’s React Component for user UI experience. • Monitor and improve performance with use tools of Grafana, Weave scope and Kibana. ### Software Development Engineer II @ TraceLink Jan 2021 – Jan 2023 ### Software Development Engineer @ TraceLink Jan 2019 – Jan 2021 • Implement building EU Compliance Government Reporting service for pharmaceutical manufacturer customers to comply European GDPR 2018 regulatory based on Java, SOAP, docker, AWS SNS. • Implement building Russia Compliance Reporting microservice based on REST, Java. ### Software Development Engineer in Test II - Automation @ TraceLink Jan 2019 – Jan 2019 | 美国 • Initiated new automated test framework by using Karate for a new micro service structure project. • Developed TestNG framework and automation tests for SOAP, REST and SFTP protocols data transactions. • Developed baseline automated project for large manufacturing customers to conform GDPR regulatory with third-party European Compliance certification systems (EMVS/NMVS). • Deployed, integrated, and maintained microservices to test environment including SNS and Docker Swarm. • Collaborated with product manager and developer manager for team road map planning. • Responsible for release sign off and daily sprint plan assignments. ### Software Development Engineer in Test - Automation @ TraceLink Jan 2017 – Jan 2019 ### Software Developer Intern @ Pearson Jan 2015 – Jan 2015 | Greater Boston Area Developed an application of content editor to optimize workflow among interpreters and developers by implement Atlassian Stash Rest API. Developed basic adaptive learning demo according to the Knewton API and followed team for further full product release in Agile environment. Resolve part of code accuracy issue on Pearson Realize product reported by SonarQube. Developed web video call application by WebRTC during Pearson Hackathon. ### Data Analysis Intern @ renren.com Jan 2013 – Jan 2013 | Beijing City, China Analysis of the data changes and forecasted the affected result according to the abnormal data. Made optimal advertising according to the media life cycle and media attributes. ## Education ### Master's Degree in Information Systems Northeastern University ### Bachelor's Degree in E-Commerce/Electronic Commerce Beijing University of Posts and Telecommunications ### Bachelor’s Degree in E-Commerce/Electronic Commerce Queen Mary University of London ## Contact & Social - LinkedIn: https://linkedin.com/in/shangliu3 --- Source: https://flows.cv/shang JSON Resume: https://flows.cv/shang/resume.json Last updated: 2026-03-28