# Saurabh Singh

> Sr SDE at AWS Bedrock

Location: Seattle, Washington, United States

Profile: https://flows.cv/saurabhsingh

Software Development Engineer II at Amazon. Currently working on Alexa Assist projects, focusing on backend development with hands-on frontend work in React. Computer Science graduate from Stony Brook University.

## Work Experience

### Senior Software Development Engineer @ Amazon Web Services (AWS)
Jan 2025 – Present

### Software Engineer II @ Amazon Web Services (AWS)
Jan 2024 – Jan 2025 | Seattle, WA

- Control Plane Architecture: Engineered a high-throughput control plane for LLM fine-tuning (Anthropic Claude 3 Haiku), using Java and Kubernetes to orchestrate complex training lifecycles across thousands of nodes.
- GPU Orchestration & Reliability: Architected a highly scalable platform for GPU compute management; developed custom health-check and session-monitoring tools to ensure 99.9% uptime for long-running training jobs.
- Observability & Throughput: Built comprehensive observability stacks for GPU workloads using internal AWS telemetry, identifying bottlenecks that led to a 25% increase in compute efficiency.
- Production Optimization: Developed a high-performance model-loading mechanism to streamline the transition of fine-tuned weights into Amazon Bedrock production environments, significantly reducing Time-to-First-Token (TTFT).
- Technical Leadership: Mentored 4+ SDEs on distributed systems patterns and fault-tolerant design, resulting in a more resilient GPU utilization pipeline and improved team velocity.
- Business Impact: Directly enabled multi-million dollar revenue streams by delivering customized Anthropic model fine-tuning solutions for strategic enterprise clients such as SK Telecom (SKT).

### Software Development Engineer II @ Amazon
Jan 2021 – Jan 2023 | Seattle, Washington, United States

Alexa Comms: Worked on improving the experience of millions of Alexa users by adding features that let them get help in emergency and non-emergency situations.

Roadside Assistance: A service that connects customers with an agent when they say "Alexa, call roadside assistance"; the agent assesses the situation and dispatches help to the shared location. I was involved in designing the service's critical components and in the end-to-end implementation. I was also responsible for upholding Operational Excellence principles across the cloud technologies used and for following sound software development practices.

### Software Development Engineer @ Amazon
Jan 2020 – Jan 2021 | Seattle, Washington

1. Emergency Contacts
   - Worked on a project that lets customers set up Emergency Contacts through the Alexa app and call them with utterances such as "Alexa, call my emergency contact".
2. Emergency Helpline
   - Worked on building an Emergency Helpline that lets customers call for help in an emergency. Customers can ask Alexa for help, and our external partners can dispatch help to the customer's location. The feature is available as GuardPlus on Amazon: https://www.amazon.com/b?ie=UTF8&node=18021383011

### Graduate Teaching Assistant @ Stony Brook University
Jan 2019 – Jan 2019 | Stony Brook, New York

Graduate Teaching Assistant for Object Oriented Programming

### Software Development Engineer Intern @ Audible, Inc.
Jan 2019 – Jan 2019 | Newark

Audible Content License Microservice

- Developed a microservice that serves content licenses and metadata used for downloading and playing audiobooks.
- Redesigned the architecture to move away from a monolithic legacy service, using AWS ECS Fargate, DynamoDB, and S3. Followed an Infrastructure as Code approach, writing AWS CloudFormation templates.
- Worked on the Audible iOS app, written in Swift and Objective-C, to consume the microservice APIs.

### Graduate Teaching Assistant @ Stony Brook University
Jan 2019 – Jan 2019 | Stony Brook, New York

Course: Data Structures and Algorithms

### Senior Software Engineer @ Practo
Jan 2017 – Jan 2018 | Bengaluru Area, India

1. Online Appointment Booking Service
   - Worked on Practo's online appointment booking system, which lets users book doctor's appointments online with a near real-time experience.
   - Helped design the architecture and built real-time sync between the central inventory and external inventories.
   - Built new REST APIs with minimized response times to provide a seamless user experience.
   - Technologies and languages: AWS (SQS, SNS, CloudWatch, RDS, EC2), MySQL, Java, PHP, Spring Boot, Symfony, Swagger, containers.
2. Integration Platform (SDK and Agent)
   - Worked on the platform that enables HMS (Hospital Management Systems) to connect with in-house Practo products such as online appointment booking, patient record sharing, and online doctor consultation.
   - The platform consisted of three major pieces: the integration platform, an SDK (https://developers.practo.com/sdk/java) for external third-party HMS, and the changes required in legacy products to connect them with the SDK.
   - It lets any HMS use Practo healthcare products directly by buying the SDK enabled with specific services.
   - Built the integration platform from scratch, along with the changes needed for legacy products to understand SDK requests.
   - Helped design the database schema and the SDK, whose methods internally call the integration platform's APIs, which in turn communicate with Practo's in-house products.
   - Technologies and languages: Amazon Web Services, Spring Boot, Java, MySQL, PHP.

### Software Engineer @ Practo
Jan 2015 – Jan 2017 | Bengaluru Area, India

1. Online Search Platform
   - Worked on Practo search (https://www.practo.com/): faster search and intelligent suggestions using a data-driven adaptive ranking algorithm for specific search results. A scalable system with multi-language support using language modelling and knowledge source graphs.
   - Technology: PHP, AWS, Elasticsearch, MySQL.
2. Profile Management Service
   - Worked on Practo Partner Profiles (https://www.practo.com/providers), a system for making the doctor onboarding process smooth. It consists of REST APIs for creating and editing profiles, validation of mobile numbers using OTP, and a mailing system that updates profile owners about their profile status.
   - Designed the database schema and implemented access security for profiles across users with different roles, as well as the upload service and OTP service.
   - Technology: PHP, AWS, Elasticsearch, MySQL, Python.
3. Data Collection and Processing Pipeline
   - Worked on the mobile app backend server used to collect multiple data points about doctors and clinics, which are curated and then consumed by Practo search.
   - Designed the database schema and supported the mobile app's periodic sync capability with minimal data loss and duplication.
   - Built an internal tool for processing doctor and clinic profiles, which pass through a workflow with a hierarchical, team-based access control system. The tool processes the collected data and makes it live for online Practo search.

### Data Science Intern @ Innovaccer
Jan 2015 – Jan 2015 | Noida Area, India

- Worked on topic modelling for NASA aviation data: text mining to help a NASA researcher identify the most frequent aircraft problems. Used an unsupervised machine learning approach, researching Latent Dirichlet Allocation (LDA) and introducing a 'Bigram Bag of Words' into it.
- Worked on sports news text summarization: extraction-based summarization of news articles, extracting important sentences and creating comprehensive summaries using domain knowledge.

## Education

### Master of Science - MS in Computer Science
Stony Brook University
Jan 2018 – Jan 2019

### Bachelor's degree in Information Technology
Indian Institute Of Information Technology Allahabad
Jan 2011 – Jan 2015

### Senior Secondary
Army School, Allahabad
Jan 2008 – Jan 2010

### High School
Maharishi Vidya Mandir, Allahabad
Jan 1998 – Jan 2008

## Contact & Social

- LinkedIn: https://linkedin.com/in/saurabh-singh-22189976

---

Source: https://flows.cv/saurabhsingh

JSON Resume: https://flows.cv/saurabhsingh/resume.json

Last updated: 2026-03-29