•Developed Spark Streaming pipeline to feature engineer internal and AWS data, improving efficiency for community detection algorithm
•Fixed data serialization issues of internal data product, saving development hours and increasing reliability
•Simplified pipeline by consolidating three Spark jobs into one, alleviating need for AWS resources
•Participated in Agile methodologies, including standup and two-week sprints, to receive constant feedback