# Jonathan L. > Software Engineer - Backend, Data Platform, and MLOps Location: New York City Metropolitan Area, United States Profile: https://flows.cv/jonathanl3 Software engineer with 10+ years of experience in data modeling and building data platforms for analytical workflows. ## Work Experience ### Software Engineer @ Foursquare Jan 2025 – Present | New York, New York, United States ### Instructional Associate @ Georgia Institute of Technology Jan 2026 – Present ### Founding Board Member @ No Volleyball No Life Inc. Jan 2023 – Present ### Data Products & Analytics Engineer @ Hearst Jan 2024 – Jan 2025 | New York, New York, United States Established a foundational data tech stack on GCP to support new data and analytics initiatives using Dagster on GKE, Iceberg tables on BigQuery, Daft on Ray, and Looker. Led migration of open source Dagster to Dagster+ and developed standards for data observability. Developed Dagster assets and protocols to enforce consumer privacy requests and mask PII data. Instituted DevOps practices using OpenTofu, Helm, Crossplane, and GitHub Actions. ### Staff Software Engineer @ Creyon Bio Jan 2023 – Jan 2024 ### Senior Software Engineer @ Creyon Bio Jan 2022 – Jan 2023 As the first full time software engineer and MLOps Tech Lead, I wore many hats and developed and managed many systems, some highlights are: Architected a data lake platform and pipelines for early research data using Python and BigQuery. Designed and built a gRPC service to manage lab and ML metadata with PostgreSQL on GKE. Developed a vector database using BigQuery to manage computational chemistry simulation data. Mentored fellow software engineers on system design, coding practices, and infrastructure as code. ### Software Engineer, Data Infrastructure @ Fox Corporation Jan 2019 – Jan 2022 Digital Advertising Operations Data Engineering and Science Advocated for and implemented best practices for code organization, infrastructure as code, continuous integration and delivery, observability, and data validation and testing as DataOps lead. Developed a real-time behavioral pixel for streaming audience data to analyze audience behavior, ad performance, and conversion using Go, Kinesis, and AWS API Gateway. Designed and implemented data lake-house infrastructure and ETL/ELT and financial reporting pipelines for all digital ad-tech data from internal sources and external vendors and APIs using pandas, pySpark, Airflow, S3, Glue, Athena, Kinesis, Redshift, and Looker on a team of 5. Provisioned and maintained Kubernetes cluster on EKS using Terraform and Helm. Mentored an intern who automated data testing for ETL pipelines using Great Expectations. ### Process Development Engineer II @ Regeneron Pharmaceuticals, Inc. Jan 2017 – Jan 2019 Evaluated and developed >4 proof of concepts using Python and Apache Spark in a team of 2 for machine learning and data science applications in bioprocess development. Developed an API gateway in Scala in a team of 7 to harmonize access of process data from SQL, time series, and unstructured data sources for engineers to visualize and analyze their data sets retrospectively to derive new scientific insights. Implemented continuous integration and continuous delivery using Jenkins and Kubernetes. Managed a direct report responsible for scientific database administration and data science. Mentored 5 co-ops and interns responsible for developing ETL and scientific analysis tools for high throughput screening experiments and data entry web applications to reduce the amount of time for experiment setup by hours. ### Process Development Engineer I @ Regeneron Pharmaceuticals, Inc. Jan 2015 – Jan 2016 Researched, designed, and implemented systems for data storage, access, visualization, and analysis, and enforced best practices for data integrity as a data management technical lead. Developed >20 scripts in Python, R, and Perl to automate data pipelines from and to scientific instruments allowing for robotic laboratory automation to run 24/7. Developed an application in Python and JSL in a team of 6 to automate statistical data analysis for design of experiments, including outlier analysis, multiple linear regression, and simulations. Coached and trained department of >100 in Kanban, Scrum, and XP as an Agile evangelist. Designed and performed >35 design of experiments and >5 automated high throughput methods to support quality by design approach for protein purification process development. ### Molecular Engineer @ Emerald Therapeutics Jan 2011 – Jan 2014 Implemented clustering analysis in the Wolfram Language for flow cytometry data. Implemented interactive image analysis in the Wolfram Language for gel electrophoresis data. Designed and developed a data management system in the Wolfram Language, Perl, and SQL to track laboratory inventory and store scientific data. Developed and maintained automated unit testing and documentation libraries in the Wolfram Language that are used to test and document code written by scientists. Automated >8 laboratory tasks and their data pipelines using liquid handlers, Python, and Perl. Streamlined and optimized lab scale ion exchange HPLC for purification of oligonucleotides. Maintained a micromole scale DNA production pipeline with day to day responsibilities such as DNA synthesis, purification, quantification, quality control, and instrument maintenance. (http://www.wolfram.com/mathematica/customer-stories/research-workflow-mathematica.html) (http://www.emeraldcloudlab.com/) ### Undergraduate Researcher @ Carnegie Mellon University Jan 2011 – Jan 2011 Determined the zeta potential of chemical mechanical planarization consumables as a function of pH. Developed a computational fluid dynamics model in COMSOL of flow in porous media in a rotating disk geometry in order to determine the zeta potential of the porous surface. ### R&D Summer Intern @ Colgate Palmolive Jan 2010 – Jan 2010 Developed and optimized a biofilm assay for high throughput screening of mouthwash formulations. ### Undergraduate Researcher @ Carnegie Mellon University Jan 2009 – Jan 2010 Evaluated a nano-structured star polymer for the delivery of siRNA to prevent craniosynostosis. ## Education ### Master of Science - MS in Computer Science Georgia Institute of Technology ### B.S. in Chemical Engineering Carnegie Mellon University ### Master’s Degree in Data Analytics Penn State World Campus ### Advanced Certificate in C++ Programming Washtenaw Community College ## Contact & Social - LinkedIn: https://linkedin.com/in/iamjkleung --- Source: https://flows.cv/jonathanl3 JSON Resume: https://flows.cv/jonathanl3/resume.json Last updated: 2026-04-13