Strong suits include, but are not limited to: Node, Python, React, Relational Databases, NoSQL Databases, GCP, AWS, and Linux.
Experience
2022 — 2024
New York City Metropolitan Area
Part of the Anaconda Distribution department, responsible for distributing all Anaconda packages, from those in our premium products such as Anaconda Business and Anaconda Enterprise to custom-built solutions for our external partners (Snowflake, Microsoft, IBM, etc.).
Our team developed data pipelines that generate SBOMs (Software Bills of Materials) following the SPDX (Software Package Data Exchange) standard, as well as custom CVE (Common Vulnerabilities and Exposures) reports for all of the packages offered by Anaconda.
Designed and developed a microservice and API for maintaining and accessing CVE data using FastAPI and MongoDB. Contributed to ETL pipelines written in Rust.
Built event-driven pipelines to integrate CVEs and SBOMs with Anaconda Business and Anaconda Enterprise using AWS Step Functions.
Created a data model for a Neo4j-hosted database that shows the relationships between packages, their dependencies, and other package metadata. Wrote ETL pipelines for the stored data using PySpark and AWS Glue.
Maintained OSS packages written in Rust and Python for matching packages with vulnerabilities and generating SBOMs.
Technologies used: Python (FastAPI, PySpark, Conda, Prefect), Rust, MongoDB, Postgres, AWS (Step Functions, EventBridge, S3, SNS, SQS, RDS, DocumentDB, DynamoDB), Azure (AKS, Cosmos DB, App Service), Kubernetes, Docker, Terraform, HashiCorp Vault
2020 — 2021
New York City Metropolitan Area
Developed and maintained end-to-end ETL pipelines using AWS (EKS, MSK, Lambda, S3, RDS, DynamoDB, Athena, Glue), GCP (App Engine, BigQuery, GCS), and Airflow.
Built a data visualization app using Django, Leaflet.js, App Engine, and BigQuery to give customers insights into their product subscriptions.
Wrote CI/CD pipelines using Bitbucket, Docker, and Terraform.
Deployed and maintained an internal Web Archive using ECS, Fargate, Cognito, and EFS to provide unlabeled training data for annotators.
Contributed to a React-based admin portal used to monitor pipelines, annotator performance, annotation services and tools, and databases.
Technologies used: Python, SQL, Node.js, AWS, GCP, Kafka, Docker, Kubernetes, Looker
Hoboken, New Jersey
Built UI features with Apollo and React that fetch data from multiple sources and render robust components
Developed a microservice responsible for ingesting millions of annotations and applying syntax checks to fields in XML outputs
Led training for numerous models and enrolled new classes in a Siamese network to improve an object recognition model using TensorFlow
Generated and annotated scenes for an Object Detection model and I3D Action Classifier to increase NPD by 5%
Scripted and executed ETL tasks with MinIO to implement changes needed across hundreds of thousands of annotations
Greater New York City Area
Worked on ad distribution and custom ad products
Analyzed and investigated data, which resulted in a 400% increase in ad view impressions for a particular site
Set up a system for batch uploading user segments to DFP
Assisted in the development of an ad spawning system for numerous sites
Technologies used: Node, Python, React, AWS Lambda, EC2, S3
Education
Rutgers University