Refactored Wayfair’s Product Search data pipeline from SQL-based batch system to multi-source event-based low latency data update system supporting 500+ search indexes, leveraging NoSQL datastore Aerospike, Apache Kafka, Dropwizard and Google Guice
Supported 1000+ instances of Apache Solr across multiple datacenters on-premise and in Google Cloud, utilizing the Elastic stack, Datadog, InfluxDB and Jenkins
Optimized Solr instances for query performance and data update speed, leveraging caching strategies, cluster rebalancing, commit strategies and index merging policies
Designed, built and deployed Wayfair’s keyword search spellchecker, written in Python based on Wilbur Et. Al.’s spellchecker at Pub Med
Owned and rebuilt Search Tech’s monitoring and alerting, reducing engineering support time and increasing system resiliency