Built high-impact products for the advertising industry
• Architected a scalable ad data pipeline from scratch using Apache Spark (40B+/month)
• Built a scalable real-time Apache Spark Streaming pipeline with Apache Kafka (100K+ records/sec)
• Implemented maintainable data workflow management using Spotify Luigi
• Microservice architecture (fraud detection, delivery reporting, audience segmentation, machine learning for CTR optimization, etc.)
• Lambda architecture (real-time + batch)
• Built big data processing on top of cost-effective Amazon EMR infrastructure
• Integrated data processing with external companies for BI
• Optimized data processing with Apache Parquet
• AWS Athena and QuickSight integration