Responsible for building DBLytix®, a library of machine learning tools for Big Data analytics, on Apache Spark from the ground up.
• Led the entire software development cycle for the product and supported it on various Enterprise Hadoop distributions.
• Implemented various advanced machine learning algorithms, optimized for high performance and scalability.
• Designed and implemented a SQL interface that uses the Livy REST server to run the analytics functions written in Spark.