Seasoned Principal Software Engineer with extensive expertise in Big Data, Search, Spark, Ray.io, Kubernetes, and Docker, seeking a challenging role leveraging cutting-edge tech stacks to drive innovation in scalable, efficient AI solutions.
Experience
2016 – Now
Santa Clara, California, United States
š€š-šƒš¨šœš®š¦šžš§š­š¬ š’šžššš«šœš” (RAG/Agents)
• Developed an internal document search/RAG solution on OpenSearch, using its ML Commons, embedding, reranking, conversational search, agents, guardrails, and memory components.
š’ššššš’ š€š©š©š„š¢šœššš­š¢š¨š§ šƒšžš¯šžš„š¨š©š¦šžš§š­
• Architected and rolled out a SaaS enterprise app on hybrid cloud using Kubernetes and Docker.
• Implemented security measures: authentication, SAML SSO, LDAP, and OAuth2.
• Advocated for customer-focused solutions, innovative approaches, and data-informed decisions.
š€š-šš¨š°šžš«šžš šš‹š š­š¨ š’šš‹
• Part of the team that AI-enhanced NLP-to-SQL for Impala and Hive using LangChain, prompt builders, and Retrieval-Augmented Generation (RAG).
š‡š®šž'š¬ š“š«ššš§š¬šŸš¨š«š¦ššš­š¢š¨š§
• Co-led the evolution and development of Hue, a premier open-source Hadoop UI.
• Elevated Hue from a basic utility to a top-tier solution, now a staple in numerous enterprises.
š‹šžšššžš«š¬š”š¢š© & šš„ššš­šŸš¨š«š¦ š„š±š©šžš«š­š¢š¬šž
• Proficiency in PaaS and SaaS solutions.
• Strong coding skills in Go and Python.
• Steered a vibrant and productive team to success.
• Crafted and realized product visions, roadmaps, and pivotal strategic shifts.
š‚š„š¨š®š š–ššš«šžš”š¨š®š¬š¢š§š  & šˆš§šŸš«ššš¬š­š«š®šœš­š®š«šž
• Developed Data Warehouses on AWS, GCP, and Azure using Kubernetes/Docker.
• Created a tailored SQL Assistant for Cloudera's Data Warehouse & Cloud services.
• Revamped platform UX to enhance usability.
• Embedded CI/CD, modular approaches, and scalability tactics for heightened productivity.
2013 – 2016
Santa Clara, CA, USA
š‚š²š›šžš« š“š”š«šžššš­ šˆš§š­šžš„š„š¢š šžš§šœšž & šš¢š  šƒššš­šš
• Spearheaded the development of the malware cyber threat intelligence tool "AutoFocus".
• Transitioned the tool from a mere prototype to a full-fledged product.
• Constructed a comprehensive big data pipeline, aggregating multiple data sources with Python and Java producers/consumers.
• Authored data transformation jobs in Hadoop and Spark, and built data search on an Elasticsearch infrastructure.
š‹š¨š  šƒššš­šš šŒš¢š§š¢š§š  & š€š§ššš„š²š­š¢šœš¬
• Innovated and delivered the "PAN v2" firewall log data mining and analytics tool.
• Instrumental in addressing Support escalations and offering actionable insights to management.
• Enriched the tool with features such as term search, diagnostic metrics, and an intuitive dashboard.
• Designed and implemented functionalities like queuing, log parsing, indexing, and report generation.
š‚š¨š§šŸš¢š š®š«ššš­š¢š¨š§ šŒššš§ššš šžš¦šžš§š­ & šŽš«šœš”šžš¬š­š«ššš­š¢š¨š§
• Established a configuration management system using SaltStack.
• Orchestrated rolling version upgrades, significantly reducing downtime for the SaaS application.
• Conceptualized and designed a large-scale log data monitoring system, transitioning from a service-based architecture.
šƒššš­šš š’š­š¨š«ššš šž & š‘š„š’š“šŸš®š„ š’šžš«š¯š¢šœšžš¬
• Engineered a robust data storage system using HDFS/HBase.
• Integrated a REST API interface using Tomcat, JAX-RS, and the CXF framework.
• Leveraged Hadoop and HBase libraries, adopting Avro for effective data serialization.
š“šžšœš”š§š¢šœššš„ šš«š¨šŸš¢šœš¢šžš§šœš¢šžš¬:
Languages: Java, Python, Scala
Big Data Tools: HBase, Spark, Spark Streaming, Elasticsearch, Kafka, RabbitMQ
Web Technologies: Node.js, React.js, Tomcat, JAX-RS, Async, Tornado
DevOps & Management Tools: SaltStack, Rundeck, ELK
2006 – 2013
Palo Alto, CA, USA
š‹š¨š  šƒššš­šš š€š§ššš„š²š­š¢šœš¬:
• Architected a data mining and analytics dashboard for logs leveraging both Hadoop and Python stacks.
šŒš¢ššš„šžš°ššš«šž & šššœš¤šžš§š šƒšžš¯šžš„š¨š©š¦šžš§š­:
• Engineered Python-based middleware and web services, contributing to critical internal projects:
    – Elastic Test Cloud
    – Continuous Automated Testing
    – Build/Release Automation
    – ESX Automation
šŽš©š­š¢š¦š¢š³šžš šƒššš­šš š‘šžš©š„š¢šœššš­š¢š¨š§:
• Devised a C-based solution for Perforce journal data replication.
• Established a Perforce connection broker to efficiently redirect read/write queries, enhancing Perforce's operational efficiency.
2003 – 2005
Bangalore, India
2001 – 2003
Sunnyvale, CA, USA