• Designed and deployed efficient data pipelines for financial stock forecasting using Python and PySpark, achieving a 75% improvement in data accuracy through optimized preprocessing techniques.
• Integrated and analysed large-scale financial datasets from diverse sources using SQL and Python, ensuring data accuracy and integrity, and conducted advanced statistical analyses to identify patterns and market trends.
• Improved data transformation efficiency by 30% using Min-Max and Standard scaling techniques, enhancing data quality for predictive modelling.
• Developed interactive Tableau dashboards to visualize data insights and key performance indicators (KPIs), enabling stakeholders to access real-time analytics and reducing report generation time by 40%.
• Optimized SQL queries, cutting execution time by 30% and enhancing reporting efficiency for large datasets.
• Leveraged Matplotlib to design data visualization strategies for financial trend analysis, uncovering stock patterns and outliers to enhance predictive modelling and decision-making.
• Utilized PySpark for big data processing, enabling efficient distributed analysis of large-scale financial datasets.