Developed and optimized Python ETL pipelines using pandas, SQLAlchemy, and Apache Airflow to process over 4,000 sustainability-focused data entries.
Reduced manual data-processing effort by 50% through pipeline automation, delivering clean, accurate ESG datasets that supported sustainability portfolio analysis.
Designed and implemented a sustainability review framework using Power BI, creating interactive dashboards to analyze sustainability scores for over 5,000 products across 2023 and 2024.
Led a two-person intern team to compile and validate data for an annual energy-use audit of over 4,000 products, automating initial validation steps in Python to reduce manual checks by 30%.