Multinational Retail Pipeline Summary
Github Repository
- The Multinational Retail Data Centralisation project consolidates over 100,000 rows of sales data from AWS RDS, S3, PDFs, and APIs into a central PostgreSQL database using a robust ETL pipeline. This pipeline automates the extraction, transformation, and loading of complex, multi-source data.
- Python libraries such as Pandas, SQLAlchemy, psycopg2, and boto3 facilitate seamless data extraction, transformation, and loading, while optimising complex queries for business analytics. The project improved query efficiency by 30%, significantly reducing data processing time.