This data pipeline project is designed to extract, transform, and load (ETL) real estate data from Redfin’s website into a Snowflake data warehouse, with Apache Airflow managing the entire workflow on an AWS EC2 instance. The final data is used to generate reports in Power BI. The pipeline ensures data is processed efficiently, securely, and is readily available for analysis and reporting.