Data Modeling & Warehousing
Introduction
Star Schema vs Snowflake Schema
Star Schema
            ┌─────────────────┐
            │  Dim_Customer   │
            │  - customer_key │
            │  - name         │
            │  - email        │
            └────────┬────────┘
                     │
         ┌───────────▼───────────┐
         │   Fact_Transactions   │
         │  - transaction_id     │
         │  - customer_key (FK)  │
         │  - product_key (FK)   │
         │  - date_key (FK)      │
         │  - amount             │
         │  - quantity           │
         └───────────┬───────────┘
                     │
     ┌───────────────┼──────────────┐
     │               │              │
┌────▼────┐   ┌──────▼──────┐   ┌───▼──────┐
│Dim_     │   │  Dim_Date   │   │Dim_      │
│Product  │   │  - date_key │   │Location  │
└─────────┘   └─────────────┘   └──────────┘
Slowly Changing Dimensions (SCD)
SCD Type 1 - Overwrite
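As a minimal sketch of the overwrite approach, the example below keeps Dim_Customer as an in-memory mapping keyed by customer_key and replaces the stored row when a change arrives. The column names come from the star schema diagram; the `CustomerRow` class, the `apply_scd1` helper, and the sample values are hypothetical and only illustrate the idea that Type 1 keeps no history.

```python
from dataclasses import dataclass


@dataclass
class CustomerRow:
    # Columns taken from the Dim_Customer box in the diagram.
    customer_key: int
    name: str
    email: str


def apply_scd1(dim: dict[int, CustomerRow], incoming: CustomerRow) -> None:
    """Hypothetical helper: overwrite the row in place, discarding the old values."""
    dim[incoming.customer_key] = incoming


# Illustrative data only.
dim_customer = {1: CustomerRow(1, "Ada", "ada@old.example")}
apply_scd1(dim_customer, CustomerRow(1, "Ada", "ada@new.example"))
```

After the call, the dimension still holds a single row for customer_key 1, and the previous email address can no longer be recovered from it.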
SCD Type 2 - Track History
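One common way to track history is to close the current row and append a new version with its own surrogate key. The sketch below assumes that layout. The columns `customer_id` (a business key), `valid_from`, `valid_to`, and `is_current` do not appear in the diagram above; they, along with the `CustomerVersion` class and `apply_scd2` helper, are assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class CustomerVersion:
    customer_key: int          # surrogate key, one per version
    customer_id: str           # assumed business key, not in the diagram
    name: str
    email: str
    valid_from: date           # assumed validity columns
    valid_to: Optional[date]   # None while this version is current
    is_current: bool


def apply_scd2(dim: list[CustomerVersion], customer_id: str,
               name: str, email: str, as_of: date) -> CustomerVersion:
    """Hypothetical helper: expire the current version and append a new one."""
    current = next(
        (r for r in dim if r.customer_id == customer_id and r.is_current), None
    )
    if current is not None:
        if (current.name, current.email) == (name, email):
            return current  # no attribute changed, so no new version
        current.valid_to = as_of
        current.is_current = False
    next_key = max((r.customer_key for r in dim), default=0) + 1
    new_row = CustomerVersion(next_key, customer_id, name, email, as_of, None, True)
    dim.append(new_row)
    return new_row


# Illustrative data only.
dim = []
apply_scd2(dim, "C-100", "Ada", "ada@old.example", date(2024, 1, 1))
apply_scd2(dim, "C-100", "Ada", "ada@new.example", date(2024, 6, 1))
```

Fact rows loaded before the change keep pointing at the first customer_key, so reports on past transactions still see the attributes that were valid at the time.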
Data Warehouse vs Data Lake
Aspect | Data Warehouse | Data Lake
Dimensional Modeling Best Practices
Date Dimension Generation
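A date dimension is usually generated once for a fixed range rather than loaded from a source system. The sketch below builds one row per calendar day. Only `date_key` appears in the diagram above; the YYYYMMDD integer encoding for it, the other attribute names, and the `generate_date_dimension` helper are assumptions chosen for illustration.

```python
from datetime import date, timedelta


def generate_date_dimension(start: date, end: date) -> list[dict]:
    """Hypothetical helper: one row per day from start to end, inclusive."""
    rows = []
    day = start
    while day <= end:
        rows.append({
            "date_key": int(day.strftime("%Y%m%d")),  # assumed YYYYMMDD integer key
            "full_date": day,
            "year": day.year,
            "quarter": (day.month - 1) // 3 + 1,
            "month": day.month,
            "day": day.day,
            "day_of_week": day.isoweekday(),          # 1 = Monday ... 7 = Sunday
            "is_weekend": day.isoweekday() >= 6,
        })
        day += timedelta(days=1)
    return rows


dim_date = generate_date_dimension(date(2024, 1, 1), date(2024, 12, 31))
```

An integer key in this form sorts chronologically and stays readable in ad hoc queries, which is one reason it is a popular choice; a plain sequence number would work as well.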
Fact Table Design
Partitioning Strategies
Indexing Strategies
Conclusion