Intentionally feed "bad" data into the pipeline to ensure it is caught by error Satellites rather than crashing the load. Success Metrics (KPIs)
Test the performance of PIT tables; they should significantly speed up the joining of multiple Satellites. Testing Data Vault-Based Data Warehouse
Run row counts and sum-checks between the Source Landing Zone and the Raw Vault. Intentionally feed "bad" data into the pipeline to
Data Vault excels at "insert-only" logic. Your tests should mirror this. Data Vault excels at "insert-only" logic
Ensure "Zero Keys" or "Ghost Records" exist in Hubs to handle late-arriving data or missing lookups without breaking the model. 3. Data Integrity & Reconciliation This ensures that "what went in is what came out."
Run the same load twice. The system should recognize the data exists and result in zero new inserts.
Insert a record with a modified attribute. Verify that a new Satellite record is created with the updated data while the old record remains (historical tracking).