Skip to content
Surf Wiki
Save to docs
general/business-intelligence-terms

From Surf Wiki (app.surf) — the open knowledge base

Early-arriving fact

Dimensional Data modeling scenario


Dimensional Data modeling scenario

In the data warehouse practice of extract, transform, load (ETL), an early fact or early-arriving fact, also known as late-arriving dimension or late-arriving data, denotes the detection of a dimensional natural key during fact table source loading, prior to the assignment of a corresponding primary key or surrogate key in the dimension table. Hence, the fact which cites the dimension arrives early, relative to the definition of the dimension value. An example could be backdating or making corrections to data.

Handling

Procedurally, an early fact can be treated several ways:

  • As an error: On the presumption that the dimensional attribute values should have been collected before fact source loading
  • As a valid fact, pause loading: The collection pauses whilst the missing dimensional attribute value itself is collected
  • As a valid fact, load with dummy keys: A primary key value is generated on the dimension with no attributes (stub / dummy row), the fact completes processing, and the dimension attributes are populated (overwritten) later in the load processing on the new row
  • Classify as a Suspense record: Assuming that the associated dimensional attribute was expected by process, move this fact record in a Suspense table and activate alert/SOPs (reporting mismatch [sum/count/aggr], business/data steward, manual correction etc.) In rare circumstances, the suspense records may also be combined (UNION) with the fact table to ensure the metrics are correctly calculated.

References

References

  1. "Kimball, Ralph. Design Tip #57: Early Arriving Facts. August, 2004.".
  2. [https://www.leapfrogbi.com/early-arriving-facts-late-arriving-dimensions/ Early Arriving Facts / Late Arriving Dimensions - LeapFrogBI]
  3. [https://roelantvos.com/blog/a-gentle-introduction-to-bitemporal-data-challenges/ A gentle introduction to bitemporal data challenges - Roelant Vos]
Info: Wikipedia Source

This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.

Want to explore this topic further?

Ask Mako anything about Early-arriving fact — get instant answers, deeper analysis, and related topics.

Research with Mako

Free with your Surf account

Content sourced from Wikipedia, available under CC BY-SA 4.0.

This content may have been generated or modified by AI. CloudSurf Software LLC is not responsible for the accuracy, completeness, or reliability of AI-generated content. Always verify important information from primary sources.

Report