I've been reading through "Super Charge your
Data Warehouse," and have learned a great deal. Still, I find myself struggling with a key element of this. I understand that
Hubs contain of a number sequence, a business key, load date, and a record source. As I've come to understand it,
hubs act are representative of real world assets. For example, an itinerary of products may be logged as business keys that are itemized by IDs assigned to them by a CRM system. Understandably, these numbers serve a purpose in
satellite tables, condensing historical data into one place. This, however, fails to explain just why
hubs are given these IDs. As
Hubs lend themselves to a substantial amount of varia
bility, changing the
satellite table at any given time. Herein lies my confusion.
Am I to understand that these numbers are there in order to itemize records from stratified groups in order to give a juxtaposed view of different records?
If so, does this not complicate the view if it is possible for the same ID to be used for multiple items reflecting real-world assets?
And if this is the case, doesn't normalization of a proper record fly out the window if the same ID is also a primary key?