Data Vault & Ensemble Modeling - BI-Podium · The Genesee Academy CDVDM – Data Vault Modeling Course. The CDVDM is the data vault certification course covering all main topics of
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
• Data Vault is the leading data modeling approach among new options for the flexible/agile data warehouse.
Data Modeling Approaches:
Operational Data Warehouse Data Mart
• For data warehouse agility there are other techniques as well. The
broader family of techniques are all flavors of Ensemble Modeling. • In effect Ensemble modeling = EDW modeling. • Ensemble is based on the premise: The flexibility required by the data
warehouse needs a model that de-couples changing context from relationships from the business keys (Unified Decomposition).
• Background Topics: – Core Business Concepts – Agility
• Unified Decomposition • Ensemble Modeling • Data Vault Agility • The Data Vault Ensemble • Data Vault Core Constructs • Applying Data Vault • Core Concepts and the Backbone • DV Pattern applied • Bottom Line and Summary
• The Core Business Concept is the basis for our Data Vault Data Warehouse. It is similar to the Entity in 3NF or a Dimension in a Star Schema. And so it commonly includes Customer, Product, Employee, and etc.
• Important to note: 1) Business Driven, and 2) Enterprise Wide.
• The EDW is constantly needing to adapt to change
– New Sources – New Attributes – Changing Sources – New and Changing Requirements – New and Changing Business Rules – New and Changing Deliveries – Expanding Subject Areas
Separate things that change from things that are not changing.
• Break things out into component parts for flexibility and to facilitate the capture of things that are either interpreted in different ways or changing independently of each other. Decomposition.
• These parts however need to be integrated to define the core business concept (the Entity, the Dimension, etc.). So they must be kept together. Unified.
All the parts of a thing taken together, so that each part is considered only in relation to the whole.
• The constellation of component parts acts as a whole – an Ensemble.
• With Ensemble Modeling the Core Business Concepts that we define and model are represented as a whole – an ensemble – including all of the component parts.
• An Ensemble is based on all things defining a Core Business Concept that can be uniquely and specifically said for one instance of that Concept.
• The Data Vault Ensemble conforms to a single key embodied in the Hub construct.
• The component parts for the Data Vault Ensemble include: – Hub The Natural Business Key – Link The Natural Business Relationships – Satellite All Context, Descriptive Data and History
• The Data Warehouse needs to adapt to change easily, be based on central business concepts, integrate data from several sources, track history of changing context, contain trusted and auditable information, and it needs to perform.
• Answering this call means a data warehouse program that is designed to meet these requirements with the people, processes, and the modeling techniques that support them.
• Data Warehouse modeling => Ensemble modeling. Techniques that are based on Unified Decomposition. There are several forms of Ensemble methods in play today.
• Data Vault modeling is the leading form of Ensemble modeling today.
The Genesee Academy CDVDM – Data Vault Modeling Course. The CDVDM is the data vault certification course covering all main topics of data vault modeling. The course is delivered in a blended learning method using online video lessons (2 weeks), classroom lectures, exercises, labs and small group modeling cases. Public courses are offered on a regular schedule www.GeneseeAcademy.com and there are in-company options as well.
• Hans Hultgren is an author, speaker, educator and advisor in the data warehousing and business intelligence space. He is an expert on data vault modeling and the author of Modeling the Agile Data Warehouse with Data Vault where he introduced Ensemble Modeling and Unified Decomposition.
• Hans is the President of Genesee Academy, LLC (including also
www.DataVaultAcademy.com) which provides the CDVDM data vault certification around globe.
• For 20 years Hans was a professor at DU where he was the founder and
director of the masters of science degree in business intelligence and data warehousing MSBI.