The document discusses dimensional modeling for data warehousing. Dimensional modeling uses a star schema with fact and dimension tables. Facts are numeric measures located in large fact tables with keys linking to smaller dimension tables containing descriptive, relatively static data. Dimensional modeling provides an intuitive structure for high performance querying to answer business questions by analyzing facts across dimensions like product, time, customer etc. The document outlines characteristics of dimensions and facts and considerations for determining grain, dimensions and measures in dimensional modeling.
3. ï‚¡ While an entity-relationship
modeling approach from relational
database design could be used, the
dimensional modeling approach to
logical design is more often used
for a data warehouse.
3
4. ï‚¡ End users cannot understand,
remember, navigate an E/R model
(not even with a GUI)
 One reason is that an enterprise-
level ERM would be too complex to
understand.
4
8. Several distinct dimensions, combined with
facts, enable you to answer business
Dimension questions.
Tables
Geographic Dimension
Measures
Fact Table
Geographic Product Time Units $
Product Facts
Time
8
21. Facts
ï‚¡ Non-additive facts cannot be added at all.
 An example of this is averages.
ï‚¡ Semi-additive facts can be aggregated along some of
the dimensions and not along others:
 current_Balance is a semi-additive fact as it makes
sense to add them up for all accounts (what's the
total current balance for all accounts in the bank?)
but it does not make sense to add them up through
time (adding up all current balances for a given
account for each day of the month does not give us
any useful information
ï‚¡ The most useful measures are: Numeric, Additive
21
22. ï‚¡ Atomic level of data of the business
process
ï‚¡ A definition of the highest level of
detail that is supported in a data
warehouse
22
23. ï‚¡ A fact table usually contains facts
with the same level of aggregation
ï‚¡ a proper dimensional design allows
only facts of a uniform grain (the same
dimensionality) to coexist in a single
fact table
23
24. ï‚¡ Some perfectly good fact tables represent
measurements that have no facts! This kind of
measurements is often called an event. The
classic example of such a factless fact table is a
record representing a student attending a class
on a specific day. The dimensions are Day,
Student, Professor, Course, and Location, but
there are no obvious numeric facts. The tuition
paid and grade received are good facts but not
at the grain of the daily attendance.
24
25. ï‚¡ Dimensions without attributes. (Such as a
transaction number or order number.)
ï‚¡ Put the attribute value into the fact table
even though it is not an additive fact.
25
29. 1. Choosing the data mart for the
small group of end users we deal
with.
 Choose a business process to
model, e.g., orders, invoices, etc.
29
30. 2. Fact table granularity (the smallest
defined level of data in the table) is
determined.
30
31. 3. Fact table dimensions are selected.
 Choose the dimensions that will
apply to each fact table record
 Add dimensions for "everything
you know" about this grain.
31
32. 4. Determine the facts for the table. In
most cases, the granularity is at the
transaction level, so the fact is the
amount.
 Choose the measure that will
populate each fact table record
 Add numeric measured facts true to
the grain
32
33. ï‚¡ The Data Warehouse Toolkit.Second
Edition.The Complete Guide to Dimensional
Modeling.Ralph Kimball.Margy Ross