Javascript is not enabled in your browser. Enabling JavaScript in your browser will allow you to experience all the features of our site.

Learn how to enable JavaScript on your browser

The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses (Includes CD-Rom)

by Ralph Kimball

All Formats & Editions

Paperback

View All Available Formats & Editions

Overview
Editorial Reviews
Product Details
Related Subjects
Read an Excerpt

Overview

One of the most dramatic new developments in database design, the dimensional data warehouse is a powerful database model that significantly enhances managers' ability to quickly analyze large, multidimensional data sets. Written by the leading proponent of this revolutionary new approach, this valuable book/CD toolkit outfits you with all the nuts-and-bolts information you need to design, build, manage, and use dimensional data warehouses for virtually any type of business application, as well as software for querying dimensional data warehouses. Employing many real-life case studies of data warehouses, Ralph Kimball provides clear-cut guidelines on how to model data and design data warehouses to support advanced multidimensional decision support systems. Beginning with the relatively simple example of a data warehouse for a grocery store, he progresses, step-by-step, through an increasingly complex array of business applications in retail, manufacturing, banking, insurance, subscriptions, and airline reservations. By the end of the book, you will have mastered the full range of powerful techniques for creating, controlling, and navigating dimensional business databases that are easy to understand and navigate.

Editorial Reviews

From the Publisher

"Ralph Kimball provides clear-cut guidelines on how to model data and design data warehouses to support advanced multidimensional decision support systems. With Kimball's step-by-step instructions, readers will have mastered the full range of powerful techniques for creating, controlling and navigating dimensional business databases that are easy to understand and navigate." (Fatbrain.com; Ganthead.com, 9/01)

Booknews

A leading proponent of the dimensional data warehouse, a database model designed to improve managers' ability to quickly analyze large, multidimensional data sets, provides concrete tools for designing, building, managing, and using dimensional data warehouses for different types of business applications. A CD- ROM includes software for querying dimensional data warehouses and working models of all databases described in the book. Annotation c. Book News, Inc., Portland, OR (booknews.com)

Product Details

ISBN-13:: 9780471153375
Publisher:: Wiley
Publication date:: 01/28/1996
Edition description:: Older Edition
Pages:: 416
Product dimensions:: 7.54(w) x 9.22(h) x 0.89(d)

Related Subjects

Read an Excerpt

Chapter 9: Insurance

In this chapter we bring together ideas from nearly all the previous chapters to build a data warehouse for a typical insurance company. If you are in the insurance industry and you have jumped directly to this chapter for a quick fix, please accept the author's apologies, but this chapter depends heavily on ideas from the previous chapters. You need to read the previous chapters!

We will imagine that our insurance company is a $3 billion property and casualty insurer for automobiles, home fire protection, and personal liability. There are two main production data sources: all transactions relating to the formulation of policies, and all transactions involved in processing claims. Based on extensive interviews of claims processing managers, field sales managers, financial managers, and senior management, we conclude that the insurance company wants to analyze both the written policies as well as the claims. They want to see which coverages are most profitable and which are least profitable. They want to measure profit over time by covered item type (i.e., which kinds of houses and which kinds of cars), state and county, demographic profile, underwriter, sales broker and sales region, and event. Events are usually called catastrophes in this industry. The desire to see profit implies that both revenues and costs can be identified and tracked. The insurance company wants to understand what happens during the life of a policy, particularly when a claim is processed. The time from when a claim is first made to when the first payment is made is an important measure of the efficiency of the claims handling process. This can be measured only by looking at the detailed transactions.

In this chapter we will develop the design in the same way we developed the household data warehouse for the bank. We will start with the central dimensional schemas for policy creation and claims processing and then embellish them to handle monthly snapshots, heterogeneous products, and demographic minidimensions.

POLICY CREATION

We will assume that a policy is a "header" for a set of coverages sold to the insured party. Coverages are really the products that insurance companies sell. Homeowner coverages include fire, flood, theft, and personal liability. Automobile coverages include comprehensive, collision damage, uninsured motorist, and personal liability. In a property and casualty insurance company, such as the one we are modeling, coverages typically apply to a specific covered item, such as a particular house or car. Both the coverage and covered item are identified carefully in the policy. A particular covered item will usually have several coverages listed in the policy. We will assume that a policy can contain multiple covered items.

A policy is sold to an insured party by an agent or a broker. Agents are either employees of the insurance company or may be independent. Brokers are independent. Before the final policy can be created, a rater in the insurance company determines the rate that will be charged, given the particular coverages, the particular covered items, and the qualifications of the insured party. The final step of approval is made by an underwriter, who takes the ultimate responsibility for doing the business with the insured party. Typically the agent (or broker), rater, and underwriter are all different people.

We assume we capture the following transactions in the production policy creation system:

Create Policy; Alter Policy; Cancel Policy (with Reason)
Create Coverage on Covered Item; Alter Coverage; Cancel Coverage (with Reason)
Rate Coverage; Decline to Rate Coverage (with Reason)
Underwrite Policy; Decline to Underwrite Policy (with Reason)

In general, a given transaction comes with a lot of context. For example, when a Create Coverage on Covered Item transaction is processed, many things are known, including the insured party, the agent, the policy number, the coverage identification, and the covered item identification. If rating and underwriting have not happened yet, then the rater and underwriter are not known. it is important when designing the production extract system not to take the data found in the atomic transaction record so literally that some of the context is treated as unknown. Or, to state it more positively, a given atomic transaction must be embellished with as much context as possible, to create a full-dimensional description of the transaction.

The dimensional framework for the policy transaction fact table consists of the following:

Transaction date
Effective date
Insured party
Employee
Coverage
Covered item
Policy
Transaction

The transaction date is the date when the transaction was entered into the production system. The effective date is when the transaction is legally effective. These two independent dimensions can be implemented via a single physical table, which is declared via the SYNONYM construct to be two different tables as far as SQL is concerned, as described in Chapter 2.

The insured party is the customer. In all probability, the insured party dimension is a ditty customer dimension, in that no really serious attempt is made to identify previous instances of an insured party on other policies. The insured party can be multiple people, such as a person and their spouse, or the insured party can be a business entity. In general, insured party is a big dimension, which in the case of our $3 billion insurance company probably has a million or more entries. Later in this chapter we will definitely be using the big-dimension techniques on insured party that we developed in Chapter 6. . . .

Customer Reviews

Average Review:

Write a Review

and post it to your social network

Most Helpful Customer Reviews

See all customer reviews >