Limit search to available items
Book Cover
E-book
Author Kimball, Ralph, author.

Title The data warehouse ETL toolkit : practical techniques for extracting, cleaning, conforming, and delivering data / Ralph Kimball, Joe Caserta
Published Indianapolis, IN : Wiley, [2004]
©2004

Copies

Description 1 online resource (xxxiv, 491 pages) : illustrations
Contents Requirements, Realities, and Architecture -- Surrounding the Requirements -- The Mission of the Data Warehouse -- The Mission of the ETL Team -- ETL Data Structures -- To Stage or Not to Stage -- Designing the Staging Area -- Data Structures in the ETL System -- Planning and Design Standards -- Data Flow -- Extracting -- The Logical Data Map -- Building the Logical Data Map -- Integrating Heterogeneous Data Sources -- The Challenge of Extracting from Disparate Platforms -- Mainframe Sources -- Flat Files -- XML Sources -- Web Log Sources -- ERP System Sources -- Extracting Changed Data -- Cleaning and Conforming -- Defining Data Quality -- Assumptions -- Design Objectives -- Cleaning Deliverables -- Screens and Their Measurements -- Conforming Deliverables -- Delivering Dimension Tables -- The Basic Structure of a Dimension -- The Grain of a Dimension -- The Basic Load Plan for a Dimension -- Flat Dimensions and Snowflaked Dimensions -- Date and Time Dimensions -- Big Dimensions -- Small Dimensions -- One Dimension or Two -- Dimensional Roles -- Dimensions as Subdimensions of Another Dimension -- Degenerate Dimensions -- Slowly Changing Dimensions -- Type 1 Slowly Changing Dimension (Overwrite) -- Type 2 Slowly Changing Dimension (Partitioning History) -- Precise Time Stamping of a Type 2 Slowly Changing Dimension -- Type 3 Slowly Changing Dimension (Alternate Realities) -- Hybrid Slowly Changing Dimensions -- Late-Arriving Dimension Records and Correcting Bad Data -- Multivalued Dimensions and Bridge Tables
Summary Annotation Cowritten by Ralph Kimball, the world's leading data warehousing authority, whose previous books have sold more than 150,000 copies. Delivers real-world solutions for the most time- and labor-intensive portion of data warehousing-data staging, or the extract, transform, load (ETL) process. Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse. Offers proven time-saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality
Notes Includes index
Master and use copy. Digital master created according to Benchmark for Faithful Digital Reproductions of Monographs and Serials, Version 1. Digital Library Federation, December 2002. http://purl.oclc.org/DLF/benchrepro0212 MiAaHDL
Description based on online resource; title from digital title page (viewed on February 24, 2022)
digitized 2010 HathiTrust Digital Library committed to preserve pda MiAaHDL
Subject Data warehousing.
Database design.
COMPUTERS -- Desktop Applications -- Databases.
COMPUTERS -- Database Management -- General.
COMPUTERS -- System Administration -- Storage & Retrieval.
Data warehousing.
Database design.
Data warehousing
Database design
Data-Warehouse-Konzept
Form Electronic book
Author Caserta, Joe, 1965- author.
LC no. 2004016909
ISBN 0764579231
9780764579233
0764567578
9780764567575