Home > Ask the Data Management Experts > DBMS and data warehousing Questions & Answers > Data warehouse testing
Ask The Data Management Expert: Questions & Answers
EMAIL THIS

Data warehouse testing

Mark Whitehorn EXPERT RESPONSE FROM: Mark Whitehorn

Pose a Question
Other Data Management Categories
Meet all Data Management Experts
Become an Expert for this site


Tips, expert advice and sample chapters
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


>
QUESTION POSED ON: 08 January 2008
Do you have any information about data warehouse testing?

>
EXPERT RESPONSE
This is an excellent question because, as we all know, testing is vital in any development project. The bigger the project is, the more important the testing becomes -- and data warehouses are usually large projects.

So the answer is "No, I don't really have any specific information about data warehouse testing."

"What!? It's very important, but you don't know anything about it?! And you claim to be an expert!?"

Ah no, I didn't say I didn't know; I said I didn't have any specific information about it. The reason is that while the design of a data warehouse is not the same as other applications, the testing part is pretty similar. So everything that you already know about software (and hardware) testing is applicable here.

So, let's take a specific example, say, an ETL routine. You write it and then you test it. You send it a small block of typical data. Then a bigger block, then a huge block, then a much bigger block than it would normally see in operational terms. You time all the runs and plot a scalability graph. You look for errors in the output.

Then you feed in a block of data that examines all the edge and boundary effects. What happens if a value which is normally a positive integer is zero? Or negative, or has a decimal? What happens if it is 255 or 1000 or 1000000 or 1023 or 1024 or 65535?

And on and on and on. I do realize that I'm not telling you anything new here, this is all part of standard testing. But that's the point; a data warehouse is simply a mix of large database and large application. So we can test it as such.

So the question is still an excellent one because it highlights the fact that we have to test our data warehouses very rigorously. The good news is that we already have most of the skills inhouse and they can be transferred to the data warehouse. This is why most of the classic books on the subject don't have a huge section on testing. For example, Kimball's classic "The data warehouse lifecycle toolkit" has one page on the subject, and that's in the chapter on "Building End User Applications".

Incidentally, if you think that those binary boundaries (like 1024, 1024) don't matter anymore in modern software, check out the recent stories about the bug in Excel 2007 with 65,535.....

More on data warehouses

  • The data warehouse toolkit
  • Demystifying data warehouse appliances
  • Data warehouse development: Four strategic steps
  • Gartner data warehouse DBMS Magic Quadrant 2007: New tools, old mantras


  • Sound Off! -   Be the first to post a message to Sound Off!


    Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


    RELATED CONTENT
    DBMS and data warehousing
    Software developer career growth
    How to evaluate data warehouse software in five steps
    Data warehousing, data mining and data querying: Terms and definitions
    The difference between data definition language (DDL) and data manipulation language (DML)
    What is an operational data store vs. a data warehouse?
    Can a dimension table be a fact table for another data mart?
    Top three database management system (DBMS) trends
    Can I have two data warehouses?
    Database administrator job roles: Organizing the DBAs
    Data modeling for data warehouse projects

    Data warehouse management
    Microsoft to acquire data warehouse appliance specialist DATAllegro
    On-demand business intelligence takes aim at the midmarket
    Data warehousing five-year outlook: Technology trends and tips
    Do business intelligence tools require a data warehouse?
    How to evaluate data warehouse software in five steps
    Data quality management for data warehouses
    On-demand data warehousing has arrived, but will customers embrace cloud-based business intelligence?
    Data warehouse software: Top five headlines
    Data warehousing, data mining and data querying: Terms and definitions
    What is an operational data store vs. a data warehouse?

    RELATED GLOSSARY TERMS
    Terms from Whatis.com − the technology online dictionary
    data modeling  (SearchDataManagement.com)
    extract, transform, load  (SearchDataManagement.com)
    OLAP  (SearchDataManagement.com)
    tree structure  (SearchDataManagement.com)

    RELATED RESOURCES
    2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
    Search Bitpipe.com for the latest white papers and business webcasts
    Whatis.com, the online computer dictionary



    Search and Browse the Expert Answer Center
    Search and browse more than 25,000 question and answer pairs from more than 250 TechTarget industry experts.
    Browse our Expert Advice

    About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
    SEARCH 
    TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

    TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




    All Rights Reserved, Copyright 2005 - 2008, TechTarget | Read our Privacy Policy
      TechTarget - The IT Media ROI Experts