Because data are typically collected from multiple sources, they must be reviewed to eliminate duplicates. To this end, the Postal Service rationalizes and classifies spending data elements and attributes.
Typical cleansing needs that should be noted include:
Tasks that are evident in this process include compiling data into database tools, categorizing purchased items, identifying gaps and inconsistencies, and producing high-level spend baseline and priority listings.
Each system containing purchasing data likely uses different data structures and nomenclature by item, commodity, category, and supplier. As a result, after the data are refined, they should then be classified to remove errors and inconsistencies and to create a standard language for useful analysis.
Examples of classification schemes include:
This analysis requires consolidation of data from multiple sources and varied formats into structured and manageable database(s).