Data naming and abbreviation standards for BI applications provide consistency and
a common look and feel useful for both developers and business people. Proven
standards can be applied (such as the convention of name compositions using prime,
qualifier or modifier, and class words), or new organization-specific standards can be
created. The data administration group usually has been trained in the various
industry-standard naming conventions.
Abbreviations are part of naming standards, but they apply only to physical names
(e.g., column names, table names, program names), not to business names. The
organization should publish a standard enterprise-wide abbreviations list that
includes industry-specific and organization-specific acronyms. Every BI project team
should use these abbreviations and acronyms.
Meta Data Capture
Meta data is a world unto itself. Large amounts of descriptive information can be
collected about business functions, business processes, business data objects,
business data elements, business rules, data quality, and other architectural
components. The organization needs standards or guidelines that govern who
captures which meta data components and how, when, and where to capture them.
The meta data repository should be set up in such a way that it supports the
standards for meta data capture and usage.
Logical Data Modeling
Logical data modeling is a business analysis technique (not to be confused with
logical database design). Every business activity or business function uses or
manipulates business data in some fashion. A logical data model documents those
logical data relationships irrespective of how the functions or the data are
implemented in the physical databases and applications.
Project-specific logical data models should be merged into one cohesive, integrated
enterprise logical data model. This activity usually is—and should be—included in the
job description for the data administration department, which may be part of the
enterprise architecture group. The enterprise logical data model is the baseline
business information architecture into which physical systems (operational or
decision-support, including BI applications) are mapped. The organization should
establish standards for creating the project-specific logical data models for BI
projects and for merging the models into the enterprise logical data model.
Data Quality
Information can be only as good as the raw data on which it is based. Most
organizations have a lot of dirty data—too much to cleanse it all. Each organization
must establish guidelines about triaging (categorizing and prioritizing) dirty data for
cleansing. In addition, the organization must create standards that define acceptable
quality thresholds and specify how to measure data quality during database loads.
Instructions for error handling and suspending dirty data records should also be part
of the standards.