
As you can see, the raw infor-
mation entered on the shop floor
is barely English. Figure 10.23
shows a cleaned-up version of
thesametext.
Even the cleaned-up version
is difficult to read. The com-
panies paying out warranty
claims want each claim cate-
gorized in various ways, to track
what problems are occurring.
One option is to hire many peo-
ple to read the claims and deter-
mine how each claim should be
categorized. Categorizing the
claims manually is tedious
work. A more viable option,
developed in the last few years,
is to apply a software solution.
Figure 10.24 shows some of the information
that can be gleaned automatically from the
text in Figure 10.22.
The software processes the text, determin-
ing the concepts likely represented in the text.
This is not a simple word search. Synonyms
map to the same concept. Some words map
to different concepts depending on the con-
text. The software uses an ontology that relates
words and concepts to each other. After each
warranty is categorized in various ways, it
becomes possible to obtain useful aggregate
information, as shown in Figure 10.25.
Summary
Data warehousing, OLAP, and data mining are three
areas of computer science that are tightly interlinked and
marketed under the heading of business intelligence. The
functionalities of these three areas complement each other.
Data warehousing provides an infrastructure for storing
and accessing large amounts of data in an efficient and
user-friendly manner. Dimensional data modeling is the
7 DD40 BASC 54566 CK OUT AC INOP PREFORM PID CK CK PCM
PID ACC CK OK OPERATING ON AND OFF PREFORM POWER AND
GRONED CK AT COMPRESOR FONED NO GRONED PREFORM
PINPONT DIAG AND TRACE GRONED FONED BAD CO NECTION
AT S778 REPAIR AND RETEST OK CK AC OPERATION
Figure 10.22 Example
verbatim description in
warranty claim.
Courtesy of
Ubiquiti Inc.
7 DD40 Basic 54566 Check Out Air Conditioning Inoperable Perform PID
Check Check Power Control Module PID Accessory Check OK Operating
On And Off Perform Power And Ground Check At Compressor Found No
Ground Perform Pinpoint Diagnosis And Trace Ground Found Bad
Connection At Splice 778 Repair And Retest OK Check Air Conditioning
Operation
Figure 10.23 Cleaned-up
version of description in
warranty claim.
Courtesy of
Ubiquiti Inc.
Automated Coding
Primary Group: Electrical
Subgroup: Climate Control
Part: Connector 1008
Problem: Bad Connection
Repair: Reconnect
Location: Engin. Cmprt.
Confidence
90%
85%
93%
72%
75%
90%
Figure 10.24 Useful
information extracted from
verbatim description in
warranty claim.
Courtesy of
Ubiquiti Inc.
Chapter 10 BUSINESS INTELLIGENCE 229