News
|
Best Practices in Data Resource Management
Takeaway notes from ”The Learning Curve” webinar. By Aliza Bornstein, copywriter, Melissa Data
Many people today still confuse the words ‘data,’
‘information,’ and ‘knowledge.’
‘Data’ is the individual facts that are out of
context with little meaning, such as “$123.15.”
‘Data in context’ are individual facts that are
wrapped with some type of meaning, such as “123.15
is the account balance at noon on January 12, 2001.”
‘Information’ is a set of data in context, relevant
to one or more people at some point in time.
Information must have a relevant component, and a
time component, to the information.
‘Knowledge’ is information retained by individuals
and then combined with experience.
As we look at this sequence of definitions, it’s
obvious that data is the foundation, and that’s what
we have to manage.
Business information demand is an organization’s
continuously increasing, constantly changing need
for current, accurate, integrated information, often
on short notice, to support its business activities.
The impact we see on most business today is that
this information demand is not being met. This is
happening in both public and private sector
businesses. Business decisions are less informed,
both citizens and customers are being impacted, and
opportunities are being missed.
Looked at in detail, the problems are high
quantities of disparate data in most organizations.
The data is not thoroughly understood, the data
source is not formally managed or readily available,
and
data quality is low.
Disparate data is data that is essentially not
alike, or is distinctly different in kind, quality
or character. They are unequal and cannot be readily
integrated to adequately meet the business
information demand.
There are four basic problems found in disparate
data:
1) Unknown Data Existence
Organizations at large are not aware of all data at
their disposal, and usually are not inventoried.
2) Unknown Data Meaning
Content and meaning of data are not fully known in
the organization, or thoroughly understood.
3) High Data Redundancy
Data is highly redundant and inconsistent across an
organization. The average redundancy factor of ten
for any organization that’s of any size, that’s been
in business for any length of time. That means that
each business fact exists, on the average, ten times
throughout the data resource.
4) High Data Variability
Data is highly variable in format and content. In
large organizations, there’s an average factor of 15
to 20 in the data variability for any one business
fact.
That’s the good news! It gets worse as we look
further…
An organization now has disparate data resource, and
that’s a data resource substantially composed of
disparate data, that’s disintegrated, and not
subject-oriented. It’s a state of disarray, where
the low quality does not, and cannot, adequately
support the business information demand.
Disparate data cycle is a self-perpetuating cycle
where disparate data continue to be produced at an
ever-increasing rate, because people do not know
about existing data, or do not want to use existing
data.
That’s the bad news!
What we must do now is break the disparate data
cycle, stop the spiraling disparity, and create a
high quality, sharable data resource.
We need to begin an initiative to thoroughly
understand, formally manage, and fully utilize all
data that are available to an organization, within
one common organization-wide data architecture.
We have to stop the burgeoning data disparity and
resolve the existing data disparity through
integration.
We must take the initiative now!
To start on that approach, we have to define what we
mean by “quality.”
Data resource quality is a measure of how well the
data resource supports the current and the future
business information demand.
Data quality is a subset of overall data resource
quality that deals just with the data values.
The ultimate data resource quality is a data
resource that is stable across changing business and
changing technology, so it continues to support the
current and future business information demand.
Information quality is a measure of the ability to
get the right data, to the right
people, in the right place, at the right time, in
the right form, so they can make the right
decisions, and take the right actions.
Remember: An intelligent learning organization needs
a high-quality, stable data resource!
---Source: The eLearning Curve data
quality series, hosted by Eric Kavanagh and
conducted by Mike Brackett, October 2, 2009 (www.elearningcurve.com).
|
|
|
Melissa Data
|
 |

| Enhance your
website, software or database with
easy-to-integrate data quality programming tools
and web services. |
|
|
|
|
 |

|
Save money on postage using leading
mail preparation software and other
direct marketing products. |
|
|
|
|
 |

Update & standardize addresses and
find out more about contacts in your
database.
|
|
|
|
|
 |

Find new customers perfect for your
business with our online and
specialty mailing lists.
|
|
|
|
|
 |

Locate the business information you
need such as ZIP Codes, address
verification, maps.
|
|
|
|
|

Download
your free copy of the Melissa Data product catalog.
|
|