Why do I care?
In data we can visualize relationships in a hierarchical structure. As scientists our goal is to not only understand the data, but more importantly we want to be able to visualize the connections between pieces of information. Manual definition of concept hierarchies can be a tedious and time-consuming task for a user or a domain expert. Fortunately, many hierarchies are implicit within the database schema and can be automatically defined at the schema definition level. The concept hierarchies can be used to transform the data into multiple levels of granularity. For example, data mining patterns regarding sales may be found relating to specific regions or countries, in addition to individual branch locations.
What is a concept?
A group of records that have been assigned a label.
What is a concept hierarchy?
Means generating a hierarchical order among concepts.
Let’s get deeper
1- Ordering of the attributes of the schema level by user or expert.
Let us assume that a set of the following attributes are given:
School, Department, college, school and an expert defines the hierarchy as follows:
Department-> College -> School
This means we have 3 attributes for the above three concepts, and we want to automatically generate hierarchy among the three attributes.
2- Ordering by adding hierarchy within a footstep
For example, college could be further divided into:
Science-oriented
Health-oriented
Humanity-oriented
3- Ordering by set grouping or value grouping
Attribute values for an attribute age could go with the hierarchy among the set of groups:
{20–39}, {40–59}, {60–82}
4- Ordering by decoding operation data in an attribute is a set of emails:
dmbrook@cs.sfu.ca
Concept hierarchies can be created by separating different components of the email data.
Dmbrook->cs->sfu->ca
Concept hierarchies can be created by separating different components of the email data.
5- Ordering by data clustering and data distribution analysis