Data Mining Query Language (DMQL) – For Knowledge Discovery
Free Machine Learning courses with 130+ real-time projects Start Now!!
In this Data Mining Tutorial, we will study Data Mining Query Language. As we will study every concept related to Query Language in Data Mining (DMQL), First we will study Query Language, Data Mining Query Language motivation and design.
Further, will learn the syntax for every task and specification. Moreover, we will cover standardization and Query Language purposes.
Introduction to Data Mining Query Language
It was proposed by Han, Fu, Wang, et al. for the DBMiner data mining system. Although, it was based on the structured Data Mining Query Language. These query languages are designed to support ad hoc and interactive data mining. Also, it provides commands for specifying primitives.
We can use Data Mining Query Language to work with databases and data warehouses as well. We can also use it to define data mining tasks. Particularly we examine how to define data warehouses and data marts in DMQL.
Motivation to Data Mining Query Language
It can provide the ability to support ad-hoc and interactive data mining.
Syntax of Data Mining Query Language
Syntax of DMQL for specifying task-relevant data
use database database_name
or
use data warehouse data_warehouse_name in relevance to att_or_dim_list from relation(s)/cube(s) [where condition] order by order_list group by grouping_list
Syntax – Specifying Kind of Knowledge
Syntax for Characterization, Discrimination, Association, Classification, and Prediction.
a. Data Mining Characterization
The syntax for characterization is −
mine characteristics [as pattern_name] analyze {measure(s) }
The analyze clause, specifies aggregate measures, such as count, sum, or count%
b. Data Mining Discrimination
The syntax for Discrimination is −
mine comparison [as {pattern_name]} For {target_class } where {t arget_condition } {versus {contrast_class_i } where {contrast_condition_i}} analyze {measure(s) }
c. Data Mining Association
The syntax for Association is−
mine associations [ as {pattern_name} ] {matching {metapattern} }
d. Data Mining Classification
The syntax for Classification is −
mine classification [as pattern_name] analyze classifying_attribute_or_dimension
e. Data Mining Prediction
The syntax for prediction is −
mine prediction [as pattern_name] analyze prediction_attribute_or_dimension {set {attribute_or_dimension_i= value_i}}
Syntax – Concept Hierarchy Specification
We use the following syntax to specify concept hierarchies−
use hierarchy <hierarchy> for <attribute_or_dimension>
We use different syntaxes to define different types of hierarchies such as−
-schema hierarchies
define hierarchy time_hierarchy on date as [date,month quarter,year]
– set-grouping hierarchies
define hierarchy age_hierarchy for age on customer as
level1: {young, middle_aged, senior} < level0: all
level2: {20, …, 39} < level1: young
level3: {40, …, 59} < level1: middle_aged
level4: {60, …, 89} < level1: senior
-operation-derived hierarchies
define hierarchy age_hierarchy for age on customer
as {age_category(1), …, age_category(5)}
:= cluster(default, age, 5)< all(age)
-rule-based hierarchies
define hierarchy profit_margin_hierarchy on item as
level_1: low_profit_margin < level_0: all
if (price – cost)< $50
level_1: medium-profit_margin < level_0: all
if ((price – cost) > $50) and ((price – cost) ≤ $250))
level_1: high_profit_margin < level_0: all
Syntax – Interestingness Measures Specification
Interestingness measures and thresholds can be specified by the user with the statement −
with <interest_measure_name> threshold = threshold_value
Syntax – Pattern Presentation & Visualization Specification
We have a syntax, which allows users to specify the display of discovered patterns in one or more forms.
display as <result_form>
Data Mining Languages Standardization
This will serve the following purposes −
- Basically, it helps the systematic development of data mining solutions.
- Also, improves interoperability among multiple data mining systems and functions.
- Generally, it helps in Promoting education and rapid learning.
- Also, promotes the use of data mining systems in industry and society.
Purposes – Data Mining Query
Data Mining Queries are useful for many purposes are:
- Apply the model to new data, to make single or multiple predictions. You can provide input values as parameters, or in a batch.
- Get a statistical summary of the data used for training.
- Extract patterns and rule of the typical case representing a pattern in the model.
- Extract regression formulas and other calculations that explain patterns.
- Get the cases that fit a particular pattern.
- Retrieve details about individual cases used in the model. Also, it includes data not used in an analysis.
- Retrain a model by adding new data, or perform cross-prediction.
Conclusion
As a result, we have studied Data mining Query Language for Knowledge Discovery and it’s all related concepts. Also, we have covered all syntax’s specification and standardization along with Query Language purposes.
I hope this blog will help you to understand the concept of Data Mining Query Language (DMQL). Furthermore, if you feel any query feel free to ask in a comment section.
We work very hard to provide you quality material
Could you take 15 seconds and share your happy experience on Google