Overview of Data Mining

Posted by

What is Data Mining?

Data mining is the process of discovering useful correlations hidden among vast amounts of data, extracting actionable information for future use, and utilizing it for decision-making.

Doosan Encyclopedia

The term “Data Mining” does not accurately represent all the core components of mining.

Therefore, some people also use the term “Knowledge Discovery from Data(KDD). KDD encompasses the entire process of discovering useful knowledge from data, including data preprocessing, data mining, and post-processing of discovered patterns.

Although KDD is a term that conveys the meaning more clearly, we will continue to use the term “Data Mining” which is already widely used and has a well-established meaning, in this context.

The Process of Data Mining

Data mining involves repeating the following steps in sequence:

  1. Data Cleaning: Removing noise and inconsistent data
  2. Data Integration: Combining data from various sources
  3. Data Selection: Retrieving relevant data for analysis tasks from the database
  4. Data Transformation: Summarizing or aggregating data to transform and integrate it into a form suitable for mining analysis
  5. Data Mining: Applying intelligent analysis methods to extract data patterns
  6. Pattern Evaluation: Identifying patterns that represent knowledge based on interest
  7. Knowledge Presentation: Utilizing visualization and knowledge representation techniques to present mined knowledge to users

Leave a Reply

이메일 주소는 공개되지 않습니다. 필수 필드는 *로 표시됩니다