Saturday, March 12, 2011

What is data mining?

Data mining is the process of finding patterns, or discovering knowledge, in large amounts of data. Knowledge discovery is usually described as a sequence of steps, of which the mining itself is only one:


Step 1: Data cleaning (noise and inconsistent data are removed)
Step 2: Data integration (multiple data sources are combined)
Step 3: Data selection (the data relevant to the analysis is retrieved)
Step 4: Data transformation (data is transformed or consolidated into a form suitable for mining, for example by summary or aggregation operations)
Step 5: Data mining (intelligent methods are applied to extract data patterns)
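Step 6: Pattern evaluation (the truly interesting patterns are identified using interestingness measures)
Step 7: Knowledge presentation (the mined knowledge is presented to the user; this is the presentation layer)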
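
To make the seven steps concrete, here is a minimal Python sketch that runs them on a tiny shopping-basket example. Everything in it is an assumption made for illustration and is not taken from the book: the toy records in store_a and store_b, the function names, and the choice of counting co-occurring item pairs as the "intelligent method" in step 5. The support threshold of 0.5 in step 6 is equally arbitrary.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: purchase records from two separate sources.
store_a = [
    {"basket": 1, "item": "bread"},
    {"basket": 1, "item": "milk"},
    {"basket": 2, "item": "bread"},
    {"basket": 2, "item": None},       # noisy record (missing item)
]
store_b = [
    {"basket": 3, "item": "bread"},
    {"basket": 3, "item": "milk"},
    {"basket": 3, "item": "eggs"},
    {"basket": 4, "item": "MILK "},    # inconsistent formatting
]


def clean(records):
    """Step 1: drop records with a missing item and normalize the text."""
    return [
        {**r, "item": r["item"].strip().lower()}
        for r in records
        if r.get("item")
    ]


def integrate(*sources):
    """Step 2: combine several data sources into one list."""
    return [r for source in sources for r in source]


def select(records):
    """Step 3: keep only the fields relevant to the analysis."""
    return [(r["basket"], r["item"]) for r in records]


def transform(pairs):
    """Step 4: consolidate the rows into one item set per basket."""
    baskets = {}
    for basket, item in pairs:
        baskets.setdefault(basket, set()).add(item)
    return baskets


def mine(baskets):
    """Step 5: count how often each pair of items is bought together."""
    counts = Counter()
    for items in baskets.values():
        counts.update(combinations(sorted(items), 2))
    return counts


def evaluate(counts, n_baskets, min_support=0.5):
    """Step 6: keep only pairs whose support reaches the threshold."""
    return {
        pair: count / n_baskets
        for pair, count in counts.items()
        if count / n_baskets >= min_support
    }


def present(patterns):
    """Step 7: format the surviving patterns for a human reader."""
    return [
        f"{a} + {b}: support {support:.2f}"
        for (a, b), support in sorted(patterns.items())
    ]


def run_pipeline():
    records = integrate(clean(store_a), clean(store_b))
    baskets = transform(select(records))
    patterns = evaluate(mine(baskets), len(baskets))
    return present(patterns)


if __name__ == "__main__":
    for line in run_pipeline():
        print(line)
```

With this toy data, only the pair bread and milk appears in at least half of the four baskets, so it is the only pattern that survives step 6. In a real project each function would be replaced by something far more substantial, but the order of the steps stays the same.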


References:
Jiawei Han and Micheline Kamber
Data Mining: Concepts and Techniques, 2nd ed.
The Morgan Kaufmann Series in Data Management Systems, Jim Gray, Series Editor
Morgan Kaufmann Publishers, March 2006. ISBN 1-55860-901-6

Web address for further study:
1. http://en.wikipedia.org/wiki/Data_mining
