Professional Documents
Culture Documents
Laggards
Existing data mining systems are too generic Need business-specific data mining solutions and smooth integration of business logic with data mining functions
Data mining will surely be an important tool for managers decision making
The amount of the available data is increasing, and data mining systems will be more affordable Multiple personal uses
Mine your family's medical history to identify geneticallyrelated medical conditions Mine the records of the companies you deal with Mine data on stocks and company performance, etc. Build data mining functions into many intelligent tools
Big Brother, Big Banker, and Big Business are carefully watching you Profiling information is collected every time
credit card, debit card, supermarket loyalty card, or frequent flyer card, or apply for any of the above You surf the Web, rent a video, fill out a contest entry form, You pay for prescription drugs, or present you medical care number when visiting the doctor
Collection of personal data may be beneficial for companies and consumers, there is also potential for misuse
International guidelines for data privacy protection Cover aspects relating to data collection, purpose, use, quality, openness, individual participation, and accountability Purpose specification and use limitation Openness: Individuals have the right to know what information is collected about them, who has access to the data, and how the data are being used
Application exploration
development of application-specific data mining system Invisible data mining (mining as built-in function) Constraint-based mining: use of constraints to guide data mining systems in their search for interesting patterns
Integration of data mining with database systems, data warehouse systems, and Web database systems
This will ensure data availability, data mining portability, scalability, high performance, and an integrated information processing environment for multidimensional data analysis and exploration.
A standard will facilitate systematic development, improve interoperability, and promote the education and use of data mining systems in industry and society The systematic study and development of visual data mining techniques will facilitate the promotion and use of data mining as a tool for data analysis.
More research is required towards the integration of data mining methods with existing data analysis techniques for the complex types of data
Web mining
Web content mining, Weblog mining, and data mining services on the Internet will become one of the most important and flourishing subfields in data mining. Traditional data mining methods, designed to work at a centralized location, do not work well in many of the distributed computing environments present today (e.g., the Internet, intranets, local area networks, high-speed wireless networks, and sensor networks). Many applications involving stream data (such as e-commerce, Web mining, stock analysis, intrusion detection, mobile data mining, and data mining for counterterrorism) require dynamic data mining models to be built in real time. Additional development is needed in this area.
An abundance of recorded personal information available in electronic forms and on the Web, coupled with increasingly powerful data mining tools, poses a threat to our privacy and data security. Growing interest in data mining for counterterrorism also adds to the threat. Graph mining, link analysis, and social network analysis are useful for capturing sequential, topological, geo- metric, and other relational characteristics of many scientific data sets and social data sets Most data mining approaches search for patterns in a single relational table or in a single database.