Data Mining - Mehmed Kantardzic [268]
The VLDB conference is a premier annual international forum for data management and database researchers, vendors, practitioners, application developers, and users. The conference features research talks, tutorials, demonstrations, and workshops. It covers current issues in data management, databases, and information systems research. Data management and databases remain among the main technological cornerstones of emerging applications of the twenty-first century.
A.3 DATA-MINING FORUMS/BLOGS
1. KDnuggets Forums
http://www.kdnuggets.com/phpBB/index.php
Good resource for sharing experience and asking questions.
2. Data Mining
http://dataminingwarehousing.blogspot.com/
This blog is helpful for data-mining beginners. It presents basic data-mining concepts with examples and applications.
3. Data Mining and Predictive Analytics
http://abbottanalytics.blogspot.com/
The posts on this blog cover topics related to data mining and predictive analytics from the perspectives of both research and industry.
4. AI, Data Mining, Machine Learning, and Other things
http://blog.markus-breitenbach.com/
This blog discusses machine learning with emphasis on AI and statistics.
5. Geeking with Greg
http://glinden.blogspot.com
This blog focuses on the topic of personalization and related research.
6. Data Miners Blog
http://blog.data-miners.com/
The posts on this blog provide industry-oriented reflections on topics in data analysis and visualization.
7. Data-Mining Research
http://www.dataminingblog.com/
This blog provides a venue for exchanging ideas and comments about data-mining techniques and applications.
8. Data Wrangling
http://www.datawrangling.com/
This blog provides wide-ranging posts on news and technology related to machine learning and data mining.
9. Intelligent Machines
http://www.damienfrancois.be/blog/
This blog is dedicated to artificial intelligence and machine learning, and focuses on applications in business, science, and everyday life.
10. Mininglabs
http://www.mininglabs.com/
This blog was established by a group of independent French researchers in the fields of data mining, data analysis, and data visualization. They are mostly interested in analyzing data coming from the Internet at large (Web, peer-to-peer networks).
11. Machine Learning (Theory)
http://hunch.net/
A blog dedicated to the various aspects of machine learning theory and applications.
A.4 DATA SETS
This section describes a number of freely available data sets ready for use in data-mining algorithms. We selected a few examples for students who are starting to learn data mining and would like to practice traditional data-mining tasks. A majority of these data sets are hosted on the UCI Machine Learning Repository. For more data sets, see the repository at http://archive.ics.uci.edu/ml/index.html.
A.4.1 Classification
Iris Data Set.
http://archive.ics.uci.edu/ml/datasets/Iris
The Iris Data Set is a small data set often used in machine learning and data mining. It includes 150 data points, each belonging to one of three different kinds of iris. The task is to learn to classify an iris based on four measurements. This data set was used by R. A. Fisher in 1936 as an example for discriminant analysis.
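As a rough illustration of the classification task described above, the following Python sketch parses a handful of records and assigns a new flower to the class with the nearest centroid. It is a minimal sketch, not a recommended method: the six inline rows are an assumed sample in the comma-separated layout of the repository's data file (four measurements followed by a class label), and the names SAMPLE, load_records, class_centroids, and classify are hypothetical helpers introduced only for this example.

```python
import csv
import io
import math
from collections import defaultdict

# Assumed sample: a few records in the comma-separated layout of the
# repository's data file (four measurements, then the class label).
# This is not the full 150-point data set.
SAMPLE = """\
5.1,3.5,1.4,0.2,Iris-setosa
4.9,3.0,1.4,0.2,Iris-setosa
7.0,3.2,4.7,1.4,Iris-versicolor
6.4,3.2,4.5,1.5,Iris-versicolor
6.3,3.3,6.0,2.5,Iris-virginica
5.8,2.7,5.1,1.9,Iris-virginica
"""


def load_records(text):
    """Parse CSV text into a list of (measurements, label) pairs."""
    records = []
    for row in csv.reader(io.StringIO(text)):
        if not row:  # skip blank lines
            continue
        *values, label = row
        records.append(([float(v) for v in values], label))
    return records


def class_centroids(records):
    """Compute the mean measurement vector for each class."""
    grouped = defaultdict(list)
    for features, label in records:
        grouped[label].append(features)
    return {
        label: [sum(column) / len(column) for column in zip(*rows)]
        for label, rows in grouped.items()
    }


def classify(features, centroids):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


records = load_records(SAMPLE)
centroids = class_centroids(records)
print(classify([5.0, 3.4, 1.5, 0.2], centroids))
```

To work with the complete data set, the same load_records function could be applied to the contents of the downloaded data file in place of SAMPLE. A nearest-centroid rule is used here only because it is short; any of the classification methods discussed in the book could be substituted.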
Adult Data Set.
http://archive.ics.uci.edu/ml/datasets/Adult
The Adult Data Set contains 48,842 samples extracted from the U.S. Census. The task is to classify individuals as having an income that does or does not exceed $50,000/year based on factors such as age, education, race, sex, and native country.
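Before a classifier can be trained on such census records, each record has to be split into input attributes and a binary target. The Python sketch below shows one possible way to do that. It rests on several assumptions: the column names follow the attribute list in the repository's documentation, the target column name "income" and the helper parse_record are hypothetical, and the two inline records are illustrative lines in the file's comma-separated layout rather than a verified excerpt.

```python
# Attribute names assumed from the repository's documentation; "income" is a
# hypothetical name chosen here for the class column.
COLUMNS = [
    "age", "workclass", "fnlwgt", "education", "education-num",
    "marital-status", "occupation", "relationship", "race", "sex",
    "capital-gain", "capital-loss", "hours-per-week", "native-country",
    "income",
]
NUMERIC = {
    "age", "fnlwgt", "education-num",
    "capital-gain", "capital-loss", "hours-per-week",
}

# Illustrative records in the assumed layout (not a verified excerpt).
LINES = [
    "39, State-gov, 77516, Bachelors, 13, Never-married, Adm-clerical, "
    "Not-in-family, White, Male, 2174, 0, 40, United-States, <=50K",
    "52, Self-emp-not-inc, 209642, HS-grad, 9, Married-civ-spouse, "
    "Exec-managerial, Husband, White, Male, 0, 0, 45, United-States, >50K",
]


def parse_record(line):
    """Split one record into (attributes, exceeds_50k)."""
    fields = [field.strip() for field in line.split(",")]
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    record = dict(zip(COLUMNS, fields))
    for name in NUMERIC:
        record[name] = int(record[name])
    # Tolerate a trailing period on the label, in case a file variant uses one.
    label = record.pop("income").rstrip(".")
    return record, label == ">50K"


for line in LINES:
    attributes, exceeds_50k = parse_record(line)
    print(attributes["age"], attributes["education"], exceeds_50k)
```

Categorical attributes such as education or native-country are left as strings here; most learning algorithms would need them encoded numerically first, and the choice of encoding is a separate design decision.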
Breast Cancer Wisconsin (Diagnostic) Data Set.
http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29
This data set consists of a number of measurements