Line 21: Line 21:
 
Clustering is a form of unsupervised learning. "Clustering is the problem of identifying groups, or clusters of data points in multidimensional space". For example, consider the following data set:
 
Clustering is a form of unsupervised learning. "Clustering is the problem of identifying groups, or clusters of data points in multidimensional space". For example, consider the following data set:
  
 
+
[[Image:fig1.jpg]]
 
+
In class we covered various forms of supervised learning, such as classification and density estimation. These applications required the use of training data. Supervised learning, as i.e. learning using training data. Clustering, on the other hand is a form of unsupervised learning. We are not given a training set, yet wish to discover some form of underling structure in a dataset.
+
  
 
----
 
----

Revision as of 20:41, 5 May 2014


Introduction to Clustering A slecture by CS student David Runyan


Introduction


In class, we covered the simple Bayesian classifier. This form of classification falls under a category known as supervised learning. What this means is that a set of labelled data data is used to to "train" the underlying model. However, it is not always possible to have such a data set, yet we may still wish to discover some form of underlying structure in an unlabelled data set. Such a task falls under the category of unsupervised learning.

Clustering is a form of unsupervised learning. "Clustering is the problem of identifying groups, or clusters of data points in multidimensional space". For example, consider the following data set:

Fig1.jpg




Back to ECE662, Spring 2014

Alumni Liaison

Basic linear algebra uncovers and clarifies very important geometry and algebra.

Dr. Paul Garrett