Revision as of 09:01, 13 December 2011 by Lee832 (Talk | contribs)

Back to Daniel's Honor Project


On Principal Component Analysis (PCA)

The other day, my friend E.E major friend, A.G., asked me if I knew what eigenvalues and eigenvectors were. I said no, because I didn't know what they were at the time. A.G. was working on his research project for detecting and characterizing movements of mice using Xbox kinect and wanted to understand and use principal component analysis (PCA) for image processing. He had not taken linear algebra yet; I was in the middle of taking the course. I promised him that I will tell him once I learn it in class.

Now, at the conclusion of my linear algebra course, I had acceptable understanding of linear algebra, elementary understanding of eigenvalues and eigenvectors, and, unfortunately, no idea of the utility of what I learned in the field of image processing. But, ideally, I now do have the background to understand PCA. I have spent a little time trying to understand what is achieved by applying PCA on a particular data. Although my interpretation of how PCA work is prone to errors, I hope the lack of sophistication in my language will render process of PCA understandable(without being entirely inaccurate).

I've relied heavily on the article by Jonathon Shlens's tutorial on PCA, inheriting his tutorial structure and examples. His article itself is a very readable and comprehensive one and can be accessed by clicking here.


Principal Component Analysis is a process of extracting useful information from a noisy data set. Complex data sets may hid significant relationship between variables, and the aim of PCA is to extricate those relevant information shrouded by unimportant data. So, really, its application is not limited to image processing.

What is a noisy, complex, clouded data?

Suppose we have an oscillating ideal spring with a mass attached at its tip. This spring oscillates in one direction, namely in the x-direction, at a set frequency. We live in a three dimensional world, so we decide to set up three cameras in a rather random directions to collect the data on the motion of the spring.

Spring.png

This is not a brilliant idea. If we recall from our introductory physics course, the motion of the spring will always be in one-direction - the x-direction. So the entire dynamic of the spring system could be expressed with one variable, x. In lieu of three cameras recording the motion of the spring with respect to its axis, one camera positioned orthogonal to the motion of the spring would have sufficed.

Spring2.png

In our hypothetical experiment, we record the x and y position of the spring every 0.1 second for 10 seconds, making 100 observations.

In the case of one camera positioned orthogonal to the spring's motion, one observation can be expressed as a matrix:

$ O_{n}=\left(\begin{array}{cccc}x_{n}\\y_{n}\end{array}\right) $

Since we would expect the value of x or y to be zero for all 100 observations (spring moves in only one direction), what we really have is data with one dimension. We can used the data to express the dynamic of the system with one variable, as we have expected.

What of the data collected via the dull positioning of the three cameras? Each camera will collect data in its own x,y direction. Each observation will assume the form:

$ O_{n}=\left(\begin{array}{cccc}x_{An}\\y_{An}\\x_{Bn}\\y_{Bn}\\x_{Cn}\\y_{Cn}\end{array}\right) $

This isn't such a friendly data, but it captures the dynamic of the spring's movement, albeit clumsily. Since we prefer to describe the dynamic of the spring system in simpler terms, how can use this data set to write the motion of spring in terms of just one variable, x? That is, how can re-express our data set so that the simple dynamics of the system is revealed?


Back to MA265 Fall 2011 Prof. Walther

Back to MA265

Alumni Liaison

Meet a recent graduate heading to Sweden for a Postdoctorate.

Christine Berkesch