PCA in face recognition

When used in face recognition, the principal components are called eigenfaces. Let's see in more detail how they are used to perform face recognition.

We define a training vector as the vector of dimensions $1 \times n$ obtained by flattening one image from the training set into a single row of $n$ pixel values.

We can form the matrix of training vectors of dimensions $m \times n$ , where $m$ is the number of images in the training set.

We can create the mean vector $\bar{x}$ from the matrix of training vectors by averaging over all of its rows. The mean vector has dimensions $1 \times n$ .

$$\bar{x} = \frac{1}{m} \sum_{i=1}^{m} x_{i}$$

The mean vector can be represented as an image, and can be seen as the face that has the most common traits among all the faces in the training set.
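The construction of the training matrix and the mean vector can be sketched with NumPy. This is a minimal illustration, assuming tiny random arrays in place of real face images; the names `images`, `X`, `mean_vector` and `mean_face` are hypothetical and not part of the original note.

```python
import numpy as np

# Hypothetical toy data: m = 4 "images" of 3 x 2 pixels, so n = 6.
rng = np.random.default_rng(0)
images = rng.integers(0, 256, size=(4, 3, 2)).astype(np.float64)

# Flatten each image into a 1 x n training vector and stack them
# into the m x n matrix of training vectors.
X = images.reshape(images.shape[0], -1)

# Average over the rows (axis 0) to obtain the mean vector of length n.
mean_vector = X.mean(axis=0)

# Reshape the mean vector back to the image shape to view it as a "mean face".
mean_face = mean_vector.reshape(images.shape[1:])
```

Reshaping back to the image shape is what allows the mean vector to be displayed as a face.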

We can compute the covariance matrix:

$$C = \frac{1}{m} \sum_{i=1}^{m} (x_{i} - \bar{x}) (x_{i} - \bar{x})^{T}$$

The covariance matrix $C$ has dimensions $n \times n$ (treating each centered training vector as a column vector in the outer product), so each of its columns can itself be represented as an image.
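As a sketch of the covariance step, the sum of outer products can be written as a single matrix product on the centered training matrix. The function name `covariance_matrix` and the random test data are assumptions made for illustration only.

```python
import numpy as np

def covariance_matrix(X):
    """Return the n x n covariance matrix of the rows of X (1/m normalization)."""
    m = X.shape[0]
    # Subtract the mean vector from every training vector.
    centered = X - X.mean(axis=0)
    # centered.T @ centered equals the sum over i of the outer products
    # (x_i - mean)(x_i - mean)^T, with each centered x_i taken as a column.
    return centered.T @ centered / m

# Hypothetical data: m = 8 training vectors with n = 5 pixels each.
rng = np.random.default_rng(1)
X = rng.normal(size=(8, 5))
C = covariance_matrix(X)
```

The result is symmetric by construction, which matters when computing its eigenvectors.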

What we want now is to reduce the dimensionality from $n$ to $k$ , where $k$ is the smallest number of dimensions that preserves most of the information.

Once we have $C$ , we compute its eigenvectors, which will be our eigenfaces. An eigenvector $v$ is a nonzero vector such that $C v = λ v$ for some scalar $λ$ , the corresponding eigenvalue. Rewriting the equation as $(C - λ I) v = 0$ , where $I$ is the identity matrix, we see that a nonzero solution $v$ exists only when $∣ C - λ I ∣ = 0$ , so the eigenvalues are the roots of this equation.

Once we have the eigenvalues $λ_{1}, λ_{2}, \ldots$ , we substitute each of them into the original equation $C v = λ v$ and solve for the corresponding eigenvector. Source
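In practice the characteristic equation is rarely solved by hand. The sketch below, which is an assumption about tooling and not the method of the original note, uses `numpy.linalg.eigh`, a routine for symmetric matrices such as $C$ , and then checks that every returned pair satisfies $C v = λ v$ . The data is random and purely illustrative.

```python
import numpy as np

# Hypothetical data: m = 10 training vectors with n = 5 pixels each.
rng = np.random.default_rng(0)
X = rng.normal(size=(10, 5))
centered = X - X.mean(axis=0)
C = centered.T @ centered / X.shape[0]

# eigh is intended for symmetric matrices; it returns the eigenvalues in
# ascending order and the matching unit eigenvectors as columns.
eigenvalues, eigenvectors = np.linalg.eigh(C)

# Check C v = lambda v for all pairs at once: column j of `eigenvectors`
# is scaled by eigenvalues[j] through broadcasting.
residual = C @ eigenvectors - eigenvectors * eigenvalues
```

Each column of `eigenvectors` has length $n$ , so it can be reshaped to the image size and viewed as an eigenface.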

In order to obtain a $k$ -dimensional subspace, we sort the eigenvectors by eigenvalue and keep the $k$ eigenvectors that correspond to the $k$ largest eigenvalues of $C$ . The projection matrix $ϕ_{k}$ is built using these $k$ eigenvectors as columns, and has dimensions $n \times k$ .
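The note does not say how to pick $k$ . One common heuristic, shown here as an assumption and not as the author's method, is to keep the smallest $k$ whose eigenvalues account for a chosen fraction of the total variance. The function name `choose_k` and the default `threshold` of 0.95 are hypothetical.

```python
import numpy as np

def choose_k(eigenvalues, threshold=0.95):
    """Smallest k whose top-k eigenvalues reach `threshold` of the total variance."""
    # Sort in descending order; clip tiny negative values caused by round-off.
    values = np.clip(np.sort(eigenvalues)[::-1], 0.0, None)
    cumulative = np.cumsum(values) / np.sum(values)
    # Index of the first cumulative ratio that reaches the threshold, plus one.
    k = int(np.searchsorted(cumulative, threshold)) + 1
    # Guard against round-off pushing k past the number of eigenvalues.
    return min(k, len(values))

# Hypothetical eigenvalues: cumulative ratios are 0.5, 0.8, 0.9, 1.0.
example = np.array([1.0, 5.0, 3.0, 1.0])
k_85 = choose_k(example, threshold=0.85)
k_95 = choose_k(example, threshold=0.95)
```

A lower threshold gives a smaller subspace at the cost of discarding more of the variance in the training set.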

The projection of a vector $x$ onto the new $k$ -dimensional subspace is defined as follows:

$$\operatorname{Proj}(x) = ϕ_{k}^{T} (x - \bar{x}) \quad \text{where} \quad \operatorname{Proj} : \mathbb{R}^{n} \to \mathbb{R}^{k}$$

The result is the new feature vector for the image.
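Putting the previous steps together, the projection can be sketched as follows. The function names `fit_eigenfaces` and `project`, and the random data, are hypothetical; the eigenvectors are stored as columns of an $n \times k$ matrix so that its transpose maps a vector of length $n$ to a vector of length $k$ .

```python
import numpy as np

def fit_eigenfaces(X, k):
    """Return the mean vector and the n x k projection matrix for training matrix X."""
    mean_vector = X.mean(axis=0)
    centered = X - mean_vector
    C = centered.T @ centered / X.shape[0]
    eigenvalues, eigenvectors = np.linalg.eigh(C)
    # Indices of the k largest eigenvalues, largest first.
    order = np.argsort(eigenvalues)[::-1][:k]
    phi_k = eigenvectors[:, order]
    return mean_vector, phi_k

def project(x, mean_vector, phi_k):
    """Compute phi_k^T (x - mean) for one vector x of length n."""
    return phi_k.T @ (x - mean_vector)

# Hypothetical data: m = 12 training vectors with n = 6 pixels, reduced to k = 2.
rng = np.random.default_rng(2)
X = rng.normal(size=(12, 6))
mean_vector, phi_k = fit_eigenfaces(X, k=2)
features = project(X[0], mean_vector, phi_k)
```

Here `features` has length $k$ and plays the role of the feature vector for the first training image.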

The basic idea of PCA for face recognition is that, instead of comparing faces pixel by pixel, a technique that gives unreliable results, we can compare the images in the new $k$ -dimensional subspace. We only need to compare the projection coefficients $\operatorname{Proj}(x_{i})$ corresponding to each image $i$ .

To find the face most similar to a new image $x^{'}$ , we compute $\operatorname{Proj}(x^{'})$ and calculate its distance to the feature vector of every image in the gallery. The most similar face is the one whose feature vector is the least distant from $\operatorname{Proj}(x^{'})$ .
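The matching step amounts to a nearest-neighbour search over the gallery feature vectors. The note does not name a distance measure, so the sketch below assumes Euclidean distance; the function name `nearest_face` and the toy feature vectors are hypothetical.

```python
import numpy as np

def nearest_face(probe_features, gallery_features):
    """Return (index, distance) of the gallery row closest to the probe features."""
    # Euclidean distance (an assumption) from the probe to every gallery vector.
    distances = np.linalg.norm(gallery_features - probe_features, axis=1)
    best = int(np.argmin(distances))
    return best, float(distances[best])

# Hypothetical feature vectors with k = 2, one row per gallery image.
gallery_features = np.array([[0.0, 0.0],
                             [3.0, 4.0],
                             [10.0, 10.0]])
probe_features = np.array([2.5, 4.0])

best_index, best_distance = nearest_face(probe_features, gallery_features)
```

In this toy example the second gallery entry (index 1) is the closest one, at distance 0.5.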

