CS153 Fall 2000 Assignment 2

Harvey Mudd College
Computer Science 153
Assignment 2
Due Sunday, September 17, by midnight

Section 2: Representing Images as Sums of Frequencies

aka, Fourier analysis without the pain ...

Images are usually thought of as sums of individual pixels, weighted by the intensity (or three intensities, for color images) of those pixels. This part of the assignment asks you to investigate an alternative representation of images: as sums of frequency components, each of which is the size of the entire image. You will build some tools that help investigate the connection between the frequency and spatial representations of an image.

Matlab's Fast Fourier Transform

In general, the Fourier transform of an image has complex coefficients. For real-valued images, however, the imaginary parts of these complex frequency coefficients always cancel out when the frequency components are summed. The goal of this first problem is to demonstrate exactly how they cancel out.

Load the grayscale image sf.tif from /cs/cs153/Images/a2/fourier/sf.tif into your matlab workspace as A with

A = imread('/cs/cs153/Images/a2/fourier/sf.tif','tif');

Use the fftShow.m script (in the same place as imframe, /cs/cs153/matlab/visioncode) to show both the image and its frequency representation.

At this point, in your matlab workspace, B is the 256x256 matrix of frequency coefficients that is equivalent to the spatial image A. Because B is a matrix, its rows and columns run from 1 to 256, but it contains the uniform frequency (u=0, v=0) at its center, that is, at (129,129). (The fftshift function in matlab does this. If you look at fftShow.m, you'll see that this was done before the script ended.) What are the values of the B matrix of frequency coefficients for the nine frequency combinations with u = {-1,0,1} and v = {-1,0,1} ? (This is just a submatrix of B -- which submatrix is it?) Looking at these values (you can copy them directly to the html or link them in a file), why do the imaginary frequency components cancel?
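The layout described above can be checked on a small example before you look at the real coefficients. The following is a minimal sketch, assuming an 8x8 stand-in matrix (magic(8), not sf.tif) and hypothetical variable names; it extracts the 3x3 block around the zero frequency and compares each coefficient with the one at the opposite frequency.

```matlab
% Sketch only: magic(8) is a stand-in for a real-valued image.
N = 8;
A = magic(N);
B = fftshift(fft2(A));        % zero frequency moves to (N/2+1, N/2+1)
c = N/2 + 1;                  % center index: 5 when N = 8

% The 3x3 block of coefficients around the zero frequency.
S = B(c-1:c+1, c-1:c+1);

% Rotating the block by 180 degrees swaps the entry for (u,v) with the
% entry for (-u,-v). For a real image the result is conj(S).
symmetryError = max(max(abs(rot90(S, 2) - conj(S))));

% The zero-frequency term is the sum of all pixels, so it is real.
dcImag = imag(S(2, 2));
```

Here symmetryError and dcImag should both be zero up to round-off. The same indexing pattern, with c = 129, applies to a 256x256 coefficient matrix.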

Results:

Removing frequency components

Add the capability to "zero out" parts of the frequency representation -- that is, to set portions of the B matrix to zero. To do this, write a matlab function or script (or, perhaps, more than one) that takes in some representation of the frequencies to be zeroed out, along with the Fourier coefficients B, and then returns a new coefficient matrix, suitably changed. In particular, make sure your function(s)/script(s) let you

remove all of the low-frequency components up to some value (this is a highpass filter)
remove all of the high-frequency components above some value (this is a lowpass filter)
remove all of the frequency components in some range of orientations, for example, all of the components between -5 degrees and 5 degrees, which would get rid of horizontal (and near-horizontal) frequencies. (You will want to keep the frequency (0,0), however.)
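One possible way to express the three filters above is as logical masks over the shifted coefficient matrix. This is only a sketch under stated assumptions: the coefficients here come from a random stand-in image, the cutoff values (30 and 5 degrees) are arbitrary, and the choice of u along columns and v along rows may not match the convention used in fftShow.m, so check which image direction each axis corresponds to before relying on it.

```matlab
% Sketch only: N, B, and the cutoff values below are placeholders.
N = 256;
B = fftshift(fft2(rand(N)));      % stand-in for the coefficients from fftShow
c = N/2 + 1;                      % index of the (0,0) frequency after fftshift

% Frequency coordinates of every entry of B (u along columns, v along rows).
[u, v] = meshgrid((1:N) - c, (1:N) - c);
r     = sqrt(u.^2 + v.^2);        % radial frequency
theta = atan2(v, u) * 180 / pi;   % orientation in degrees, -180..180

lowpassMask  = r <= 30;           % keep frequencies at or below the cutoff
highpassMask = r > 30 | r == 0;   % drop low frequencies, but keep (0,0)

% Remove orientations within 5 degrees of the u axis, on both sides of
% the origin, but keep (0,0).
nearAxis  = abs(theta) <= 5 | abs(theta) >= 175;
notchMask = ~nearAxis | r == 0;

Blow   = B .* lowpassMask;
Bhigh  = B .* highpassMask;
Bnotch = B .* notchMask;

% Back to the spatial domain; real() discards round-off imaginary parts.
D = real(ifft2(ifftshift(Bnotch)));
```

Each mask is symmetric about the center, so a coefficient and its partner at the opposite frequency are always kept or removed together, which is what keeps the filtered spatial image real.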

To implement your filters, copy the fftShow script to a working directory and add the capability (now commented out) to change the Fourier coefficients B and then redisplay the new spatial image (called D in the script). Also, add code to fftShow so that the newly abridged Fourier coefficients appear too. (They should look like the old ones, except with a hole somewhere.) You can emulate code from earlier in the script to do this.

With these tools, generate spatial images and frequency representations of a 256x256 grayscale image of your choice (you can use the program xv to change the size of images easily). Feel free to use the San Francisco image, if you'd prefer. Be sure to include

a lowpass-filtered image (a smoothed image)
a highpass-filtered image (you may need to leave in the uniform (0,0) frequency term, or else you won't be able to see what's happening)
an image with the frequencies around a particular orientation filtered out (a notch filter)

Describe qualitatively what happens to the spatial image when all of the near-horizontal frequencies are removed.

Results:

Applications of Fourier representations

Try creating noise images (use the matlab rand function) and adding them to the grayscale image you investigated above. As we demonstrated in class, lowpass filters can smooth the resulting images.
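As a rough illustration of the noise experiment, the sketch below adds uniform noise from rand to a synthetic image and smooths the result with a frequency-domain lowpass filter. The test image, the noise amplitude, and the cutoff radius are all assumptions chosen for the example, not values from the assignment.

```matlab
% Sketch only: the image, noise amplitude, and cutoff are placeholders.
N = 256;
x = repmat(1:N, N, 1);
A = 128 + 100 * sin(2*pi*x/64);        % smooth stand-in for a grayscale image

noiseAmplitude = 40;                   % arbitrary choice
noisy = A + noiseAmplitude * (rand(N) - 0.5);   % zero-mean uniform noise

% Lowpass filter in the frequency domain.
c = N/2 + 1;
[u, v] = meshgrid((1:N) - c, (1:N) - c);
keep = sqrt(u.^2 + v.^2) <= 20;        % arbitrary cutoff radius
smoothed = real(ifft2(ifftshift(fftshift(fft2(noisy)) .* keep)));

% Root-mean-square error against the clean image, before and after.
rmsNoisy    = sqrt(mean((noisy(:)    - A(:)).^2));
rmsSmoothed = sqrt(mean((smoothed(:) - A(:)).^2));
```

The stand-in image contains only low frequencies, so the filter removes mostly noise. On a photograph, the same filter would also blur edges, and the error comparison would be less one-sided.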

Also, as you might guess, frequency representations are particularly good at removing periodic noise (or unwanted information, at least) from images. Use its frequency representation to remove the grid lines from the following image. Don't download this one -- it's at /cs/cs153/Images/a2/fourier/grid.tif. Notice that it's 512 x 512, which will require some adjustments, at least to the original fftShow.m code.
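To see why periodic patterns are easy to remove in the frequency domain, consider the synthetic sketch below. It assumes a grid whose period (32 pixels) divides the image size exactly, so the grid's energy falls on a regular lattice of frequencies; grid.tif may not be this tidy, and the variable names and values here are hypothetical.

```matlab
% Sketch only: synthetic image and grid, not grid.tif.
N = 512;
[x, y] = meshgrid(0:N-1, 0:N-1);
clean = 128 + 60 * exp(-((x - 256).^2 + (y - 256).^2) / (2 * 80^2));

period = 32;                                   % assumed grid spacing in pixels
gridLines = 50 * ((mod(x, period) == 0) | (mod(y, period) == 0));
corrupted = clean + gridLines;

B = fftshift(fft2(corrupted));
[u, v] = meshgrid((0:N-1) - N/2, (0:N-1) - N/2);

% A pattern with this period only has energy at multiples of N/period.
step  = N / period;
peaks = mod(u, step) == 0 & mod(v, step) == 0 & ~(u == 0 & v == 0);
keep  = ~peaks;                                % (0,0) is kept

restored = real(ifft2(ifftshift(B .* keep)));

rmsBefore = sqrt(mean((corrupted(:) - clean(:)).^2));
rmsAfter  = sqrt(mean((restored(:)  - clean(:)).^2));
```

Because the (0,0) term is kept, the restored image still carries the small brightness offset that the grid lines added to the mean. With a real image you would locate the peaks by inspecting the displayed coefficients, and you might need to zero a small neighborhood around each one.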

Include in these results your "cleaned up" version of the above image, as well as the results of adding noise and smoothing it with lowpass filters. What happens to an image as you let fewer and fewer of the high-frequency components pass? (Give an example.)

Results:

Possible Extensions

Remember that you need to pursue only one extension in one part of the assignment, and it can be anything you'd like to investigate. These are only suggestions.

Fourier representations are also used in "image sharpening," which is a process that highlights image edges to make them easier for humans to discern. How might you use the frequency representation of an image to implement image sharpening? Suggest a technique and try it out.
Right now, the fftShow.m script has a "magic number" that decides which frequencies get displayed and which do not in an image's frequency representation. (It only affects the display of the frequency coefficients in the frequency domain; it does not affect the spatial images at all.) Implement a better approach to deciding what to display and how to display the frequency coefficients.
Consider trying to distinguish objects by using their Fourier representations. For example, can you distinguish Impressionist paintings from Cubist ones (or another genre)?
Consider looking at the frequency descriptions of small portions of an image in order to characterize their texture. Imagine scanning in a page of newsprint with both text and images on it. Could you use this kind of texture analysis to segment the text from the pictures? (This is an important problem for OCR systems.)
Investigate wavelet decompositions of images and compare/contrast them with Fourier decompositions.
Use matlab's GUI-building resources to create a tool for investigating or demonstrating the relationship between the frequency representation and the spatial representation of an image. The imframe.m code would be a good place to start.
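For the suggestion above about displaying frequency coefficients, one common alternative to a fixed threshold is to show the logarithm of the coefficient magnitudes. The sketch below is only an illustration with a random stand-in image and hypothetical variable names; it is not how fftShow.m works.

```matlab
% Sketch only: random stand-in coefficients.
N = 256;
B = fftshift(fft2(rand(N)));

% The (0,0) term is typically far larger than the rest, so compress the
% dynamic range before display.
logMag = log(1 + abs(B));

% Rescale to [0, 1] for display as a grayscale image.
scaled = (logMag - min(logMag(:))) / (max(logMag(:)) - min(logMag(:)));

% imagesc(scaled); colormap(gray); axis image   % uncomment to view
```

The log scaling keeps every coefficient visible to some degree, so there is no cutoff to tune, though very faint structure may still be hard to see.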
