CS 60, Assignment 9

Computer Science 60
Principles of Computer Science
Spring 2015

Assignment 9: Seam Carving [100 Points]
Due Tuesday, April 7, 11:59pm

This week's homework is a natural way to get comfortable with loops - and nested loops - in Java, while implementing a cool image-processing application known as Seam Carving. In addition, this is an exemplar-based introduction to a powerful CS technique for creating fast algorithms known as Dynamic Programming, or DP. You will design several DP algorithms next week; this week, we focus on implementing an example of DP.

Pair Programming

You may do this assignment as a pair, if you wish. However, remember that working as a pair obligates you to obey the course pair-programming guidelines:

Both members of the pair must be at one computer together while collaborating.
One student will be the "driver" who controls the keyboard and mouse. The other is the "navigator" who observes, asks questions, suggests solutions, and thinks about slightly longer-term strategies.
It is not permissible for one member of the pair to work on the code while his or her partner is not there or not participating actively.
The person typing at the keyboard should be swapped at least every 30 minutes. Be vigilant about this!
Both students should be able to explain the resulting code.
Both students should submit the Picture.java file, and include in the partner textfield #otherlogin where otherlogin is their partner's CS submission login name.

Seam Carving (100 Points)

Seam carving was first published in this paper by S. Avidan and A. Shamir in 2007. It's a relatively accessible paper -- take a look! That said, all of the background needed should be available from lecture and this write-up.

Seam carving finds the "least-noticeable" seam from top-to-bottom or left-to-right in an image. Then, repeated seam removal allows for image-aware resizing to any smaller size. As with all of CS, this requires quantifying what is meant by "least-noticeable"!

For us, "least-noticeable" refers to the shortest path through the image, where distance is defined to be the total strength of the image edges traversed along the way. The paper offers a seam-insertion algorithm for resizing to larger image sizes, too, though that won't be an official part of this assignment.

This assignment starts with the files in hw9src.zip

In that folder there are many images (used for testing) and a number of .java source files. The only .java file you'll need to edit is Picture.java. The other ones you'll use are PictureTest.java (for running the JUnit tests) and Pixel.java (for accessing the images' pixels). The other files are listed here, but knowing their details are not important for completing this assignment. Here is an overview of all of them:

Files to edit:

Picture.java: This class is a subclass of the SimplePicture class, and has placeholders where you'll implement various pixel-processing routines.

Files to use:

Pixel.java: This class provides static and non-static accessor methods that allow you to obtain information regarding a pixel in a picture. This also uses Java's built-in Color class. Here is the online reference for that Color class.
PictureTest.java: This class contains the JUnit test cases that test the functions in the Picture class. You won't need to write additional tests, but you will want to use these tests.

Other files: (you don't need to look inside these; if you want to do so, see the extra credit...)

FileChooser.java: This class controls and abstracts the file choosers that allow the user to select files to open and save.
ImageDisplay.java: This class controls and abstracts the image display and the crosshair feature.
PictureExplorer.java: This class brings together and controls the various elements of the graphical user interface.
PictureFrame.java: This class controls and abstracts the frame containing the picture.
SimplePicture.java: This class is the parent of the Picture class, and also stores variables and methods relating to a particular picture file, allowing the Picture class to deal only with the various features that can be implemented on that picture file.

How to get the starter code working...

It's a bit different getting an existing folder to serve as source code within Eclipse. Here are the steps:

Create a new Java project via File - New Java Project
Give your new project a name, e.g., hw9
Click Next (instead of Finish) - if you accidentally do click finish, you can change the settings later on through the "Java Build Settings"...
If you clicked Next, you'll see a dialog of Java Build Settings. Within that dialog, there are two things to do:
1. Under the source tab, click Link Additional Source. In the interface that appears, under Linked folder location, click Browse and then navigate to the hw9src file you downloaded and unzipped from the starter code, above. Note: depending on how it is unzipped, you may have a hw9src folder within another hw9src folder -- choose the inner one. Then, click Finish.
2. Back in the Java Settings dialog, click the Libraries tab. You'll need to include the JUnit library. To do this, click the Add library... button, choose JUnit, the default of version 4 is OK, and then click Finish.
Then, you can Finish the entire process.
To see your new project, go to the Project Explorer tab or view in Eclipse. It may already be open, but if it's not, you can open it with Window - Show View - Project Explorer (or Other... - Project Explorer).
Within that Project Explorer view, you should see the new project you just created. Within it will be the hw9src source folder you linked. Within that, you'll see lots of images, but at the top you should see default package and, within that, all of the source files from the starter code. If there is a red "error" icon by PictureTest.java, that is because the JUnit library did not get addd to the project yet.
If you already have a project and want to go back to add the hw9src folder as an official "source folder" for it - which you need in order to run things - then select that project within the Project Explorer tab and use the menu options Project - Properties to bring up the "Java Build Path" dialog box. From there, you can go through the numbered steps above (Source tab - Link Source... and Libraries tab - Add Library...) in order to make sure everything is set.
Phew!

Running the program

Once the hw9src folder is a source folder and JUnit 4 is an included library, you should be able to run the program itself:

Open the Picture.java file - this is where the main method is located.
With this file in focus, choose Run - Run As... - Java Application. You should see a user interface appear with a picture of Maria Klave (holding one of her paintings) as the starting image.
Try out loading a new image with File - Open. You'll have to navigate to the hw9src folder the first time, but for additional images it should start wherever you left off the previous time. Okinawa.bmp and Camel.bmp and several others are available. Also, try your own images or ones you grab online and download to your Desktop/Downloads... the images do not have to come from the Java source folder.
Also, try zooming in/out. If you use the 5000% zoom on anything but a very tiny image, it will likely run out of memory - this is Ok.
The only image-processing method that works within the starter code is the Change colors - Grayscale. Try that, too.
The rest of the options are the assignment -- or the extra credit...!
File - Reset Picture tries to reload the file with the same filename as the last image displayed. If the last image displayed did not have a filename, you'll have to use File - Open... instead.

Image-processing methods (functions) to write

First, from the Project Explorer tab, open up the Picture.java file. From Eclipse's Outline tab - or simply scrolling about halfway down the file - you should see the methods grayscale2 and grayscale. These are examples from which to start. Below them are several methods marked with TODO, which are the ones to implement.

You will gradually extend the functionality of the program by writing several methods in the Picture class within the Picture.java file. Here are the signatures of these methods. Afterwards are some thoughts and hints on how to approach this!

public Picture negate() should return the "photoinverse" of this Picture
public Picture lighten(int lightenAmount) returns a new Picture that is lighter in red, green, and blue by lightenAmount (maxing at 255)
public Picture darken(int darkenenAmount) similar for creating a darker version of this Picture
Picture addBlue(int amount) should return a new Picture with only the blue channel changed (keeping r,g,b values beteween 0 and 255, inclusive)
Picture addGreen(int amount) and Picture addRed(int amount) return similar new Pictures, changed in their respective channels (green and red)
Picture rotateRight() returns a brand-new Picture that holds this Picture, only rotated right by 90 degrees. Thus, if the original Picture was 50x70, the the new image will be 70x50.
public Picture luminosity() should return a different grayscale image, using the conversion in the starter file (and described below)
private int luminosityOfPixel(int x, int y) returns the luminosity of this Picture at Pixel (x,y). Implement this before luminosity().
public Picture energy() should return the edge-energy image, as described below
private int getEnergy(int x, int y) is similar to luminosityOfPixel but returns the edge-energy of this Picture at Pixel (x,y). Implement this before energy().
private int[] computeSeam() should return an array of integers representing the location of the least-notcieable seam in this Picture. This is the crucial subroutine and the DP algorithm this week! See below for a pretty extensive explanation.
Picture showSeam() should return a new Picture, a modified copy of this, which includes the vertical seam (found from computeSeam) drawn in red.
public Picture carve() should return a new Picture of a size one pixel less in width than this. The new Picture should be the same as this except with the seam (found from computeSeam) removed.
Picture carveMany(int numSeams) should return a new image similar to carve, but it should have removed numSeams number of seams. You'll want to call carve in a loop to implement carveMany.

Background, context, and hints

In all of our images, whether color or grayscale, each individual pixel has:

a redness level (0-255, where 0=no redness and 255=maximum redness)
a blueness level (0-255),
a greenness level (0-255),
an opacity or opaqueness (0-255, where 0=completely transparent and 255=totally opaque).

We will only use the red, green, and blue components; opacity will always be 255. Shades of gray arise when you have equal amounts of red, green, and blue, with black containing 0 for all three and white containing 255 for all three.

The grayscale2 and grayscale functions convert from RGB to gray by computing the average of the three. they also show how to use many of the methods in the Picture class (for example, the constructor to copy and image) and in the Pixel class, e.g., the way to access individual components of a Pixel or to access a whole Color object that, in turn, has all of the color components needed.

To implement the first six methods, adapt one of the two approaches to computing grayscale, either grayscale2 or grayscale (with helper methods). Be sure to try out your image transformations in the graphical user interface (GUI). Also, be sure to test your implementations with PictureTest.java

rotateRight

For the rotateRight method, you will need to use a different one of the Picture class's constructors in order to create a brand-new image of a different size. Here is an example call:

Picture newPicture = new Picture(newWidth, newHeight);

of course, this presumes that you have defined newHeight and newWidth beforehand!

From there, you will want to loop through all of the pixels, copying their color from the old image (this) to the appropriate place in the new image (whatever you've named it). Be sure to return the new image in the end! Personally, I looped through the destination picture's pixels and - for each one - computed the location from which to grab the source pixel that corresponded to it. The reverse works equally well.

Note that objects of type Pixel have methods that will get and set all of the components of a color at once! They use Java's Color class, e.g.,

Color c = some_pixel.getColor();
some_other_pixel.setColor( c );

or even the equivalent

some_other_pixel.setColor( some_pixel.getColor() );

luminosityOfPixel and luminosity

These two methods compute a single (grayscale) value from the r, g, and b of a color's channels different than uniform averaging. Luminosity is a weighted average of red, green, and blue in which the weights take into account that human vision has much better visual acuity within some color frequencies (e.g., greens) than in others (e.g., blues):


  luminosity = (int)(0.21 * redness + 0.72 * greenness + 0.07 * blueness);

Warning! The order in which you perform the luminosity calculation, above, matters! Be sure to use the order above (adding red, then green, then blue) with the coefficients shown. Small floating-point errors can accumulate and, once in a while, they will change a pixel value by 1 gray level -- not enough to see, but enough to cause a test to fail...

Here is a visual example showing the difference between uniform averaging and luminosity.

Image energy: background

The key idea in seam-carving is finding the edge-energy-minimizing path through the image. But what is this "energy"?

Intuitively, the strength of the edges around a particular pixel is that pixel's "energy." High-energy pixels are ones at a strong edge in the image -- those are pixels we're most likely to notice if they disappear. Low-energy pixels are those in the middle of uniform patches of color or gray -- those are least likely to be missed!

To quantify this idea of "energy," we use simple approximations to the image derivative, which is simply how fast the luminosity is changing between adjacent pixels. If we abbreviate the luminosity at point (x,y) as \(i(x,y)\). Moving horizontally, we can numerically approximate the horizontal derivative (rate of horizontal change in intensity) at (x,y) with any of three methods:

The forward difference \[ i(x+1, y) - i(x, y) \]
The backwards difference \[ i(x, y) - i(x-1, y) \]
The central difference \[ \frac{i(x+1, y) - i(x-1, y)}{2} \]

where i(x,y) denotes the luminosity at point (x,y).

Although the central difference is the most accurate approximation numerically, Avidan and Shamir simply usd the forward energy, which is what we will do.

Vertical derivatives are similar. For example the vertical central difference would be computed as \[ \frac{i(x, y+1) - i(x,y-1)}{2}. \] Again, however, we will use the forward vertical energy.

The Seam Carving paper defines the energy at each pixel to be the sum of the absolute value (Math.abs) of two derivatives: the horizontal and vertical rates of change in luminosity. Specifically, you should use the forward difference for both the horizontal and vertical directions:

\[ energy = abs( i(x+1, y) - i(x, y)) + abs( i(x, y+1) - i(x, y) ) \] except when the pixels are on the edge of the image so that they do not have a "forward" neighbor, either x+1 and/or y+1. Whenever one of those cases happens, your implementation should use the backwards difference. For one pixel (the lower right corner), you'll need to use two backwards differences.

Important: again, the test cases provided in PictureTest.java assume that when possible you will use the forward difference. And when the forward difference is not available you will use the backwards difference.

computeSeam

This is the dynamic-programming part of the assignment. A vertical seam is a sequence of pixels, one per row of pixels, such that pixels in adjacent rows are no more than one column apart. Since there's exactly one pixel per row, we only have to remember the columns of each pixel; this can be represented as a single array of integers.

Step one is to create an array table of integers, of the same size as the image. Each entry table[x][y] is the total energy of all pixels in the least-energy seam starting on the top row and moving down to the ending at pixel (x,y). It is sometimes called the cumulative-energy table: it does not contain single energies, but the smallest cumulative energy starting from anywhere at the top and ending at pixel (x,y). As discussed in the paper, we can define the entries in table recursively by \[ \begin{array}{lcl} \mathrm{table[x][0]} &=& \mathrm{getEnergy[x][0]}\\ \mathrm{table[x][y]} &=& \mathrm{getEnergy[x][y]} + \mathrm{min} \begin{cases} \mathrm{table[x-1][y-1]}\\ \mathrm{table[x][y-1]}\\ \mathrm{table[x+1][y-1]}\\ \end{cases} \end{array} \] That is, the least-energy seam ending at (x,y) extends the best seam ending either directly above-and-to-the-left, directly above, or directly above-and-to-the-right. (Note that for pixels on the far left and far right borders, only two of these three possibilities exist.)

The dynamic programming approach is to fill out this table in order of increasing y: first all the entries where y=0, then the entries where y=1, etc.

We also need code to keep track, at each pixel, which of the 3 possible parent seams was chosen. This means creating a second 2D array, perhaps named parent. The idea is that parent[x][y] will hold the column number (x-value) of the previous-row's pixel in the best seam ending at (x,y).

For example, if we had a 3-by-3 image whose energies were \[ \begin{array}{ccc} 1 & 2 & 3\\ 8 & 6 & 4\\ 5 & 6 & 5\\ \end{array} \] then table would be \[ \begin{array}{ccc} 1 & 2 & 3\\ 9 & 7 & 6\\ 12 & 12 & 11\\ \end{array} \] and parent would be \[ \begin{array}{ccc} X&X&X\\ 0&0&1\\ 1&2&2\\ \end{array} \]

The top row of the parent array doesn't matter, since there are no more pixels above.

Slightly more generally, if while computing table[x][y] we decided the smallest of the three possible parent locations was table[x+1][y-1], then parent[x][y] should be set to x+1. If the smallest of the three possible parent locations was table[x][y-1], then parent[x][y] should be set to x. And so on.

Finally, when we have filled out the table and parent arrays, we can find the best seam. You won't be surprised that several loops are involved in this!

First, we loop across the bottom row of the table array. As we do so, we find the smallest table value in that bottom row (where y == height-1). The location you find has the minimum cumulative energy and will be where the seam ends. Then, using the parent array, you can determine whether the seam extends up-and-left, up, or up-and-right. Then, you continue working your way up, until reaching the top row.

As this process happens, you can collect each of the column (x) values for the pixels in the seam at each row. A one-dimensional array whose length is the height of the picture is ideal for this: the index into the array is the row number and the contents of the array is the column location of the seam at that row. (Think of it as a "column" array, though it doesn't really have any preferred direction.) My implementation called this array seam.

Continuing from the 3x3 example above, the best seam ends in the lower right corner, in column 2, because that's the smallest value of table in the bottom row. Thus, seam[2] = 2. Working backward through the parent array, we find that the best north-neighboring pixel in row 1 (the one with lowest cumulative energy) is also in column 2, so seam[1] = 2. From there, the best north- neighboring pixel in row 0 is in column 1, so that seam[0] = 1. Thus we get the seam (from top to bottom) {1,2,2}, corresponding to original pixels that had intensities 2, 4, and 5.

Tiebreaking!

(Thanks to Savannah Baron for pointing this out!) We need a rule for breaking ties when defining the seam: more than one of the neighbors in the next-row-up may share the same minimum value!

If there is a tie when you're identifying which Pixel to select on the minimum-cost path, please select the Pixel with the same x value as the first choice (if it is the min or shares the min). Choose the Pixel to the left (x-1) as the second choice (if it's the min but the previous one is not). Finally, choose the pixel to the right (x+1) - if it's the only one with the minimum value.

If there are multiple paths that have same, minimum cost: Select the path that has the minimum X value on the bottom row (i.e. the largest Y value)

Please make sure you understand what table, parent, and seam are before you start coding!

showSeam

This method should return a copy of this with the lowest-cost seam, as computed by computeSeam, shown in red. To do this, you'll probably want to start by (1) computing the best seam and (2) making a new copy of this. For instance, these lines will accomplish these two things:

    int[] seam = this.computeSeam();
    Picture newPicture = new Picture(this);

From there, you'll need to loop through and change some of the pixels of newPicture to be red -- namely those on the seam! Here is an example from the okinawa.jpg image:

carve and carveMany

The carve method is similar in spirit to showSeam, except that instead of coloring the seam red, it removed those pixels altogether. Because of this, you won't be able to use the Picture constructor that copies this, because your newPicture has to have a different size! In this csae, use the brand-new Picture constructor, e.g., starting with

Picture newPicture = new Picture(newWidth, newHeight);

where you'll need to define newWidth and newHeight based on the height and width of this!

From there, you'll need to copy pixel colors from this to newPicture. Notice that we always copy the colors of Pixels, not the Pixels themselves... . However, in doing this, your code will need to be careful to skip the seam's pixels and to shift over all of the pixels to the right of the seam by one pixel place to the left.

For the carveMany method, you will want to loop numSeams times, carving once each time! Note that you can write a line such as this:

    Picture p = this;

and then reassign p to the result of p.carve() the correct number of times!

Testing

The file PictureTest.java has 34 of tests with which to check your implementations. You do not have to write your own tests, though you may write your own testing code in order to understand and/or debug some of the image-processing functions. Several helpful debugging methods have already been written -- feel free to use them:

printIntensity()
printEnergy()
printArray(int[][] A)
showDifferences(Picture p)

The file PictureECTest.java has 14 extra-credit tests, as well.

Style guidelines

About 10-15% of this week's hw score will be based on style. The usual style guidelines apply, but there are also a few that are specific to Java:

Your file should be indented to clearly show the structure of nested statements like loops and if statements. Nonuniform indentation or use of curly braces will entail deductions, as will not indenting your code to reflect the nexting (which Python requires) In addition, it's a good idea that all if, else, while, do, and for statements use braces, even if they are not syntactically required.
All classes start with a capital letter, all methods and (non-final) data fields start with a lower case letter, and in both cases, each new word within the name starts with a capital letter. Constants (final fields) are all capital letters only.
Numerical constants with special meaning should always be represented by all-caps final static constants.
All class, method, field, and variable names should be meaningful to a human reader.
Methods should not exceed about ~50 lines, though computeSeam can get close! Any method too long can probably be broken up into logical smaller pieces. The same is probably true for any method that needs more than 4-5 levels of indentation (even that is an awful lot!)
Overall, the goal is to create programs that are easy to read and understand!

FAQ

Q. I keep getting the following error: Uncaught error fetching image: java.lang.NullPointerException. A. This error message arises because some, or all, of the images are not in the same folder as the result of your code. Please place any additional images files in the same place!

Q. I keep getting the following error: Exception in thread "main" java.lang.OutOfMemoryError: Java heap space. A. This error message arises when you are trying to load too big a file. Given the current framework, the project works best with moderately-sized images.

Q. I tried to compare two images together using the equals method in the Picture class, and even though they are obviously visually the same, the equals method returns false. A. Check to see whether one of the image files is a JPEG file. The equals method fails on JPEG files because JPEG files use lossy compression to store images: in other words, they compress an image to store it, but doing so causes information to be lost. Since the equals method attempts a perfect information match, it does not work well with JPEG files. This problem will not arise with GIF, BMP, and PNG files, since these file types involve lossless compression.

Q: I run my program many times and after a while eclipse starts to get really slow. A: When you close the picture explorer using the red X at the top right of the window (instead of using File->Exit), Eclipse doesn.t really end your process. Go to the debug perspective in Eclipse and click on the red square to stop each process individually.

Q: Any tips on these two dimensional arrays? A: Here is a helpful resource for Arrays: A: Here is a quick-summary reference on Java Arrays and here is Java's official (longer) tutorial on arrays.

Q: If there is a tie for a minimum path . which path should we pick? A: By default, please pick the cell above in a tie, if there is only a tie between the left and the right pixels, please pick the left pixel.

Q: My energy is off by just a few pixels! What is wrong? A: Inside the energy() method you'll be working with two different Picture objects. The object the method was called on (this) and the one you plan to return. When you call the method getEnergy(x, y) make sure you call it on this and not the Picture object you plan to return because the contents of this won't be changing... .

Q. My showEdges method does not pass the tests, even though my resultant picture and the test picture look exactly the same. What can I do?
A. If you are using the Pixel.colorDistance method, cast its result to an integer before comparing it with the threshold.

Submission

Submit your Picture.java sourcefile in the usual way under homework 8. You'll onle need to submit PictureExplorer.java if you try some of the additional extra-credit.

Extra Credit

For up to +10 points of extra credit, there are several additional image-processing functions in Picture.java. These have tests already written with which to try things out. They also have descriptions in the Picture.java source file:

chromaKey - changing pixels based on color
flip - more general geometric transformations than rotateRight
showEdges - an edge-detection routine
blur - smoothing the features of an image
paintBucket - changing the color of connected pixels within a certain color distance...

For even more extra credit, depending on what's implemented, you're welcome to add features and/or other image-processing capabilities by changing the user interface. If you'd like to do this, you'll want to change PictureExplorer.java and submit your new version of that file.

For example, you could include a menu option for

horizontal seam-showing and -carving. (And the image processing for this can be done by composing existing code, in fact!)
seam-insertion - as described in the paper, this is a small variation on carveMany
pixel-weighting - in which you give pixels very high or very low energies regardless of their content in order to protect or intentionally remove them, respectively...
You're welcome to create other features you'd like to design, as well!

Computer Science 60 Principles of Computer Science Spring 2015

Assignment 9: Seam Carving [100 Points] Due Tuesday, April 7, 11:59pm