CS 60, Assignment 8

Computer Science 60
Principles of Computer Science
Fall 2005

Assignment 8 "Lite": Parsing and Prolog!
Due Monday, October 24 by 11:59 PM

Please remember to complete the on-line mid-semester course evaluation before you submit this assignment. Thank you!

This assignment has three parts. In the first part, you will implement a recursive descent parser for a small "language". In the second and third parts, you will have a chance to start playing with Prolog.

Part 1: Recursive Descent Parsing (35 Points)

In this part of the assignment you will be writing a recursive descent parser and an evaluator for arithmetic expressions with addition, multiplication, exponentiation, negation, and parenthesization. In particular, we will use the unambiguous context-free grammar that we saw in class:

S -> P + S | P
P -> E * P | E
E -> U ^ E | U
U -> ( S ) | - V | V
V -> '0' | ... | '9'

Your task is to write a recursive descent parser in rex for this grammar. One of the functions in your file should be called parse. This function takes a string to parse and returns a list of two things:

A parse tree, a tree of operations and variables that reflects the computational structure of the input expression as we demonstrated in class.
A residue list, a list of characters (tokens) that have not yet been parsed.

Note that the first element in each case will be a parse tree, and the second element will be the residue list. If the parsing was successful, the residue will be the empty list.

From our experience, a good way to approach this problem is to first build a parser for a simplified version of this grammar. For example, you might begin with the grammar:

S -> P + S | P
P -> E * P | E
E -> V
V -> '0' | ... | '9'

Once you get the parser working for this grammar, you can extend it to the full grammar for this problem.

Note that in the full grammar, the rules for variable U are a little different from the other rules, in that they handle parentheses and negation. Think about these rules! While they seem a bit strange, they actually are "easier" for the parser than the other rules!

Here's an example of how your parse function will behave once you have implemented the full grammar. Notice that in the first two examples, the residue is empty because parsing was successful. The third example did not succeed in parsing because the input was malformed.

rex > parse("1+2^3");
[[+, 1, [^, 2, 3]], []]

rex > parse("(1+2)^3+-4");
[[+, [^, [+, 1, 2], 3], [-, 4]], []]

rex > parse("1+2*+3");
[[+, 1, [*, 2, Error]], [BADTOKEN, 3]]

Take a look at (and feel free to use any part of) the file part1.rex which is in the directory /cs/cs60/assignments/assignment8. It shows a recursive descent parser for expressions using only addition. Note the use of explode to tokenize the original string, i.e., to break it into individual meaningful units, which in this case are single characters. Note that your input string should have no whitespace, since whitespace will be converted into additional "space" characters by explode.

Once you have successfully written your recursive descent parser, implement an Evaluate function and whatever other helper functions you need. Evaluate takes as input an arithmetic expression (with addition and multiplication and one digit numbers) and returns its arithmetic value. For example, here is how your Evaluate function should behave.

rex > Evaluate("1+2^4");
17

rex > Evaluate("1*2+3");
5

3 rex > Evaluate("1+");
Bad Input

Notice that this function should return the string "Bad Input" if the expression is malformed. The Evaluate function will need to call the parser and some helper functions. You may wish to use the rex built-in function make_number which takes a character as input returns a number corresponding to that character. For example,

rex > make_number('4');
4

rex > 1 + make_number('2');
3

Remember to place all of your code for this part of the assignment in a file called part1.rex.

For 5 extra credit points augment your grammar, parser and evaluator to permit the use of multi-digit numbers as input. For example, we could ask Evaluate("(60+42)+-1"). If you do this bonus, make sure to put a comment in the first line of your part1.rex file stating that "THIS SUBMISSION HANDLES THE BONUS PROBLEM".

Part 2: Playing with Prolog! (15 Points)

In the directory /cs/cs60/assignments/assignment8 you will find a file called part2.pl. This file contains an expanded database of the Simpsons family. You should make a copy of this file and add the following rules to this file. You may also add any additional helper rules that you wish, including anything that we looked at in class. However, please don't change the facts about parenthood/relationships among the Simpsons!

grandparent(X, Y) is true if and only if X is a grandparent of Y.
cousins(X, Y) is true if and only if X and Y are first cousins. Note that a person cannot be their own cousin. Also, do not consider brothers and sisters to be cousins for this problem.
hasYS(X) is true if and only if X has a younger sister. (Consider "younger" to mean strictly younger in age.)

Recall that "," is the symbol for AND, ";" is the SYMBOL for OR, and "\+" is the symbol for NOT. Please look at the class notes for other Prolog syntax. Your file should be called part2.pl and submit it using cs60submit.

Part 3: Lists in Prolog (25 Points)

Here, you will write three Prolog rules for lists. Please submit these rules in one file called part3.pl using cs60submit. You may use any helper rules that you like. In particular, some of the rules that we saw in class may be useful to you here. You may implement and use any of them.

reverse(X, Y) is true if and only if list Y is the reversal of list X. Notice that you can use this rule to now actually compute the reversal R of a list by typing, for example, reverse([1, 2, 3], R). Make sure this works.
occurs(X, L, N) is true if and only if item X occurs in list L exactly N times. Here is an example:
```
      | ?- occurs(spam, [oh, spam, spam, we, love, spam], X).

      X = 3 ;
      no
    
```
When writing this predicate, be careful that it "computes" exactly the correct number of occurences (an easy pitfall is to write a version of this function that returns all values of X less than or equal to the number of occurences). Also, your occurs predicate should be able to generate the terms that occur a particular number of times, e.g.,
```
      | ?- occurs(T, [oh, spam, spam, we, love, spam], 3).

      T = spam ;
      no
    
```

One-dimensional pattern matching is an important task in many applications such as cryptography and DNA sequencing. We'll encode a pattern as a list of items and we'll search for a pattern in another list called the "target". The rule find(Pattern, Target, Index) takes two lists Pattern and Target and a non-negative integer Index as input. This rule is true if and only if Pattern occurs inside list Target beginning at position Index. For example, here is a Prolog query and response:
```
      | ?- find([1, 2], [1, 2, 1, 2, 1], 0).
      yes
    
```
Notice that pattern [1, 2] occurs in target [1, 2, 1, 2, 1] beginning at position 0 of the target (that is, the very beginning of the target list).
```
    [1, 2, 1, 2, 1]
     ^
     |
     The pattern [1, 2] appears beginning right here at location 0.
     I'm looking for a 1 followed immediately by a 2 and I find it
     beginning here!
    
```
It also occurs again at position 2 of the target.
```
    [1, 2, 1, 2, 1]
           ^
           |
           The pattern [1, 2] appears beginning right here at location
    2.
    
```
Thus, here's another Prolog query and response:
```
      | ?- find([1, 2], [1, 2, 1, 2, 1], 2).        
      yes
    
```
In fact, these are the only two occurences as indicated by the following Prolog query and response:
```
      | ?- find([1, 2], [1, 2, 1, 2, 1], X).   

      X = 0 ;
      X = 2 ;
      no
    
```
find(P,T,I) should also work when both P and I are unbound variables. Write the rules for find. Submit your code in a file named part3.pl.

Computer Science 60 Principles of Computer Science Fall 2005

Assignment 8 "Lite": Parsing and Prolog! Due Monday, October 24 by 11:59 PM

Part 1: Recursive Descent Parsing (35 Points)

Part 2: Playing with Prolog! (15 Points)

Part 3: Lists in Prolog (25 Points)

Computer Science 60
Principles of Computer Science
Fall 2005

Assignment 8 "Lite": Parsing and Prolog!
Due Monday, October 24 by 11:59 PM