All Databases MacTech Vol 09-1993

MPW Calculator

Volume Number: 9

Issue Number: 1

Column Tag: Jörg's Folder

MPW Calculator Tool in C++

An MPW tool written in bare bones C++ - not with MacApp.

By Jörg Langowski, MacTech Magazine Regular Contributing Author

Note: Source code files accompanying this article are located on MacTech CD-ROM

or source code disks.

After a long break, you’ll find another C++ example in my column. Not with

MacApp - we’re going back to the basics here and show a simple ‘bare C++’ program

that executes as an MPW tool, or in the Simple Input/Output Window environment

(SIOW) provided by Apple.

I came across this example reading a book on C++, “Programming in C++”, by

Stephen Dewhurst and Kathy Stark (1989, Prentice Hall). I very much recommend

this book for those of you who want to get the basic notions of C++ and an idea of its

‘programming flavor’. It may be not as comprehensive as the “C++ programming

language” by Stroustrup (Addison-Wesley), or as the manuals that come with Apple’s

C++ compiler; but it contains a lot of examples and exercises.

One classic exercise in computer science is to write a parser for algebraic

expressions, that is, a program that takes an input string like

(3 + 5) * (-4 + (2 - 9))

and calculates the result of this expression. In fact, what you do to compute that

expression is to convert it into an internal representation, and then evaluate that

representation.

You can represent the expression given above by a linked list:

where each node of the list either contains an operator or a number. Once the

arithmetic expression is given in list form, evaluating it is easy and can be done in a

straightforward, object oriented way. Suppose you define a class node (also in listing):

class node {
protected:
node() {}
public:
virtual ~node() {}

virtual int set (int)

{ cout << "Error: node::set(int) undefined" << eoln;
 return 0;}
virtual int eval()
{ cout << "Error: node::eval() undefined" << eoln;
return 0;}
};

where the constructor and destructor do nothing at the moment; they will have to be

overridden by the derived node classes. There are two more virtual methods: eval(),

which returns the value of the node, and set(), which can ‘set something’ in the node if

defined in a subclass. For the base class, the methods just print error messages and

return zero. These methods will be overridden, and they are virtual because we want

their behavior to be determined at run time. That is, if we define a pointer mynode

*node and assign it an object of a subclass of node, the actual eval() or set() methods

used will be the ones corresponding to the subclass of the object that the pointer

contains.

We now define two types of subclasses of node: dyadic operators and other stuff.

All dyadic operators will be derived from the subclass :

class dyad : public node {
protected:
node *left, *right;
dyad(node *l, node *r) {left=l; right=r;}
~dyad() {delete left; delete right;}
};

dyad only defines the two nodes that the operator connects: the constructor

assigns these two nodes to two instance variables, and the destructor deletes them

again. The classes derived from dyad define the four arithmetic operations, and the

assignment (see listing). For example, addition is defined as:

class plus : public dyad {
public:
plus(node *l, node *r) : dyad(l,r) {}
int eval() { return left->eval() + right->eval();}
};

where the constructor just calls the superclass constructor, and

eval() returns the value of the sum of the values of the left and right hand side of

the plus operation.

For the ‘expression tree’ shown above, you need one more class: numbers, which

are ‘end nodes’ of the list. Thus, they do not contain pointers to other nodes, but an

integer value in an instance variable. Their constructor assigns the value, and their

eval() method simply returns it:

class inumber : public node {
int value;
public:
inumber(int v) {value = v;}
int eval() {return value;}
};

Finally, the unary minus operator (uminus in the listing) will return the

negative value of the node that it points to.

The listing contains two more node classes: variables (id) and the assignment

operator (equals), Variables contain a pointer to the head of a symbol table, and a

pointer to the entry of this variable into the symbol table. The symbol table is a linked

list of entries, and the pointer to its head is a static class variable. This means that,

unlike instance variables, the pointer is not created again for every object of the class,

but only one copy exists. Thus, all objects of this class reference the same symbol

table, which is just what you want.

Assume we create a new node for a variable with its name given by the *char

pointer nm. The constructor then first looks up the name in the symbol table (see

listing, method look); if it is already there, it will put a pointer to the symbol table

entry in the instance variable ent of the node. Otherwise, it will add a new entry to the

symbol table, put its pointer into ent, make the new entry the head of the list, and

change the (static) pointer so that it references the new head. All other variable

objects will now have an updated reference to the symbol table.

A variable is assigned a value by the assignment operation equals. When an equals

node is created, its right and left hand sides are set just as for the arithmetic

operations; when its eval() method is called, the variable on the left hand side is set to

the result of the right hand side. No check is done whether the left hand side is really a

node of class id; but only those nodes contain a set method. The assignment operation

also returns a value (just as in C), the right hand side of the statement.

So now we have defined a syntax tree for simple arithmetic expressions with

assignment of variables. If, lets say, root is the pointer to the root of this tree (the

times symbol in the drawing), and we call root->eval(), the result returned should be

the value of the expression, in our case (3 + 5) * (-4 + (2 - 9)) = -88. With the

class definitions given so far, this should work. But how do we set up the syntax tree in

the first place? We need a routine, an expression parser, that takes the string

expression and constructs the tree from it.

In order to write the parser, we first need to formalize the syntax of our simple

arithmetic expressions. Such a formal description would for instance look like this:

 :==  { [+|-]  }
  :==  { [*|/]  }

:== | |

() | - |

We then define three parser routines that return an object of class node: e(),

t(), and f(), returning, respectively, a pointer to an expression, a term, and a factor.

Each of these routines makes repeated calls to another routine scan(), which gets the

next token from the input stream. A token is a separate syntactic element, such as an

open or close bracket, an arithmetic operator, a number or an identifier. scan()

returns a character value, either the ascii value of a symbol if the token consists of

one symbol, or a special value > 127. For the special values ID and INT, additional

information can be found in a static char variable; this string will be used for the

value of a number or the name of a variable. Other possible values are BAD (the

scanner found something it couldn’t interpret), or EOLN (end of line found).

The first of the three parser routines, e(), is called from the main program (see

listing for main()). On entry, one token has been read from the input stream; then e()

tries to parse the input into a valid arithmetic expression and assign a pointer to its

syntax tree to root. It does so by calling t() (see listing), which makes a term out of

one or several factors by calling f(), just like e() makes the expression out of one or

several term. When e() encounters a plus or minus sign after a term, is makes a new

plus resp. minus node which connects the first term with the next one. Same for t();

here eventually a times or divide node is made, which connects two factors. f(),

finally, will look for a number or identifier token and return a pointer to the

corresponding node; or it might find an open bracket or a minus sign, after which a

new expression or a factor have to follow. When all these recursive calls have been

evaluated and no syntax errors have been found, the top-level e() returns the

expression’s syntax tree.

In order to get the value of the expression, all we have to do then is call

root->eval(), and the syntax tree will be evaluated as described above.

As long as no syntax errors are found, the main program loops continuously and

asks for new expressions; the names and values of identifiers are remembered from

one evaluation to the next, so you can play around with stored values a little.

As you may imagine, it is not too difficult to add on to this extremely simple

calculator program e.g. to implement exponentiation or to include functions like

exp(), log() etc. The constructed syntax tree could also serve in a bigger program as

an internal representation of a function that the user has typed in and that has to be

evaluated a lot of times (and computed fast). One could even imagine to generate real

machine code out of the syntax tree, which makes this program some kind of a

rudimentary compiler. We’ll add to the example during the next columns.

On the source code disk, I have compiled two versions of the program, one as an

MPW tool, and the other one using the Simple Input/Output Window mechanism.

Actually, the only thing that has to be changed is the linking. The MPW sequence to

create the two versions of the program look like the following:

cplus calc.cp

Link -w -c 'MPS ' -t MPST ∂

calc.cp.o ∂

"{CLibraries}"StdCLib.o ∂
"{Libraries}"ToolLibs.o ∂
"{Libraries}"Runtime.o ∂
"{Libraries}"Interface.o ∂
"{CLibraries}"CPlusLib.o ∂
-o calc
Rez -a "{MPW}"Interfaces:Rincludes:SIOW.r -o calcAppl

Link -w -c 'JLMT' -t 'APPL' ∂

calc.cp.o ∂

"{CLibraries}"StdClib.o ∂
"{MPW}"Libraries:Libraries:SIOW.o ∂
"{Libraries}"Runtime.o ∂
"{Libraries}"Interface.o ∂
"{CLibraries}"CPlusLib.o ∂
-o calcAppl

So the second version can be run also by those of you who don’t have MPW.

FORTHcoming

I’m still waiting for those MacForth contributions coming in we’d really like to

see more of that in MacTutor. I’m sure there are quite a few of you satisfied MacForth

users who have something interesting to write about. Please contact me: I promise that

you get your space in this column. My network address has changed in the meantime,

since our institute is going from the Bitnet to the Internet world; so in the future,

Referenced by (3):