Bertrand

Wm Leler

Introduction

Bertrand is a computer language that I wrote back in 1986. I also wrote a book about it, called "Constraint Programming Languages" that was fairly popular for a while. As a computer language, Bertrand has some interesting and unique features.

There is a very primitive implementation of Bertrand available at http://leler.com/BERTRAND.zip

There is also a Google Group for Bertrand at http://groups.google.com/group/bertrand-constraint

There was a much more powerful version of Bertrand, but I wrote it on someone else's dime, so I had to rewrite it from scratch in order to distribute it. Recently, several people have started playing around with Bertrand again, probably because computers are now fast enough to execute Bertrand programs. You are welcome to play with it. Please tell the Google Group about what you did with Bertrand.

Constraint languages

Constraint Languages are languages where you do not provide an algorithm for solving a problem. Instead, you place "constraints" on what the solution looks like, and let the computer figure out how to solve the problem. By necessity, constraint languages tend to be fairly domain specific, since the language interpreter needs to know how to solve problems.

Bertrand looked at constraint languages another way. Bertrand itself is not a constraint language. Instead, it is a general problem solving language, which you can use to build constraint solving systems. A Bertrand program is a set of rules, which are executed to solve a problem. The rules are typically very domain specific. Bertrand itself knows absolutely nothing, other than how to execute rules. In fact, it doesn't even know how to do simple arithmetic.

Bertrand comes with a few sets of rules. One of these is called "bops" which defines standard arithmetic operators and rules. Another library is beep, which is a simultaneous equation solver. These libraries are made up of Bertrand rules.

Primitives

Bertrand even does arithmetic as if it were interpreting rules. For example, if given the expression 2 + 2, Bertrand acts as if it has a rule that says to replace the expression 2 + 2 with the expression 4. This rule would look like this in Bertrand:

2 + 2 { 4 }

Of course, it would be unweidy to have billions of rules to define arithmetic, so the Bertrand interpreter cheats by having simple arithmetic built in.

A rule has a head and a body. Rules can contain variables called parameters, which are local to the rule. For example,

0 + x { x }

This rule will rewrite 0 + 15 as 15.

Here are the (complete) rules for Boolean arithmetic, from bops:

a ~& b { ~(a & b) } a | b { ~(~a & ~b) } a ~| b { ~a & ~b } false & a { false } a & false { false } true & a { a } a & true { a } ~true { false } ~false { true } ~~a { a }

Constraints

With bops and beep, Bertrand is powerful enough to solve simple arithmetic constraints. A constraint to be solved by Bertrand is simply an expression. For example, here is a constraint to be solved:

F = (1.8 * C) + 32 0 = C what is the value of F?

In order to be solved by Bertrand, this must be written as a single expression, so we introduce the semicolon operator, which asserts that its left argument is true, and returns the value of its right argument. The Bertrand program for the above set of constraints is:

F = (1.8 * C) + 32 ; 0 = C ; F

Bertrand returns 32 as the value of this expression. Here's another one:

F = (1.8 * C) + 32 ; F = C ; C

Bertrand returns -40 as the value of this expression. Note that the equals sign is not used for assignment, and that simultaneous equations can be easily solved. In fact, the equals sign is used as in mathematics. The semicolon operator is what asserts something as true. Consider the following constraints:

F = (1.8 * C) + 32 ; -40 = C ; F = C

The value of this expression is "true". With the semicolon, the equals sign can be used both to assert that an equality is true (like assignment) and to test if something is true. There is no need for separate operators for assignment and testing for equality.

Variables

We have already seen parameters, used in rules. But Bertrand expressions can also contain variables. In the last section, F and C were variables. Variables are initially unbound, but they can become bound during the execution of a Bertrand program. Bertrand follows single assignment semantics, so a variable can only have a single value bound to it, and that value never changes.

Semantically, Bertrand treats binding variables as if a new rule was introduced. So for example, if the value 32 is bound to the variable F, this acts as if the following rule was added to the system:

F { 32 }

Note that F in the head of this rule is not a parameter, it is a litteral symbol (an atom) that matches a variable. Litterally, the symbol F is replaced by the number 32.

Types

Bertrand also has types, including structures. For example, litteral numbers (like 2 and 3.14159) are all of type constant. In Bertrand, a type is written beginning with a single quote. So the following rule (from bops) is used to add numbers together:

a'constant + b'constant { addition_primitive }

Where "addition_primitive" is the built-in primitive for adding two numbers together. This rule matches the expression 2 + 3 since both 2 and 3 are of type 'constant.

Within the body of a rule, you can declare new variables and give them a type. For example, here is a simple Bertrand program for solving a polynomial:

main { p: aNumber; q: aNumber; p = 3*q - 1*p - 5 ; 4 = q + 2 ; p, q}

The "aNumber" operator is defined in bops as:

#op aNumber nullary aNumber { true } 'numvar

Two new concepts here. First, a label can be placed in front of an expression (a variable name followed by a colon) to define a variable. Second, a rule can be given a type, which assigns that type to the variable name that was given in the label. For example, when p: aNumber is matched by the rule aNumber { true } 'numvar the varible p is created with type 'numvar. Note that as usual, none of this is fixed in Bertrand. The colon operator and the 'numvar type are defined in bops, and can be changed if you wish.

Note that the aNumber operator is rewritten to be just "true". See the section on structured types for more information on this.

The Simplicity of Bertrand

One of the unique things about Bertrand is that it has no semantics. As mentioned earlier, Bertrand itself only knows how to interpret rules and all semantic meaning in a Bertrand program is provided by those rules. Everything else has to be defined using rules, including arithmetic, declarations, and so on. So Bertrand itself is extremely simple.

In addition, Bertrand has very little syntax. Since Bertrand is used to define constraint languages, you want it to have as simple a syntax as possible. In Bertrand, the only reserved characters are period, curly braces, the hash sign, and all three quote marks (apostrophe, back tick, and double quote).

. { } # ' ` "

That's it! All other characters are used to define variables and operators. Variables are alphanumeric strings (including the underscore character) that begin with an alphabetic character.

Numbers are made up of digits and a single optional decimal point (for example, 5 or 3.14). Note that there are no negative litteral constants in Bertrand — if you write –2 that is interpreted as the minus operator followed by a number. Likewise, there are no constants in scientific notation in Bertrand. To write such a number, use the ^^ operator (defined in bops). For example, 2 ^^ 3 is the same as 2e3 in scientific notation, which is 2000. Likewise, –0.005 would be written as –5^^–3, where – and ^^ are operators, not part of a litteral number.

Pretty much everything else is an operator.

In addition to being used as a decimal point, the period symbol is also used for compound names (for example, rectangle.width). And two (or more) periods in a row start a comment, which runs to the end of the current line.

Operators

There are no built-in operators in Bertrand. Even operators like + and = are defined using rules (in bops). The minus sign is not even built-in, not even in litteral numbers like –3). In fact, parentheses are defined as an operator — they are not built-in to Bertrand. Operators can have names (like "sin" and "round", or they can be made up of one or two non-alphanumeric characters (like >= and *).

Operators are classified by how many arguments they take:

nullary operators take no arguments. For example, "true" is a nullary operator.
unary operators take one argument. Unary operators can be prefix (-x), postfix (5!), or outfix (parentheses).
binary operators take two arguments. Binary operators can be left associative (like + or /), right associative (like -> or ;), or non-associative (like | or &).

Note that a name or symbol can be used for more than one operator. For example, the minus sign is both a unary prefix operator (negation) and a binary left associative operator (subtraction).

Many operators are defined in bops, but you can easily define additional operators. In fact, you are not required to use bops at all, and can define all your own operators. Bops itself is very readable -- feel free to take a look at it.

Operators must be defined (using the Bertrand preprocessor) before they can be used in a rule. Bops contains lots of examples of the definition of operators.

Control Structures

Bertrand has a very simple and powerful conditional operator ->, defined in bops:

A -> B { ~(A & ~B) }

That's it. Just one rule (along with the other rules for Boolean arithmetic, shown above). This is read as "A implies B" (where a and b are both Boolean expressions). As in conventional IF statements, the -> operator says that if expression A is true, then expression B must also be true. In addition, like other operators in Bertrand, it works "both ways". In other words, if we know that expression B is false, then Bertrand will assert that expression A must also be false.

So, just like Bertrand is able to use the constraint F = (1.8 * C) + 32 to determine the value of either F or C, the expression A -> B allows us to determine either the value of A or B.

If you don't like the syntax of Bertrand's conditional operator, you can change it of course, by defining three operators and one rule:

#op if 240 unary prefix #op then 250 binary left #op else 250 binary left if a then b else c { a -> b; ~a -> c }

Bertrand has no iteration control structures (for or while loops). Instead, rules are allowed to be recursive. Here are the rules to calculate factorials (from the example "factorial"):

fact 1 { 1 } fact a'constant { a * fact (a-1) }

If you want, you could define your own interation operators.

Structured Types

Other languages call these "user defined types", but in Bertrand, all types are "user defined", including numbers and such. But the same facilities that are used to declare variables can be used to define structured types. For example, to define a datatype for a two-dimentional point:

#op aPoint nullary #type 'point

aPoint { x: aNumber ; y: aNumber; true } 'point

The aPoint operator is the constructor for a point, of type 'point. There is no data hiding in Bertrand, so you can access any member of a structured type:

p: aPoint; p.x = 5; p.y = 12;

The first line creates an object of type point, and the other two lines set the point's x and y values. Note that constructor operators do not have to be nullary, they can take one or two arguments that are used to initialize the new object.

#op newpoint binary xx newpoint yy { x: aNumber ; y: aNumber ; x = xx ; y = yy ; true }

Finally, note that our object constructing rules all return true. Instead, we can apply constraints to datatypes. For example, we could specify a constructor for a point that lies in the first quadrant, by replacing the final "true" with the expression "x > 0 & y > 0".

Rules and Constraints

A Bertrand program consists of two things: a set of rules that define a constraint solver, and a set of constraints that is solved. Bertrand merges these two sets using the common trick of having a special rule whose head is the single operator "main". We saw an example of this in the section on types. Initially, Bertrand is given a set of rules, and the set of constraints is a single constraint containing the "main" operator. This single constraint matches the rule whose head is "main" and replaces the set of constraints with the body of that rule.

In Bertrand, the set of constraints is written as a single expression. Individual constraints are typically separated by the semicolon operator (which really is an operator, not a statement terminator as in C or a statement separator as in Pascal). So a typical set of constraints will look like:

constraint1 ; constraint2 ; constraint3 ; result

For example,

F = (1.8 * C) + 32 ; F = C ; C

which contains two constraints and whose result is the value of C. Again, there is nothing special about the semicolon operator, the above program is completely equivalent to

F = (1.8 * C) + 32 & F = C ; C

Here the two constraints are separated by the AND operator instead of the semicolon.

Writing Rules

The purpose of rules in Bertrand is to solve a set of constraints. So in general, when you write rules, the rules should simplify the constraints. The main way to do this is to convert the constraint expression into a normal form. For example, the simultaneous equation solver puts equations into a normal form called an ordered linear expression. Look at the rules in beep to see how Bertrand does this.

A simpler example are the rules for Boolean algebra (which are found in bops). The rules convert Boolean expressions into normal form by converting expressions containing the OR operator "|" into equivalent expressions containing only the AND "&" and NOT "~" operators.

Another thing to remember is to avoid writing rules that never terminate. For example, you might be tempted to write a rule that expresses that addition is commutative:

a + b { b + a }

But such a rule will continue matching forever. Instead, the equation solver in beep uses a trick that orders variable names lexicographically. Thus the expression z + p will be rewritten by beep to put the variable names in alphabetical order p + z (but p + z will never be rewritten into z + p). This allows the equation solver to put expressions into normal form using the associative property of addition, without getting into infinite loops.