Computational Methods for the 

Generation of Spatially Balanced Latin Squares

 

Carla Gomes, Meinolf Sellmann, Cindy Van Es, Harold Van Es

Cornell University

 

 

Caption: spatially-balanced Latin square of order 6

Caption: Average distance between pairs of symbols.

Java Applet with Spatially Balanced Latin Squares

Generation of Spatially Balanced Latin Squares

In a spatially-balanced Latin square all pairs of symbols  have the same total distance in the Latin square. The distance of two symbols in a given row is the difference between the column indices of the symbols. The existence of spatially-balanced Latin squares is an open question in combinatorics, and no polynomial time constructions for the generation of spatially-balanced Latin squares have been found yet. Therefore, in order to get some insights into the structure of spatially-balanced Latin squares we used general local and complete search methods. We discovered that local search methods do not scale well on this domain, failing to find the global optimum for instances larger than order 9. On the other hand, our results with a CP based approach were very promising and we could generate totally spatially-balanced Latin squares up to order 18. Furthermore the CP based models provided us with interesting insights about the structure of this problem that allowed us to conjecture the existence of polynomial time constructions for generating spatially-balanced Latin squares. We are currently working on finding such efficient constructions.
 

An Applet visualizing Spatially Balanced Latin Squares

Using the local search and the Constraint Programming approaches, we pre-computed spatially balanced Latin squares. Go to our Java Applet with Hundreds of Spatially Balanced Designs. The applet contains fully-balanced designs as well as sub-optimal solutions found by the local search approach.

 

Motivational Application:  Agronomic Field Experiments [van Es and van Es 93; van Es 2002]

In agronomic field experiments, one has to test and compare different soil treatments. Two different soil treatments may correspond to two different fertilizers or two different ways of preparing the soil. In order to perform statistically significant field experiments, it is important to test different treatments in an unbiased way. Most agronomic field experiments are implemented through randomized complete block designs (RCBD) where each block has as many experimental units as treatments . Use of blocks is in most cases justified by spatial variability in fields, and this layout is an attractive way to organize replications. This approach to experimental design uses random allocation of treatments to plots, which is used to ensure that a treatment is not continually favored or handicapped in successive replications by user bias or some extraneous source of variation .

Consider the following example of an RCBD:

Use any fixed assignment of treatments to symbols:

 

Treatment 1 = A
Treatment 2 = B
Treatment 3 = C
Treatment 4 = D
Treatment 5 = E

Then permute the the symbols randomly within each block:

 

Block 1    D   B   A   E   C
Block 2    A   B   E   C   D
Block 3    B   A   C   D   E

Although this randomization approach is intuitively attractive, it has been shown to cause biases and imprecision under most field conditions (Van Es 92. The reason for this is that underlying soil characteristics are typically non-random and show field trends, spatial autocorrelation, or periodicity . For example, fertility patterns in fields often exhibit high and low areas due to, among others, erosion, drainage variability, and management history. The classical randomization process does not explicitly account for such field patterns, and many realizations of such RCBD designs may result in undesirable outcomes.

The first problem is the spatial distribution of individual treatments. In the example above, we see for example that treatments 1 and 2 (symbols A and B) are planted on the west side of the field in all blocks. What often happens in practice is that the researchers choosing the design will iterate the randomization process until they are content with the spatial distribution of the individual treatments - a fact that makes the whole purpose of randomization obsolete. To overcome this problem, it has been suggested to consider Latin square designs. A Latin square has the same number of blocks as there are treatments, such that one treatments occurs in each column exactly once:

Consider the following example of a Latin square design:

First randomize the assignment of treatments to symbols:

 

Treatment 1 = E
Treatment 2 = C
Treatment 3 = D
Treatment 4 = A
Treatment 5 = B

Then choose a pre-computed Latin square to assign treatments to concrete slots:

 

Block 1    D   B   A   E   C
Block 2    A   E   B   C   D
Block 3    B   A   C   D   E
Block 4    E   C   D   A   B
Block 5    C   D   E   B   A
Another problem with RCBDs is the expected distribution of pairs of treatments. Looking at the Latin square above, we see that, in every block, treatments 4 and 5 (symbols A and B) are always planted very close to each other, while treatments 2 and 5 (symbols B and C) have a larger average distance within each block. Due to spatial correlations in the field, such a design is highly undesirable. Therefore, van Es and van Es proposed spatially balanced experimental designs that are inherently robust to non-random field variability. Like for Latin square designs, this approach uses dummy indicators and the treatments are randomly assigned to the indicators. In other words, treatments are randomly allocated to optimized designs, rather than to plots. Such designs may be spatially-balanced complete block designs or spatially-balanced Latin squares, the latter being a special case of the former. In a spatially-balanced Latin square all pairs of symbols (treatments, in agronomic terms) have the same total distance in the Latin square. The distance of two symbols in a given row is the difference between the column indices of the symbols.

Consider the following example of a fully-balanced Latin square design:

First randomize the assignment of treatments to symbols:

 

Treatment 1 = A
Treatment 2 = C
Treatment 3 = B
Treatment 4 = E
Treatment 5 = D

Then choose a pre-computed spatially-balanced Latin square to assign treatments to concrete slots:

 

Block 1    A   B   C   D   E
Block 2    B   E   D   A   C
Block 3    C   D   B   E   A
Block 4    D   A   E   C   B
Block 5    E   C   A   B   D

References


Carla Gomes and Meinolf Sellmann Streamlined Constraint Reasoning. Proc. CP04, 2004.

Carla Gomes, Meinolf Sellmann, Cindy van Es, and Harold van Es The Challenge of Generating Spatially Balanced Scientific Experiment Designs. To appear in Proc. CP-AI-OR , 2004.

Harold van Es,  Sources of soil variability. In Methods of Soil Analysis, Part 4: Physical Properties, Soil Sci. Soc. Am, 2002.

Harold van Es and Cindy van Es. The spatial nature of randomization and its effects on outcome of field experiments. Agronomy Journal, 85, pp 420-428, 1993.