Spring 2005 - CS514 Fault-tolerant Distributed Computer Systems -- Assignment Phase I

Phase I: Distributed Banking System

Due: By the start of class on Tuesday February 22, 2005

General Instructions. Students should work together according to the teams formed in Phase 0. All members of the group are responsible for understanding the entire assignment.

No late assignments will be accepted.

Academic Integrity. Collaboration between groups is prohibited and will be treated as a violation of the University's academic integrity code. Within a group, we expect all members to do their share and team members can help one-another out.

Background: Building a Distributed Banking System

This assignment is intended to help you learn how to write distributed programs in Web Services using C# or, of you really don't want to work with C#, using Java. You will program a simple client/server that employs Web Services to talk from a client to a server. Eventually, the plan is for the servers to form groups (to load-balance and improve availability), and for these groups talk to one-another (thereby linking the branches into a single bank).

All the communication between servers will use UDP. You'll implement protocols from Part III of the book to replicate the data managed by the server so that you can demonstrate the ability to tolerate crashes. Then you'll measure performance and explore options for load-balancing and scaling the system up, so that the replication mechanisms used for fault-tolerance also bring a benefit of better scalability. But we're starting small, and these fancy server-to-server mechanisms will come in a second stage of the project. Right now you just need to support a routing infrastructure that will get messages from branch to branch over the unidirectional communication links defined by the connectivity graph you'll read as an input at startup time.

Our bank comprises a set of branches. Each branch manages a disjoint subset of the bank's accounts. Associated with each account is a unique account number, a balance, as well as other information that will be of less concern to us.

Customers may invoke the following operations on accounts:

Deposit( acnt, amt ): Cause the balance of account number acnt to be increased by the specified amount amt. Returns the new account balance.
Withdraw( acnt, amt ): Cause the balance of account number acnt to be decreased by specified amount amt. Returns the new (possibly negative) account balance.
Query( acnt ): Returns the balance of account number acnt.
Transfer( src_acnt, dest_acnt, amt ): Cause the balance of account number src_acnt to be decreased by amt and the balance of account number dest_acnt to be increased by amt. Returns the new (possibly negative) account balance of account number src_acnt.

What to Build

Structure of a Branch. The branches of a real bank would be physically separated and, therefore, each branch would have its own processors and would communicate with the other branches using some sort of network. We need not depart too far from reality, even if all branches execute on a single processor, by stipulating

that communication between branches is accomplished only by using UDP (unreliable datagram protocol), and
that communication from a client to a branch server is done by HTTP over TCP (later we might change this), in accordance with the Web Services standards

As noted above, we'll be using a cluster-styled server in Phase II and beyond, and will be interconnecting the clusters. Just to anticipate:

communication between members of a cluster of servers implementing a scalable, load-balanced, branch service is also accomplished only by using UDP, and
each branch can directly communicate only with the subset of branches defined to be its neighbors according to the network topology.

Notice that all of these options work equally well when a program is executed multiple times on a single computer or when it is executed once each on multiple computers connected by a network. A single machine could even run a whole server cluster. This will let you do development on a single machine or on a set of side-by-side machines in the CSUG lab.

You can visualize the bank's client systems as leaves on a tree or some other kind of connected graph, each linked to its local branch. The "nodes" to which the clients link are the members of a branch server cluster (perhaps just a single process, but perhaps multiple processes sharing replicated data for availability and scalability).

Don't assume that the graph is a simple rooted tree. To make our project a bit more interesting, we'll work with a connected graph, in which the links aren't even bi-directional. We recognize that this is a bit arbitrary, but it will let you gain experience with some mechanisms you might not otherwise have a chance to implement, like distributed routing algorithms.

Structuring the Phase I bank server. Structure your distributed bank as follows. Implement each branch as two C# or Java applications --- a branch server and a branch GUI. A client computer will run the branch GUI application, which communicates (only) with its branch server. In contrast, branch servers directly communicate with other branch servers. Most of the complexity of our project will be in the branch server applications.

Associated with each branch server is a pair of network addresses to which messages destined to that branch server can be sent. One of these network addresses will correspond to a TCP socket bound to an IP address and a port number for use by Web Services clients (namely, the branch GUI). The other will be a UDP socket bound to a different port number, for use in communication between branches. In these assignments, when we talk about UDP communication between branches, we'll write B.ServPort to denote the network address associated with the branch server B for inter-server communication. This UDP socket will be created and managed by you as a programmer, in contrast to the Web Services address, which is generated automatically and hence less visible to the programmer. Read about the UdpClient class to learn how to create and use this UDP socket.

One implication of this architecture that is even though we'll eventually be implementing our branches using clusters of servers, the cluster as a whole will still be accessed through just a single "load balancing" front end. Thus, once you start to deal with load-balancing, it will be necessary to redirect incoming Web Services requests from this single network address to an appropriate server within the cluster, so as to keep loads uniform. When we get to that stage, we'll solve the problem using the same mechanisms used by commercial datacenters. However, in Phase I we won't be implementing clustering, so this is just something to keep in mind for later.

We would like you to use Web Services as the client-to-server communication standard. Within C#, you can do this by pulling the ASP.NET template onto your application inside the Visual Studio C# programming environment; it will ask questions to guide you through the setup and you'll have a chance to tell it to use TCP, etc. Later, the option of reprogramming the client to server communication protocol to run over UDP is something you may want to consider; initially, though, we'll be content with the normal HTTP over TCP "default" provided by ASP.NET if you just let it do things in the most standard way.

If the Internet Information Service (IIS) is running on your server computer you can use the Web Services discovery mechanisms in client systems. But we know that not all machines are set up with IIS enabled. If IIS is not enabled on your machine, clients should still be able to access the server if you know the IP address to use (have the server check and display it somehow, then type it in on the client machine) and the port number (either pick a number, or let Windows assign one and type that in too).

Inter-Branch Communication Limitations. We're initially going to work with a single server for each branch. Communication between these branches is constrained to follow a given network topology (a weird constraint, but one that makes the problem more interesting). Therefore, construct your system to accept as an additional input the fixed, static, interconnection graph that defines which branches can send messages to which others. This graph should be input from a text file. Each line of that file should have the form

B1 B2

where B1 and B2 are names of branches; such a line in the file would assert that the network topology allows B1 to send messages to B2. The network topology might have uni-directional connections. Thus, just because branch B1 can send to B2 this does not imply that branch B2 can send to B1. However, you may assume that all branch servers have access to identical copies of this file, so all agree on the network topology for the system. Moreover, you can assume that there exists some "route" from any server Bi to any other server Bj. A route may require that messages be forwarded through some other branch; this routing is something you'll need to implement.

To ensure that your servers can't violate the restrictions in the topology file, simulate the limitations that the network topology imposes on inter-branch communication by writing a class that branches must use to send messages. This "wrapper" class (so named because it encapsulates network communications operations) should limit the allowed use of C#'s operation for performing a UDP send, UdpClient.Send.

Whenever the wrapped send operation at a given branch B is invoked, the wrapper checks to see if the destination address corresponds to a branch that is directly reachable from B (by consulting the interconnection graph).

IF the destination is reachable THEN the real UDP operation UdpClient.send is invoked for the named socket.
IF the destination is indirectly reachable THEN the wrapped send does not invoke UdpClient.send and instead forwards the message to the next branch along the route you've decided to use.
ELSE the destination is not reachable at all. This is a serious error and should throw an exception or pop up a "MessageBox", since it isn't supposed to happen.

Your wrapper class should also introduce a new operation --- whoNeighbors --- which returns the names of all branches that are directly reachable (along one edge of the interconnection graph) from the invoker.

Account Numbers. You may assume that account numbers are of the form bb.aaaaa where bb is a 2-digit numeric value that designates a branch and aaaaa is a 5-digit numeric value that identifies an individual account within that branch. Also assume that an account comes into being (with a 0 balance) by virtue of any invocation of any operation that names that account number --- an assumption that is unrealistic but will nevertheless eliminate the possibility of operations on undefined accounts.

Client and Server: Branch GUI and Branch Server. Here, then, are descriptions of the two C# applications to be built.

BranchGUI is the client program. It creates a window on the console. This window should allow invocation of Deposit, Withdraw, and Query operations for any account managed by that branch; the window should also allow invocation of a Transfer operations involving a src_acnt managed by that branch and a dest_acnt that is managed by an adjacent branch. Messages from BranchGUI will specify operations that the server should perform on accounts.
BranchGUI at a branch B communicates with the associated branch server using HTTP over TCP, which is the default for a Web Services application. In C#, much of the mechanism needed will be automated produced when you pull the ASP.NET template onto the application; you can then modify it in later stages of the project.
For communication between branch servers, BranchServer B receives UDP messages on B.ServPort. . These messages should be processed one at a time, in the sequence they are received.
The complexity of the application resides in the possibility that operations will be issued to remote accounts: accounts not associated with the branch server to which the branch GUI is talking. In such cases the request must be routed to the appropriate branch server and the reply routed back, and this must be done reliably so that even if UDP packets are lost, the operation will still be completed. (For now, don't worry about server crashes). Operations Deposit, Withdraw, and Query can be handled entirely by a single branch server. These are therefore, reasonably straightforward. A Transfer operation can be implemented by (i) doing a local Withdraw and then (ii) having that branch server sending a message to B'.ServPort for the branch B' that manages dest_acnt so that the appropriate Deposit occurs.
Again, keep in mind that all server to server communication should be by hand-coded UDP-based protocols of your own design. Web Services are used ONLY for branch GUI to branch server communication, not between servers.

Adhering to the above specified structure (branch server, branch GUI, Web Services network communication from GUI to server and UDP from server to server, and use of the branch server wrapper class) is necessary so that your implementation will be usable in subsequent phases of the project.

Implementation Notes

Develop your system under Windows/NT -- otherwise we can't help you if you get stuck.
Test your branch server to branch server architecture to confirm that your protocol can tolerate packet loss and can route messages correctly even when the branch topology is unidirectional (for example, try a "ring" topology).
Consider building a sophisticated testing environment. For example, you might try modifying the communications wrapper to drop UDP packets with some small random probability (e.g. it could drop 3% of all packets). Even under load, with loss, your application should still work....
Visual Studio C# can be run on the CSUG machines off the start menu.
For those who insist on using Java, that language is installed in the Undergraduate Lab and can be found in the path g:\jdk1.2.2\bin. To use JAVA, open a CMD shell and set the PATH to point to the JAVA binaries with the command
set PATH=g:\jdk1.2.2\bin;%PATH%.
You can then run all the JAVA binaries directly from the shell.
GUI programming is really easy in C#; you just build a "Windows Forms" application and drag and drop the various things you need onto the blank form it shows you, which will look just like the standard Windows interface. You define the operations that should be performed when a button is pushed, data is entered in a window, or a menu pull-down is clicked. The online help system is extremely useful and will include anything you could possibly want to do, in convenient cut-and-paste format.
With at least three people in a team, an obvious way to partition this assignment is for a different person to take the lead on (i) the branch server, (ii) the branch GUI, and (iii) the network wrapper class. But recall that everyone must understand the operation of the entire system once it is submitted for grading.

Submission Procedure. Create a directory containing the files you wish us to grade. Using the "Winzip" application, compress the contents of this directory into a single archive and email Vivek the archive with a reminder of who the team members are.

Should you wish to revise your submission after you have emailed it to Vivek, simply correct the files and resend the entire archive. We'll grade the last submission we receive and will discard older ones. No late submissions will be graded, so please get things in on time.

Your directory should contain the following files (at least):

TEAM which contains the names (and net-ids) for all team members. Also, for each team member give a 1 or 2 paragraph description of the tasks this team member performed and the number of hours this required.

README which contains

The names and a description of the contents for the other files in the directory.
Instructions for installing, compiling, and running your software on our Windows-NT system.
A tutorial that the grader can follow to start your software and to convince himself that your system implements the required functionality. Expect the grader to spend at most 10 minutes on this task.

TOPO should specify an interesting interconnection topology for a multi-branch bank that will be used to illustrate the operation of your system.

A source file that contain the C# or Java source needed to compile and run your system.

Don't underestimate what is involved in writing instructions for installing, compiling, and running your software. Do path values have to be set? Must the software be installed in a particular directory? Must the name of the host processor be put someplace? The easier it is to install your system, the more-kindly disposed the grader will be in evaluating your efforts.

Grading. Your grade will be based on the following elements:

Does your system satisfy the requirements on system-structure outlined above?
Does the system operate correctly?
How easy is it to follow the README file installation?
Is the source code easy to understand and does it exhibit good structure?