CS414 Fall 2003 Homework 4

CS414 Fall 2003 Homework 5. Due in class on November 4, 2003

1. In class, we learned that the TCP protocol uses a sliding window algorithm for two reasons. First, it allows TCP to overcome packet loss. Second, it lets the sender match its rate both to what the network can accommodate and to the rate of the receiver. Specifically, TCP increases its sending rate until packet loss is detected, then reduces the rate. It does this by making the sliding window larger or smaller, respectively.
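The window adjustment described above can be pictured with a minimal sketch. The specific amounts (grow by one packet, halve on loss) and the names next_window and trace are assumptions made for illustration; the problem statement only says the window grows until loss is detected and then shrinks.

```python
def next_window(window, loss_detected, min_window=1, max_window=64):
    """Return the sender's next window size, in packets.

    Assumption: grow by 1 when no loss is seen, halve when loss is detected.
    The bounds min_window and max_window are illustrative, not TCP constants.
    """
    if loss_detected:
        # Loss is read as a signal to slow down: shrink the window.
        return max(min_window, window // 2)
    # No loss: probe for more capacity by enlarging the window.
    return min(max_window, window + 1)


def trace(loss_events, start=1):
    """Apply next_window once per event and return every window size seen."""
    window = start
    sizes = [window]
    for lost in loss_events:
        window = next_window(window, lost)
        sizes.append(window)
    return sizes


# Three loss-free steps, one loss, then two more loss-free steps.
print(trace([False, False, False, True, False, False]))
# prints [1, 2, 3, 4, 2, 3, 4]
```

The sketch reacts to every loss the same way, whatever caused it, which is the behavior the parts below ask you to think about.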

a) Suppose that you were to use TCP over a wireless network, in which packet loss occurs frequently. How will TCP behave?

b) Is the behavior you described in (a) the right reaction to packet loss in a wireless network?

c) Now suppose that you are designing a large corporate network with mostly wired links, but some wireless links to mobile machines “at the edge.” What might you consider doing to avoid the problem you described in part (a)?

d) [Extra credit*] Read about the “End to End Argument”. (J. Saltzer, D. Reed and D. Clark: “End-to-end arguments in system design”, ACM Trans. Comp. Sys., 2(4):277-88, Nov. 1984). Does your answer to part (c) violate the principle advocated in this paper? Explain.

* Bumps your cumulative homework score up by 3 points, but not beyond the “maximum” possible.

2. A computer system consists of five machines, named M₀ … M₄. Each machine has an attached input device, and each reads a single bit from its device (hence, 0 or 1).

a) Assuming that no failures occur and that messages are delivered promptly and reliably, design a protocol whereby each machine can “vote” for its value (0 or 1). Your protocol should pick a value that at least one machine actually voted for (you can’t just say “always pick 0”). Each machine must learn the outcome of the election. Design your protocol to run in “rounds”, such that in each round, each machine sends four messages (one to each of its counterparts) and receives four (one from each of its counterparts).

b) Modify your protocol from (a) to work correctly even if a single machine fails while the protocol is running (you may assume that the failure is “reported” to each operational machine).

c) Modify your protocol from (a) to work correctly even if some machine might behave maliciously, for example by trying to vote twice, ignoring some messages, telling one machine that its input was a 0 and another that its input was a 1, etc. It cannot, however, impersonate some other machine – and more generally, you may assume that the faulty machine cannot prevent the correct machines from running rounds, or tamper with messages sent from one correct machine to another correct machine.

d) Can the problem from part (c) be solved with just three machines instead of five? Either explain how to do it, or explain briefly why this isn’t possible.
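The round structure required in part (a) can be sketched as follows, purely to make the message pattern concrete. The function name run_round, the inbox representation, and the sample input bits are hypothetical. The sketch models only failure-free, reliable delivery and leaves the decision rule open.

```python
def run_round(payloads):
    """Simulate one round of all-to-all exchange.

    payloads[i] is the message machine i sends to each of its counterparts.
    Returns inbox, where inbox[j] maps a sender's index to the message that
    machine j received from that sender.
    Assumption: no failures, and every message is delivered within the round.
    """
    n = len(payloads)
    inbox = [dict() for _ in range(n)]
    for sender, msg in enumerate(payloads):
        for receiver in range(n):
            if receiver != sender:  # a machine does not message itself
                inbox[receiver][sender] = msg
    return inbox


# Hypothetical input bits read by machines M0 through M4.
bits = [0, 1, 1, 0, 1]
inbox = run_round(bits)

# With five machines, each one sends four messages and receives four.
print([len(received) for received in inbox])  # prints [4, 4, 4, 4, 4]
print(inbox[0])  # prints {1: 1, 2: 1, 3: 0, 4: 1}
```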

3. Suppose that a program has a virtual address space of size 20MB and is running on a computer with 2MB of memory. You would like to speed up the program, so you begin to add memory. At first, each time you add 1MB of memory you find that the program indeed speeds up. But after the physical memory size reaches 10MB, adding memory stops having any effect.

a) Explain why extra memory stops helping, even though the virtual memory of the program is twice as large as the physical memory of the computer.

b) Your close friend Doug “The Bug” Crump drops by and shows you that inside your computer, the “motherboard” has a speed setting. By changing it, he is able to double the speed of the CPU without affecting the speed of anything else. To your surprise, you find that adding physical memory now “helps” again, and you can obtain a further speedup. Explain why this happened. (Hint: This is actually not at all easy. If nobody gets it, Ken won’t be surprised.)

4. When modifying the Linux kernel late one night, Doug accidentally modifies the context switching software so that sometimes, when context switching from process P₀ to process P₁, the system forgets to flush the TLB and main-memory (L2) cache.

a) What sorts of problems might result from this mistake?

b) Doug realizes what he has done, but in fixing it, he modifies Linux to flush the TLB and cache not just on every context switch, but also on every interrupt (including page faults, system calls, device interrupts, etc.). What impact would you expect this to have on the system?