CS4414: Systems Programming

Fall 2020, Tuesday and Thursday 3pm-4:15pm, with a required recitation Monday 4:55PM - 6:10PM (116 Kennedy Hall or via Zoom)
Online lectures, hybrid recitations, streamed synchronously (netid required).  Instructor:  Ken Birman
3 credits, letter grades only.  Enrollment capped.   You are welcome to sit in on the class (audit), but the university seems to have a no-AUD policy this fall, and we can't help you on that.  If you do audit this course yet you would need to do the homework to gain hands-on experience with C++.  However, we would not be able to grade that work. 

New: retroactive to Fall 2020, 4414 can be used to satisfy the systems-area course requirement for CS majors, as an alternative to taking CS4410.  However, it does not satisfy the practicum requirement, and cannot be used as a prerequisite or corequisite for people who wish to take CS4411.
Zoom links:  Online lectures (full zoom info), Recitations (full zoom info).  A list of recordings for asynchronous viewing is on our Piazza board, in a pinned post.

Systems programming is a new course.  It was introduced in the fall of 2020, and is gradually becoming one of the main options for students in the major as a primary way to learn about systems (side by side with CS3410, which teaches you about computer architecture, and optionally, with CS4410, which teaches how operating systems are implemented).

The class aims at students who are proficient in an object-oriented programming language like Java or Python, and have completed a course on data structures, but would like a deeper understanding of how "real world" computing systems are built.   In CS4414 you will learn an additional programming language, C++, and will learn to use the Linux system as a way to create high quality software.  This pair of technologies has gained nearly universal adoption at every level, from small devices hidden within "smart things" to the world's largest cloud computing systems.  As such, the skills you gain in CS4414 are very broadly relevant, no matter what you plan to do in your career.

We expect a mix of students: some may have only taken CS2110 and be taking this as a second or third course in computing; others could be declared CS majors, ECE majors, or even MEng students seeking to broaden their programming skills. CS4414 may also be of interest to students exploring embedded device programming in robotics, digital agriculture, Internet of Things, or other settings where sensors and actuators are deployed.  Because this is a new course, we are capping enrollment for the first few years, using the standard CS policies to prioritize the wait list.  However, over time, we hope to increase the cap, until the class reaches the same size as other undergraduate courses.

Among our practical goals will to learn to leverage existing Linux tools, to learn how to write correct code in C++, and how to achieve performance and efficiency.  Like any programming language, you really teach yourself by doing, but we will present C++ and Linux in the required section.  Assigned readings and homeworks will help you build up hands-on proficiency.  C++ and Linux are easy to learn if you are comfortable in some other object oriented programming language like Java, so we will move quickly (this is not a course for people who struggle with programming or who have never seen object-oriented code and learned about data structures).  You'll also be reading a famous C++ self-teaching textbook, written by the inventor of the language.  This will begin early in the semester, so be ready to work hard in the first few weeks! 

CS4414 is really not a programming course, or a Linux course.  The core intellectual material focuses on the way that modern applications are often created by combining two or more programs, which talk to one-another over some mixture of pipes, files, mapped files (shared memory), networking (messages sent over tcp), etc.  We will discuss and security abstractions for isolation and authorization, and the best ways of building applications that use these technologies in correct ways.  All of these are concepts you'll find valuable in your work, no matter where computing might lead you.

Modern computers are based on NUMA processors (chips with multiple CPUs in the single machine), and leveraging NUMA sometimes entails writing programs with multiple threads running in parallel while sharing memory.  NUMA computers have a variety of interesting features that include hardware support for parallelism (such as for image processing, computer vision tasks, machine learning), multiple levels of hardware caching (important for performance), and can support many styles of locking and synchronization.  We'll focus on monitors, an approach that is highly flexible, nicely supported in C++, and promotes correctness.  Late in the semester, we will also look at some distributed systems abstractions that extend the idea of correct synchronization to cover applications spanning more than one computer, specifically state machine replication, leader election and crash-failure tolerance.

Although there is some overlap between CS4414 and other CS courses, such as CS3410 (computer architecture) and CS4410 (operating systems), most material in CS4414 isn't covered in any other existing class, and this course is not really intended as a replacement for either of those, nor do we assume you have taken either of them.  

Prerequisites: CS4414 is not an introduction to programming.  Everyone is expected to already know an object-oriented language such as Java, at the level of CS2110 or equivalent (object oriented programming and data structures).  CS3410 would be useful, but is not required, and CS4414 can be taken side by side with that course. 

We do assume that you have encountered C in the past, but we really don't assume very much experience in C programming.   Students who are very nervous about their programming skills should take CS3410 first; they will learn C in that context, will get additional coding experience, and thus will enter CS4414 with a solid background (for example, they would know how to write methods in C, and how to use malloc and free, and how to write code that includes pointers, loops over arrays, etc).  But CS3410 is not a prerequisite for CS4414: 4414 will teach you everything you need to know with respect to the aspect of computer architecture (NUMA machines) that is important in the CS4414 setting, and in contrast, CS3414 has very little NUMA material.  The small amount of C that CS3410 students already know doesn't given them any major advantage, and students who enjoy programming and yet have never seen C can easily pick up this background by applying themselves and studying in the first few days of the class.

Use for the systems requirement:  It has been proposed that CS 4144 be approved as a way to satisfy either the systems course requirement in the CS major, or the practicum (but not both -- a student would need to designate their preference).  This proposal has NOT yet obtained department and CIS approval.  Until it does, CS 4414 does not satisfy either requirement.


Getting Help

Piazza Link
Office Hours Held online in Zoom
FAQ Link

Course Materials

Schedule Lecture schedule, slides, recitation notes, readings, and code
Assignments We post the assignments and quizzes on CMS.
Exams There will not be any in-person exams.
Lab Machines Instructions for using the lab machines are on the CS ugrad/MEng web pages
Resources Additional course resources are listed in the right-hand column on the syllabus

Course Information

For details See the course syllabus for details.
Lectures Tuesday and Thursday 3pm-4:30pm, with a required recitation Monday 4:10-5:25
Textbooks [1] Randal E. Bryant and David R. O'Hallaron,
Computer Systems: A Programmer's Perspective, Third Edition, Pearson, 2016
  [2] Bjarne Stroustrup,
A Tour of C++ (2nd Edition), Addison-Wesley Professional, (July 9, 2018)
  [3] Linux.  In fact we are not going to recommend a Linux book, although there are a great
many of them.  Linux has online documentation for all the commands we expect you to use, and
also for the "system calls" your code will use from C++.  Moreover, there is a built-in help feature in
the bash shell, so you can easily obtain a list of shell commands, or details on how each specific
command works and what arguments to use if you want custom behavior.  We prefer that you use
these standard ways of getting help.  Linux textbooks are fine but not really better in any sense.
Credit 3 credits
Grading In fall 2020, grading will be based on lab (homework) 75% and take-home quizzes 25%.
Labs There are 7 labs, not evenly weighted. See the assignments page for the breakdown.
Exams There are 5 take-home quizzes, which should be completed within 24 hours after you download them
Home http://www.cs.cornell.edu/courses/cs4414/2020fa
Questions Piazza, office hours, email
CMS We are using the department's CMS system for this course.

Main Instructor

Name Ken Birman
Contact ken@cs.cornell.edu,
x5-9199
Office 435 Gates Hall
Office Hours Tuesday and Thursday after class (just stay on the Zoom), or on Zoom by appointment.

Recitation Instructors

Name Sagar Jha, Alicia Yang

Undergraduate TAs

Name Lucy Jingyi Lu, Ishaan Thakur, Zheng Wang