CS 3700 - Networks and Distributed Systems
Project 1: Socket Basics
Description
This assignment is intended to familiarize you with writing simple network code.
You will implement a client program which communicates with a server using sockets.
The server will ask your program to do some basic string manipulation and counting.
If your program successfully counts all of the strings, then the server will return a secret flag that is unique for each student.
If you receive the secret flag, then you know that your program has run successfully.
This project must be completed individually.
Language
You can write your code in whatever language you choose, as long as your code compiles and runs on unmodified Khoury College Linux machines on the command line. Do not use libraries that are not installed by default on the Khoury College Linux machines. You may use IDEs (e.g. Eclipse) during development, but do not turn in your IDE project without a Makefile. Make sure your code has no dependencies on your IDE and that you've successfully tested it on a Khoury Linux machine, such as the Vagrant image or login.
Protocol
The server runs on the machine fring.ccs.neu.edu and listens for requests on a TCP socket bound to port 27993.
This exercise has four types of messages: HELLO, FIND, COUNT, and BYE.
Each message is an ASCII string consisting of multiple fields separated by spaces (0x20) and terminated with a line feed (0x0A, \n).
The maximum length of each message is 8192 bytes.
Messages are case sensitive.
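As a minimal sketch of this framing, written in Python purely for illustration (any language that runs on the Khoury machines is acceptable), the helper below joins fields with single spaces, appends the line feed, and enforces the 8192-byte limit. The name encode_message and the exact checks it performs are assumptions made for this example, not part of the assignment.

```python
MAX_MESSAGE_BYTES = 8192  # maximum message length stated in the protocol


def encode_message(*fields: str) -> bytes:
    """Join fields with single spaces (0x20) and terminate with a line feed (0x0A)."""
    for field in fields:
        # A space or newline inside a field would change how the receiver splits it.
        if not field or " " in field or "\n" in field:
            raise ValueError(f"invalid field: {field!r}")
    # encode("ascii") raises UnicodeEncodeError (a ValueError) on non-ASCII input.
    data = (" ".join(fields) + "\n").encode("ascii")
    if len(data) > MAX_MESSAGE_BYTES:
        raise ValueError("message exceeds 8192 bytes")
    return data
```

For example, encode_message("cs3700spring2020", "HELLO", "001234567") yields the bytes of a HELLO message for the hypothetical ID 001234567. Passing the ID as a string keeps its leading zeroes intact.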
The protocol works as follows.
The client initiates the protocol by creating a TCP socket connection to the server.
Once the socket is connected, the client sends a HELLO message to the server.
The format of the HELLO message is:
cs3700spring2020 HELLO [your NEU ID]
In your program you should replace [your NEU ID] with your NEU ID (including any leading zeroes). You must supply your NEU ID so the server can look up the appropriate secret flag for you. The server will reply with either a FIND message or a BYE message. The format of the FIND message is:
cs3700spring2020 FIND [A single ASCII symbol] [A string of random characters]
The two variable fields contain (1) a single ASCII symbol, such as "A", "f", "4", or "%", without quotes, and (2) a string of random characters. Your program must count the number of times the given ASCII symbol appears in the random string and return this count to the server in a COUNT message. The COUNT message has the following format:
cs3700spring2020 COUNT [the count of the given symbol in the given string]
It is okay for the count to be zero.
The server will respond to the COUNT message with either another FIND message, or a BYE message.
If the server terminates the connection, one of several things may have happened:
- The format of the message was incorrect
- The count your program returned was incorrect
- Your program took too long to respond and the server closed the connection
- (If you receive no messages) The server was busy and couldn't handle your request
The format of the BYE message is:
cs3700spring2020 BYE [a 64 byte secret flag]
The 64-byte string is your secret flag: write this value down, since you need to turn it in along with your code. Once your program has received the BYE message, it can close the connection to the server. If the server returns "Unknown_Husky_ID" in the place of the secret flag, that means it did not recognize the NEU ID that you supplied in the HELLO message. You may validly print out "Unknown_Husky_ID" in this case.
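The whole exchange described in this section can be sketched as a single loop. The version below is one possible shape in Python, not a reference solution: the function name run_exchange is hypothetical, it assumes the random string contains no spaces, and it performs only light format checks.

```python
import socket

PREFIX = "cs3700spring2020"
MAX_MESSAGE_BYTES = 8192


def run_exchange(sock: socket.socket, neu_id: str) -> str:
    """Send HELLO, answer each FIND with a COUNT, and return the BYE payload."""
    reader = sock.makefile("rb")  # readline() keeps reading until "\n" or EOF
    sock.sendall(f"{PREFIX} HELLO {neu_id}\n".encode("ascii"))
    while True:
        line = reader.readline(MAX_MESSAGE_BYTES + 1)
        if not line.endswith(b"\n"):
            raise ConnectionError("connection closed or message too long")
        parts = line[:-1].decode("ascii").split(" ")
        if parts[:2] == [PREFIX, "FIND"] and len(parts) == 4 and len(parts[2]) == 1:
            # Count occurrences of the symbol; zero is a valid answer.
            count = parts[3].count(parts[2])
            sock.sendall(f"{PREFIX} COUNT {count}\n".encode("ascii"))
        elif parts[:2] == [PREFIX, "BYE"] and len(parts) == 3:
            return parts[2]  # the secret flag, or "Unknown_Husky_ID"
        else:
            raise ValueError(f"unexpected message: {line!r}")
```

A caller would open a TCP connection to the server first (for instance with socket.create_connection) and print the returned string.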
Your client program
Your client program must execute on the command line using the following command.
$ ./client <-p port> <-s> [hostname] [NEU ID]
- The -p port parameter is optional; it specifies the TCP port that the server is listening on. If this parameter is not supplied on the command line, your program must assume that the port is 27993.
- The -s flag is optional; if given, the client should use an SSL encrypted socket connection. Your client only needs to support -s if you are trying to get the extra credit point.
- The [hostname] parameter is required, and specifies the name of the server (either a DNS name or an IP address in dotted notation).
- The [NEU ID] parameter is required. Your code must support NEU IDs that have leading zeroes (do not strip them!).
Your program should print exactly one line of output: the secret flag from the server's BYE message. If your program encounters an error, it may print an error message before terminating. Your program must not write any files to disk, especially to the secret_flags file!
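The command-line parameters listed above could be handled along the following lines. This is a sketch using Python's standard argparse module. The attribute names (port, use_ssl, hostname, neu_id) are assumptions chosen for this example, and the 27994 default applies only to the extra-credit SSL mode.

```python
import argparse


def parse_args(argv):
    """Parse: ./client <-p port> <-s> [hostname] [NEU ID]"""
    parser = argparse.ArgumentParser(prog="client")
    parser.add_argument("-p", dest="port", type=int, default=None,
                        help="TCP port of the server")
    parser.add_argument("-s", dest="use_ssl", action="store_true",
                        help="use an SSL encrypted connection")
    parser.add_argument("hostname", help="DNS name or dotted IP address")
    # Kept as a string so that leading zeroes are never stripped.
    parser.add_argument("neu_id", help="NEU ID, including leading zeroes")
    args = parser.parse_args(argv)
    if args.port is None:
        # Default ports from the assignment: 27993 plain, 27994 for SSL.
        args.port = 27994 if args.use_ssl else 27993
    return args
```

A program would typically call parse_args(sys.argv[1:]). On malformed input argparse prints a usage message and exits, which is consistent with the rule that an error message may be printed before terminating.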
Other Considerations
You may test your client code with our server as many times as you like. Your client should conform to the protocol described above; otherwise, the server will terminate the connection silently. Your client program must verify the validity of messages by strictly checking their format, because the server may send corrupted messages just to try to crash your software. If a received message is not as expected, such as an incorrect field or wrong message type, you must assert an error and terminate your program. You should be strict; if the returned message does not exactly conform to the specification above, you should assert an error. Remember that network-facing code should be written defensively.
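One way to approach this strict checking is to validate every property the protocol states before trusting any field. The Python sketch below is illustrative only: ProtocolError and parse_server_message are hypothetical names, and it assumes that neither the symbol nor the random string contains a space.

```python
PREFIX = "cs3700spring2020"
MAX_MESSAGE_BYTES = 8192


class ProtocolError(Exception):
    """Raised when a server message does not match the expected format."""


def parse_server_message(raw: bytes):
    """Return ("FIND", symbol, text) or ("BYE", flag); raise ProtocolError otherwise."""
    if len(raw) > MAX_MESSAGE_BYTES:
        raise ProtocolError("message exceeds 8192 bytes")
    # Exactly one line feed, and it must be the final byte.
    if not raw.endswith(b"\n") or raw.count(b"\n") != 1:
        raise ProtocolError("message is not terminated by a single line feed")
    try:
        text = raw[:-1].decode("ascii")
    except UnicodeDecodeError as exc:
        raise ProtocolError("message is not ASCII") from exc
    parts = text.split(" ")
    if len(parts) < 2 or parts[0] != PREFIX:
        raise ProtocolError("missing or wrong prefix")
    kind = parts[1]  # compared case-sensitively, as the protocol requires
    if kind == "FIND" and len(parts) == 4 and len(parts[2]) == 1 and parts[3]:
        return ("FIND", parts[2], parts[3])
    if kind == "BYE" and len(parts) == 3:
        if len(parts[2]) == 64 or parts[2] == "Unknown_Husky_ID":
            return ("BYE", parts[2])
    raise ProtocolError(f"unexpected message: {text!r}")
```

Raising a dedicated exception lets the top level of the program catch it, print an error, and exit without ever printing something that is not the flag.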
Submitting Your Project
If you have not done so already, register yourself for our grading system using the following command:
$ /course/cs3700sp20/bin/register-student [NUID]
NUID is your Northeastern ID number, including any leading zeroes.
To turn in your project, you should submit your (thoroughly documented) code along with three other files:
- A Makefile that compiles your code. You must turn in a Makefile, even if your code does not need to be compiled (e.g. your code is in Python or Ruby). You may leave the Makefile blank in this case.
- A plain-text (no Word or PDF) README.md file. In this file, you should briefly describe your high-level approach, any challenges you faced, and an overview of how you tested your code.
- A file called secret_flags. This file should contain your secret flag in plain ASCII.
Submit your project with the turnin script:
$ /course/cs3700sp20/bin/turnin project1 [project directory]
[project directory] is the name of the directory with your submission. The script will print out every file that you are submitting, so make sure that it prints out all of the files you wish to submit! The turnin script will not accept submissions that are missing README.md, a Makefile, or a secret_flags file. You may submit as many times as you wish; only the last submission will be graded and the time of the last submission will determine whether your assignment is late.
Double Checking Your Submission
To try and make sure that your submission is (1) complete and (2) will work with our grading scripts, we provide a simple script that checks the formatting of your submission. This script is available on the Khoury College Linux machines and can be executed using the following command:
/course/cs3700sp20/code/project1/project1_format_check.py [path to your project directory]
This script will attempt to make sure that the correct files (e.g. README.md, secret_flags, and Makefile) are available in the given directory, that your secret_flags file contains at least two 64-byte keys, that your Makefile will run without errors (or is empty), and that after running the Makefile a program named client exists in the directory.
The script will also try to determine if your files use Windows-style line endings (\r\n) as opposed to Unix-style line endings (\n).
If your files are Windows-encoded, you should convert them to Unix-encoding using the dos2unix utility before turning in.
Grading
This project is worth 4% of your final grade.
We will grade your project based on whether your program compiles, whether your program runs correctly and produces the secret flag, and whether you submitted a correct secret_flags file.
We reserve the right to implement Code Reviews as a function of your grading.
The extra credit is all or nothing, and you must fully implement the feature to spec as described in this assignment to qualify for the bonus.
All student code will be scanned by plagiarism detection software to ensure that students are not copying code from the Internet or each other.
You can see your grades for this course at any time by using the gradesheet program that is available on the Khoury College machines.
$ /course/cs3700sp20/bin/gradesheet
Extra Credit
It is possible to earn 1% extra credit on this assignment.
To get the extra credit, you must modify your client such that it supports SSL connections as follows.
If the -s parameter is given to your program, it should connect to the server using an encrypted SSL socket and complete the protocol normally (i.e. HELLO, FIND, COUNT, and BYE).
You may assume that the server's SSL port is 27994, unless the port is overridden on the command line using the -p option.
When you successfully run your SSL-enabled client against the SSL version of the server, you will receive a new secret flag (that is different from the normal secret flag).
You must add this SSL secret flag into the secret_flags file when you turn in your project (i.e. your secret_flags file will now contain two flags, one per line, in plain ASCII).
You will only receive extra credit if your code successfully implements the -s option, and you include the SSL secret flag in the secret_flags file.
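For the -s option, a client written in Python might wrap its TCP socket with the standard ssl module. The sketch below is one possible approach under stated assumptions: the helper names and the 10-second timeout are illustrative, and certificate checks are switched off only because the FAQ below notes that the server's certificate is self-signed.

```python
import socket
import ssl


def make_unverified_context() -> ssl.SSLContext:
    """Build a TLS context that accepts a self-signed server certificate."""
    context = ssl.create_default_context()
    # check_hostname must be disabled before verify_mode can be set to CERT_NONE.
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE
    return context


def open_connection(hostname: str, port: int, use_ssl: bool) -> socket.socket:
    """Connect over TCP and optionally upgrade the socket to an encrypted one."""
    sock = socket.create_connection((hostname, port), timeout=10)  # timeout is an assumption
    if use_ssl:
        return make_unverified_context().wrap_socket(sock, server_hostname=hostname)
    return sock
```

The wrapped socket supports the same sendall and recv calls as a plain socket, so the rest of the protocol code does not need to change. Disabling validation is acceptable only for this assignment; real clients should verify certificates.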
FAQ
Here are a few common questions that get asked about this project:
- Do the [Makefile, README.md, secret_flags, client] files need to be named exactly that?
  Can I turn in [README.txt, client.py, secret_flags.whatever, etc.] instead?
  NO. The files need to be named exactly what we have specified in this document. If you don't follow the specification exactly, you will lose points.
- Why do we need a Makefile?
  This is the consequence of letting you write your program in whatever language you want. Since you can turn in whatever crazy source code you want, we need to set a couple of ground rules so that we can compile and run your code. Those ground rules are (1) everyone turns in a Makefile, and (2) everyone's program must be named client.
- I'm using Java, and I can't get SSL to work.
  My program is complaining about invalid, self-signed certificates.
  Good on you for using a language that bothers to check whether the server's certificate is valid. As it turns out, my server's certificate is self-signed, which means it is not technically valid and should not be trusted. This will make a lot more sense later in the semester when we talk about SSL/TLS. In the meantime, you need to disable certificate validation in your code in order to ignore this error.
- I did socket.read(), counted the characters, and sent the count to the server, but then the server closed the connection.
  I manually checked and my count was correct.
  What's happening?
  Your count wasn't correct; the question is why. The problem is that you did not read the entire message from the server, i.e. you did not receive the entire random string. Did you check to make sure the message from the server ended with a newline ("\n")? Calling socket.read() does not necessarily mean you will receive the entire message from the server. You may need to call socket.read() multiple times to receive the entire message.
- Sometimes when I socket.read(), I just receive random ASCII characters.
  Why isn't the server sending me valid FIND or BYE messages?
  This is the same problem as the previous question. You are probably not reading from the socket until you receive a newline ("\n"). In other words, you socket.read() the first portion of the server's message, count characters in an incomplete random string, send a COUNT message to the server, and then socket.read() additional characters from the random string, but you are misinterpreting the second socket.read() as the start of a new message from the server.
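The fix for the last two questions is to keep reading until the newline arrives. A minimal Python sketch is shown below. The name recv_line and the 4096-byte chunk size are arbitrary choices for this example, and it assumes the server sends one message and then waits for your reply, so nothing follows the newline.

```python
import socket

MAX_MESSAGE_BYTES = 8192


def recv_line(sock: socket.socket) -> bytes:
    """Read from the socket until a complete newline-terminated message arrives."""
    buffer = bytearray()
    while not buffer.endswith(b"\n"):
        chunk = sock.recv(4096)  # may return only part of the message
        if not chunk:
            # An empty read means the server closed the connection.
            raise ConnectionError("server closed the connection mid-message")
        buffer.extend(chunk)
        if len(buffer) > MAX_MESSAGE_BYTES:
            raise ValueError("message exceeds 8192 bytes")
    return bytes(buffer)
```

Only after recv_line returns should the program split the message into fields and count characters.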