homework 1

a first look at language processing


getting started with data


Please review the homework in this pdf file. Remember that reading resources can be found in the syllabus.




data and starter kit


You will need the data, which we’ll be using throughout the semester. The Bash command to download it is:

wget --no-check-certificate https://course.ccs.neu.edu/cs6120s25/data/shakespeare/shakespeare-edit.txt
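If you would rather stay in Python, the same download can be sketched with the standard library alone. This is only a minimal sketch, not part of the official starter kit: the helper name `fetch_if_missing` and its default destination filename are assumptions made for illustration, and the TLS settings are chosen to mirror wget's `--no-check-certificate` flag.

```python
import shutil
import ssl
import urllib.request
from pathlib import Path

# URL taken from the wget command above.
DATA_URL = "https://course.ccs.neu.edu/cs6120s25/data/shakespeare/shakespeare-edit.txt"


def fetch_if_missing(url: str = DATA_URL, dest: str = "shakespeare-edit.txt") -> Path:
    """Download `url` to `dest` unless `dest` already exists.

    Hypothetical helper for illustration; not part of the course materials.
    """
    path = Path(dest)
    if path.exists():
        # Already downloaded: skip the network round trip.
        return path

    # Mirror wget's --no-check-certificate by disabling TLS verification.
    context = ssl.create_default_context()
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE

    # Stream the response to disk instead of holding it all in memory.
    with urllib.request.urlopen(url, context=context) as response, open(path, "wb") as out:
        shutil.copyfileobj(response, out)
    return path
```

Calling `fetch_if_missing()` from a script or notebook cell would save the corpus to the working directory on the first run and reuse the local copy afterwards, which is convenient when a notebook is re-executed many times.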

You can use Jupyter and Google Colab magic commands (calling Bash from inside a notebook) with the ! prefix. For example, !wget <options> <file> will suffice. If you are more comfortable with notebooks, there are several options:


submission instructions


Submit via Gradescope. There, you will upload your PDF and code.

  • Document templates are available as either a LaTeX file or a DOCX file. When you’ve finished compiling or writing, download the PDF from Overleaf or Google Docs and upload it to the submission link.