homework 1
a first look at language processing
getting started with data
Please review the homework in this pdf file. Remember that reading resources can be found in the syllabus.

data and starter kit
You will need the data, which we’ll be using throughout the semester. The Bash command is
wget --no-check-certificate https://course.ccs.neu.edu/cs6120s25/data/shakespeare/shakespeare-edit.txt
You can use Jupyter and Google Colab magick commands (calling Bash from inside of a notebook) with the !
indicator. For example, !wget <options> <file>
will suffice. If you are more comfortable with notebooks, there are several options:
- Locally on Your Laptop
- Google Cloud Vertex Work with your Google Cloud credits.
- Google Colabs with your Google Account
submission instructions
Submit via Gradescope. There, you will upload your PDF and code.
- Document templates can be either LaTeX File or DOCX File. When you’ve compiled/finished writing, download the PDF from Overleaf/Google and upload it to the submission link.