homework 3

mastering map reduce


mastering map reduce


Review the homework in this pdf file. Remember that reading resources can be found in the syllabus.


data and starter kit


You will need the data. If you are using Colabs (not a requirement), you would need a Google account.


Here are the starter kits that you might find useful.

  • Document templates can be either Overleaf TeX File or DOCX File. When you’ve compiled/finishe d writing, download the PDF from Overleaf/Google and upload it to the submission link.

  • Upload your processed output.txt file in the specified format in the homework questions

  • To help you get started, you may find that this Colaboratory Link is useful. In order to submit, you’ll need to Download the file as a *.ipynb file (Under FileDownloadDownload .ipynb.)


submission instructions

  • Commit your *.py or *.ipynb (either is fine as long as they are commented appropriately), output.txt, and name-submission-3.pdf to that repository and provide the URL via Gradescope before 5pm, Tuesday, October 10.

  • Make sure that you have documented your code with comments so that the TA can have an easier time understanding your logic. This will, in some cases, result in at least partial cre dit.

  • Include a diagram of your pipeline description in your writeup.

  • Include in your writeup the recommendations for the users with following user IDs: 924, 8941, 8942, 9019, 9020, 9021, 9022, 9990, 9992, 9993.

  • We will be checking for plagiarism, comparing code that is too similar to classmate or past class alumni homework. This will automatically result in zero credit.