Course Description
This course provides an overview of the important issues in information retrieval, and how
those issues affect the design and implementation of search engine software. The course
emphasizes the technology used in Web search engines and the information retrieval theories
and concepts that underlie all search applications. Mathematical experience including basic
probability is strongly desirable.
Reference books
[MRS]: Introduction to Information Retrieval Manning, Raghavan, and Schütze (Cambridge)
[CMS]: Information Retrieval in Practice Croft, Metzler and Strohman (Addison-Wesley, 2008).
Piazza
Please use Piazza discussion forum
for all questions regarding material, assignments, due
dates, data issues, programming issues, etc. Please do not email TAs or the
instructor directely with those questions.
Personal/private matters, such as availability, delays, grades, term projects or other
advanced material, etc, can be discussed by email or via Teams.
Course Policies
Prerequisites
Programming: you should be familiar with programming in an imperative language like Python or Java. While the programing language is your choice, we strongly encourage you to use one of the two mentioned.
Assignments
There will be a total of six/seven assignments, making up 90% of your final grade. Instructions and due dates will be posted on the schedule page as they are assigned. Some of the problems can be difficult, and it will often be helpful to discuss them with others. Feel free to form study groups; however, the idea is for everyone to understand the problems and experience working through the solutions. You may not simply share a solution with your classmate as that will be considered cheating. In general, students must write their own homework solutions/code and must not read or copy the solutions/code of others.
The assignments will consist mostly of programming exercises to implement various components of a search engine. We will usually also ask for your output on certain datasets and a short report describing your design choices and experimental results. In-class quizzes will be given at the end of each module.
All assignments will be submitted through Northeastern Github. Please refer to this blog for a basic Git tutorial .
We expect the average total load to be 15 hours/week on average. Based on past experience, many students spend more than 15 hours/week for the optional tasks denoted "Extra Credit"; the EC can improve your grade, but it is not a replacement for regular credit; EC is harder, more time consuming, worth less points, and less discussed during helping sessions; it is designed for students who want to get more out of the course. The best way, by far, to maximize your grade is to complete the required sections of the assignments on time.
Late homework policy: Homework is due on the announced due date. You can demo for up 48 hours after the due date. Any changes/resubmissions will be considered late. You can submit ONE assignment up to 48 hours late without any penalty. 5 points will be deducted per day on any other late submissions. If you know in advance of circumstances that would cause you to turn in an assignment late, please contact me or one of the TAs before the assignment is due to ask if an extension is possible.
Final Grade
- 7 Homework assignments, 90%
- 5 Quizzes, 10%
Academic Integrity
All work submitted for credit must be your own. If you're having trouble with the material, please ask questions on Piazza or come to our office hours for help. If you submit someone else's work as your own, or allow someone else to submit yours, you will get zero credit for that assignment and everyone involved will be reported. Please refer to the university's academic integrity policy for details.
You may discuss the homework problems or projects with your classmates, the TA, and the instructor. You must acknowledge the people with whom you discussed your work in the report you submit alongside your assignment. You must write your own code! Using ChatGPT to generate code is NOT allowed as that will be considered cheating and will result in a ZERO and a report to OSCCR.
Any written sources used (apart from the text) must also be acknowledged; however, you may not consult any solutions from previous years' assignments whether they are student or faculty generated.
Inclusive Class
Northeastern University values the diversity of our students, staff, and faculty; recognizing the important contribution each makes to our unique community.
Respect is demanded at all times throughout this course. In the classroom, not only is participation required, it is expected that everyone is treated with dignity and respect. We realize everyone comes from a different background with different experiences and abilities. Our knowledge will always be used to better everyone in the class.
We strive to create a learning environment that is welcoming to students of all backgrounds. If you feel unwelcome for any reason, please let us know so we can work to make things better. You can let us know by talking to anyone on the teaching staff. If you feel uncomfortable talking to members of the teaching staff, please consider reaching out to your academic advisor.
Northeastern is committed to providing equal access and support to all qualified students through the provision of reasonable accommodations so that each student may fully participate in the learning experience. If you have a disability that requires accommodations, please contact the Disability Resource Center , DRC@northeastern.edu, 617-353-2675. Accommodations cannot be made retroactively and to receive an accommodation, a letter from the DRC or LDP is required. Please present the instructor with your “Professor Notification Letter” during the first week of the semester, so that we can address your specific needs as early as possible.
Instructor
Office: 311 Meserve Hall
Office Hours:
M: 2-2:45 (in-person)
T: 1 - 2pm (Online - Please make an appointment)

Office Hours:
Additional hours by appointment
Email: mathew.ama@northeastern.edu

Office Hours:
Additional hours by appointment
Email: panjwani.ha@northeastern.edu



Office Hours:
Additional hours by appointment
Email: patel.khushiu@northeastern.edu

Office Hours:
Additional hours by appointment
Email: chapados.n@northeastern.edu

Office Hours:
Additional hours by appointment
Email: shah.arham@northeastern.edu
