COMPSCI 753
Algorithms for Massive Data
Assignment 2 / Semester 2, 2024
Recommender Systems
General instructions and data
Recommender systems are widely used in entertainment. In this assignment, we will explore one of the Goodreads review datasets using the recommendation algorithms covered in the lectures. To make the task feasible on most laptops and PCs, we have extracted a manageable dataset of reviews of Young Adult books¹ (containing 2,389,900 reviews). We have split the dataset into training data (1,433,940 reviews), validation data (477,980 reviews) and test data (477,980 reviews), which is roughly a 60/20/20 split. The corresponding files can be found on the assignment page. All three files have the same format: each line contains a user id, an item id, a review id and a rating.
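As a starting point, the record format described above could be read along the following lines. This is only a minimal sketch: the handout does not state the delimiter, whether a header row is present, or the file names, so the comma separator, the `Review` type and the `parse_reviews` helper are all assumptions made for illustration. If the files turn out to be JSON lines, replace the `split` call with `json.loads`.

```python
from typing import Iterable, Iterator, NamedTuple


class Review(NamedTuple):
    """One line of a data file (hypothetical container, not from the handout)."""
    user_id: str
    item_id: str
    review_id: str
    rating: float


def parse_reviews(lines: Iterable[str], sep: str = ",") -> Iterator[Review]:
    """Yield one Review per non-empty line.

    ASSUMPTION: fields are separated by `sep` and appear in the order
    user id, item id, review id, rating, with no header row.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        user_id, item_id, review_id, rating = line.split(sep)
        yield Review(user_id, item_id, review_id, float(rating))


# Split sizes as quoted in the assignment text.
SPLIT_SIZES = {"train": 1_433_940, "validation": 477_980, "test": 477_980}
TOTAL_REVIEWS = 2_389_900

# Made-up sample lines, used only to exercise the parser.
sample = ["u1,b10,r100,5", "u2,b10,r101,3", ""]
reviews = list(parse_reviews(sample))
```

With a real file, the same helper could be applied to an open file handle, for example `parse_reviews(open(path, encoding="utf-8"))`, where `path` is whichever file name the assignment page provides. Comparing the number of parsed records against `SPLIT_SIZES` is a cheap sanity check that a file was read completely.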
Submission
Please submit (1) a file (.pdf or .html) that