TY - GEN
T1 - Are we there yet? Estimating training time for recommendation systems
AU - Paun, Iulia
AU - Moshfeghi, Yashar
AU - Ntarmos, Nikos
PY - 2021/4/26
Y1 - 2021/4/26
N2 - Recommendation systems (RS) are a key component of modern commercial platforms, with Collaborative Filtering (CF) based RSs being the centrepiece. Relevant research has long focused on measuring and improving the effectiveness of such CF systems, but alas their efficiency – especially with regards to their time- and resource-consuming training phase – has received little to no attention. This work is a first step in the direction of addressing this gap. To do so, we first perform a methodical study of the computational complexity of the training phase for a number of highly popular CF-based RSs, including approaches based on matrix factorisation, k-nearest neighbours, co-clustering, and slope one schemes. Based on this, we then build a simple yet effective predictor that, given a small sample of a dataset, is able to predict training times over the complete dataset. Our systematic experimental evaluation shows that our approach outperforms state-of-the-art regression schemes by a considerable margin.
AB - Recommendation systems (RS) are a key component of modern commercial platforms, with Collaborative Filtering (CF) based RSs being the centrepiece. Relevant research has long focused on measuring and improving the effectiveness of such CF systems, but alas their efficiency – especially with regards to their time- and resource-consuming training phase – has received little to no attention. This work is a first step in the direction of addressing this gap. To do so, we first perform a methodical study of the computational complexity of the training phase for a number of highly popular CF-based RSs, including approaches based on matrix factorisation, k-nearest neighbours, co-clustering, and slope one schemes. Based on this, we then build a simple yet effective predictor that, given a small sample of a dataset, is able to predict training times over the complete dataset. Our systematic experimental evaluation shows that our approach outperforms state-of-the-art regression schemes by a considerable margin.
KW - recommendation systems
KW - sampling-based processing time prediction
KW - information retrieval
U2 - 10.1145/3437984.3458832
DO - 10.1145/3437984.3458832
M3 - Conference contribution book
SN - 9781450382984
T3 - Proceedings of the 1st Workshop on Machine Learning and Systems, EuroMLSys 2021
SP - 39
EP - 47
BT - Proceedings of the 1st Workshop on Machine Learning and Systems, EuroMLSys 2021
CY - New York
ER -