TY - JOUR
T1 - Extended overview of the CLEF 2024 LongEval Lab on Longitudinal Evaluation of Model Performance
AU - Alkhalifa, Rabab
AU - Borkakoty, Hsuvas
AU - Deveaud, Romain
AU - El-Ebshihy, Alaa
AU - Espinosa-Anke, Luis
AU - Fink, Tobias
AU - Galuščáková, Petra
AU - Gonzalez-Saez, Gabriela
AU - Goeuriot, Lorraine
AU - Iommi, David
AU - Liakata, Maria
AU - Madabushi, Harish Tayyar
AU - Medina-Alias, Pablo
AU - Mulhem, Philippe
AU - Piroi, Florina
AU - Popel, Martin
AU - Zubiaga, Arkaitz
PY - 2024/9/12
Y1 - 2024/9/12
N2 - We describe the second edition of the LongEval CLEF 2024 shared task. This lab evaluates the temporal persistence of Information Retrieval (IR) systems and Text Classifiers. Task 1 requires IR systems to run on corpora acquired at several timestamps, and evaluates the drop in system quality (NDCG) along these timestamps. Task 2 tackles binary sentiment classification at different points in time, and evaluates the performance drop for different temporal gaps. Overall, 37 teams registered for Task 1 and 25 for Task 2. Ultimately, 14 and 4 teams participated in Task 1 and Task 2, respectively.
AB - We describe the second edition of the LongEval CLEF 2024 shared task. This lab evaluates the temporal persistence of Information Retrieval (IR) systems and Text Classifiers. Task 1 requires IR systems to run on corpora acquired at several timestamps, and evaluates the drop in system quality (NDCG) along these timestamps. Task 2 tackles binary sentiment classification at different points in time, and evaluates the performance drop for different temporal gaps. Overall, 37 teams registered for Task 1 and 25 for Task 2. Ultimately, 14 and 4 teams participated in Task 1 and Task 2, respectively.
KW - Evaluation
KW - Information Retrieval
KW - Temporal Generalisability
KW - Temporal Persistence
KW - Text Classification
UR - http://www.scopus.com/inward/record.url?scp=85201630954&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85201630954
SN - 1613-0073
VL - 3740
SP - 2267
EP - 2289
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 25th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF 2024
Y2 - 9 September 2024 through 12 September 2024
ER -