An approach for the capture of context-dependent document relationships extracted from Bayesian analysis of users' interactions with information

D R Campbell, S J Culley, C A McMahon, F Sellini

Research output: Contribution to journal (Article)

8 Citations (Scopus)

Abstract

A number of technologies exist that enable the unobtrusive capture of computer interface interactions in the background of a user's working environment. The resulting data can be used in a variety of ways to model aspects of search activity and the general use of electronic documents in normal working routines. In this paper we present an approach for using captured data to identify relationships between documents used by an individual or group, representing their value in a given context that may relate to a specific information need or activity. The approach uses a naive Bayesian classifier to evaluate possible relationships that are derived implicitly from the data. It is intended that the relationships established be stored within an information retrieval (IR) system to aid in the retrieval of related documents where future users arrive at a similar context. In evaluating the approach, over 70 hours of data from computer users in industrial and academic settings were collected to assess its overall feasibility. The results indicate that the approach provides a useful method for the establishment of identifiable relationships between documents based on the context of their usage, rather than their content.
Original language: English
Pages (from-to): 115-141
Number of pages: 27
Journal: Information Retrieval
Volume: 10
Issue number: 2
DOI: 10.1007/s10791-006-9016-2
Publication status: Published - 2007

Fingerprint

Information retrieval systems
Interfaces (computer)
Classifiers
interaction
information retrieval
electronics
evaluation
Values
Group

Cite this

An approach for the capture of context-dependent document relationships extracted from Bayesian analysis of users' interactions with information. / Campbell, D R; Culley, S J; McMahon, C A; Sellini, F.

In: Information Retrieval, Vol. 10, No. 2, 2007, p. 115-141.

@article{940069c33e954921b1664bb2c2a8437b,
title = "An approach for the capture of context-dependent document relationships extracted from Bayesian analysis of users' interactions with information",
abstract = "A number of technologies exist that enable the unobtrusive capture of computer interface interactions in the background of a user's working environment. The resulting data can be used in a variety of ways to model aspects of search activity and the general use of electronic documents in normal working routines. In this paper we present an approach for using captured data to identify relationships between documents used by an individual or group, representing their value in a given context that may relate to a specific information need or activity. The approach uses a naive Bayesian classifier to evaluate possible relationships that are derived implicitly from the data. It is intended that the relationships established be stored within an information retrieval (IR) system to aid in the retrieval of related documents where future users arrive at a similar context. In evaluating the approach, over 70 hours of data from computer users in industrial and academic settings were collected to assess its overall feasibility. The results indicate that the approach provides a useful method for the establishment of identifiable relationships between documents based on the context of their usage, rather than their content.",
author = "Campbell, {D R} and Culley, {S J} and McMahon, {C A} and Sellini, F",
note = "ID number: ISI:000244186400002",
year = "2007",
doi = "10.1007/s10791-006-9016-2",
language = "English",
volume = "10",
pages = "115--141",
journal = "Information Retrieval",
issn = "1386-4564",
publisher = "Springer Netherlands",
number = "2",

}

TY - JOUR

T1 - An approach for the capture of context-dependent document relationships extracted from Bayesian analysis of users' interactions with information

AU - Campbell, D R

AU - Culley, S J

AU - McMahon, C A

AU - Sellini, F

N1 - ID number: ISI:000244186400002

PY - 2007

Y1 - 2007

N2 - A number of technologies exist that enable the unobtrusive capture of computer interface interactions in the background of a user's working environment. The resulting data can be used in a variety of ways to model aspects of search activity and the general use of electronic documents in normal working routines. In this paper we present an approach for using captured data to identify relationships between documents used by an individual or group, representing their value in a given context that may relate to a specific information need or activity. The approach uses a naive Bayesian classifier to evaluate possible relationships that are derived implicitly from the data. It is intended that the relationships established be stored within an information retrieval (IR) system to aid in the retrieval of related documents where future users arrive at a similar context. In evaluating the approach, over 70 hours of data from computer users in industrial and academic settings were collected to assess its overall feasibility. The results indicate that the approach provides a useful method for the establishment of identifiable relationships between documents based on the context of their usage, rather than their content.

AB - A number of technologies exist that enable the unobtrusive capture of computer interface interactions in the background of a user's working environment. The resulting data can be used in a variety of ways to model aspects of search activity and the general use of electronic documents in normal working routines. In this paper we present an approach for using captured data to identify relationships between documents used by an individual or group, representing their value in a given context that may relate to a specific information need or activity. The approach uses a naive Bayesian classifier to evaluate possible relationships that are derived implicitly from the data. It is intended that the relationships established be stored within an information retrieval (IR) system to aid in the retrieval of related documents where future users arrive at a similar context. In evaluating the approach, over 70 hours of data from computer users in industrial and academic settings were collected to assess its overall feasibility. The results indicate that the approach provides a useful method for the establishment of identifiable relationships between documents based on the context of their usage, rather than their content.

U2 - 10.1007/s10791-006-9016-2

DO - 10.1007/s10791-006-9016-2

M3 - Article

VL - 10

SP - 115

EP - 141

JO - Information Retrieval

JF - Information Retrieval

SN - 1386-4564

IS - 2

ER -