Here are some keywords:
Franklin and Graesser's definition:
An autonomous agent is a system situated within and a part of an environment that senses that environment and acts on it, over time, in pursuit of its own agenda and so as to effect what it senses in the future.
These are the best on-line introductions to agents that I found. They should be read in order, because the latter refer to the former.
There seems to be a lot of snobbery among the agent-interested community. The above authors frequently refute each other's definitions of agency, and call each other's programs "incredibly simpleminded" and "lacking initiative."
No one seems to agree on what an agent is.
Most of the so-called "Web Agents" are actually just front ends for automating web activity such as meta-searching, browsing for URL updates, etc.
Here are some of my notes on Web Agents. These are the "best" links I've found so far, in that they either have something to download, or are close conceptually to the type of agents in which we're interested (see IR Agents section).
NOTE: Most of these links no longer work. Sigh.
So how can we use agents in IR? Some ideas (inspired in part by Gerry's Library Agents concept):
Examine users' Netscape bookmarks (in ~/.netscape/) and links pages (in ~/public_html/); characterize the "type" of pages a user tends to link to.
For example, if Miro links to a lot of sites that are written in Croatian, or a lot of sites that have links to files rather than links to pages, then we can weight his search results accordingly.
This type of preference characterization also applies to the amount of graphics, source code citations (and the actual coding languages), links-to-content ratios, amounts of bad language, etc.
Have an agent "spider" around the Internet, looking for references to documents within a certain collection. If two or more documents are mentioned in close proximity, rate those documents more similarly (for clustering purposes).
This would also apply to queries; if query_1 is deemed to be similar to document on the Internet called remote_X (by deemed I mean we perform some sort of search of a query against the rest of the Internet) and remote_X mentions documents local_A and local_B, which exist in the collection, then you could improve local_A's and local_B's estimated "relevance" to query_1. In other word, you could search other collections to transitively deduce relevance.
Gerry Mckiernan is trying to investigate what he calls "Library Agents." These are agents whose function makes his job as librarian easier.
He wants to have agents that look at the different departmental and faculty web pages and come up with research interest profiles (RIPs). Gerry's more interested in group profiles than individual profiles, so also terms these "collective user profiles."
These profiles would be used to help assist web-searching, as well as ferret out relevant information for each deparment (information that, given the department's profile, seems likely to be of interest).
For more specific info, see his letters to us (below).