Reboot Research: September 2009

Some recent work at UWS was inspired by the real or imagined activities of a colony of foraging bees. Things got awkward when the model that was being used for experiments turned out to be different from the way bees actually forage. The supervisors said “all models are wrong, but some are useful” and of course wanted to explore the proposed approach, but the student largely lost interest. To my surprise, some readers felt that the study of unnatural systems was intrinsically repugnant, and that the story illustrated the need for science and religion to work hand in hand.
Suppose that a multi-agent system, with the task of looking for a particular sort of cluster in a large data set, observes a potential sub-cluster. We can imagine an automated step of spawning a new agent trained to look for further evidence of such a cluster. However, it is a bit fanciful to think of this new agent as a specially trained infant bee, as bees may learn the habits of the nest, but do not seem to receive the sort of individual instruction found in species with nuclear families. Other work at UWS examined the development of language in interacting groups of automata, and the introduction of a new word in that experiment is not unlike the introduction of a new agent in this one, since the introduction of the word implies a new subset of individuals that use it.
Leaving aside the biological inspiration, could a commercial system be imagined with similar properties? We could imagine such a system working in the data centre of a large supermarket or bank. If new agents can be spawned in this way, there would undoubtedly be issues of monitoring or control. A novel data cluster might result in a massive generation of new agents which might appear as unexpected additional activity in the system. In a commercial data centre it is possible that such an event would lead to suspicions of an intrusion or system fault.
If the autonomous agents are required to do a lot of status reporting to explain what they are up to, the additional monitoring traffic might create so many external messages as to call into question the wisdom of using agents. On the other hand, if the reporting traffic was cleverly aggregated within the swarm, a coherent report could be made to a monitor that a particular observation led to the deployment of 123,000 agents to investigate the possible existence of a new cluster, and this activity had now ended. Some computing systems build this sort of observable surface over chaotic, Brownian, internal motion; just as the apparently random behaviour of autonomous bees creates a regular-shaped nest. For example, network management systems aggregate event reports that have a shared cause, and (doubtless) Microsoft’s performance reporting systems do something similar.
In this way, the investigation of circumstances for and rules for the creation of a new agent leads to a new and interesting control problem, where the new problem is that of explaining the new situation that has arisen, in terms that make sense to those who have not been tracking all the details…

During August Reinventing Academic Publishing Online appeared on Scholarship 2.0. It is a polemic against what its authors see as an exclusive establishment consisting of the "top academic journals" that only the richest universities can afford, and a self-serving institutional system that distorts the academic process in order to make the job of funding bodies and appointing committees easier.
Now there are many misguided people who think that there are such things as "top academic journals" where the best computing research is to be found, and regrettably some of these people do appear to hold positions of power and influence. But the fault is theirs alone.
I believe strongly in the value of computing research conferences: but the large ones have pursued profit at the expense of discrimination. It is easy to find dreadful papers presented at even the best conferences, with half-baked ideas and without results or any pretence at evaluation. But it is precisely the Web and Web 2.0 that allows us to find quality independently of the vehicle used for publication.
I have been following with some interest the response in UK academia to the recent Research Assessment Exercise. The Computing panel noted that excellent articles were to be found in journals with low impact factors, and conversely. They were astonished at the huge number (1247) of refereed journals that submitted articles had appeared in, and were amazed that relatively few university departments had submitted conference publications. They restated their policy that conference papers could be just as good as those found in the "top-rated journals".
Interested readers can follow this debate in the Conference of Heads and Professors of Computing and the consultation about the Research Excellence Framework.
Not only is their analysis of the last RAE excellent: so are the proposals for the next RAE, which will recognise that originality, rigour and impact are not usually found in a single publication. So let's just get on with the research, and leave the task of re-inventing academic publication to those who have time for it.
(Update 9 Nov: and in the meantime, join www.mendeley.com, which looks like a good Web 2.0 scholarship repository!)

Reboot Research

Wednesday, September 9, 2009

Agents and Bee Foraging

Monday, September 7, 2009

On Scholarship 2.0

Followers

Blog Archive

About Me