Moreover Technologies
3 July 2002

Gathering news from over 3,000 web-based sources

Machines are efficient things, but when it comes to achieving high levels of accuracy, human intervention is often required.

At Moreover Technologies, a San Francisco, London and New York-based information management solutions provider, technology and manpower are both used to deliver the latest news and information to Global 2000 clients with the degree of speed and accuracy on which the company's reputation has grown.

Since its foundation in 1999, the company, which won the 2001 Search Engine Watch Award for Speciality Search and which counts Deutsche Bank and McGraw-Hill among its clients, has etched a niche for itself between general search engines such as Google, and proprietary content aggregators such as Lexis-Nexis.

Moreover operates as a highly focused search engine, retrieving information from the web specifically geared to a particular client's needs. Moreover sources news from more than 3,000 web-based sources, from major newspapers and company websites to newsgroups and specialist trade publications. The information can be received via the client's e-mail, intranet system, website, extranet or desktop and is refreshed every 15 minutes.

Moreover's technological infrastructure, which operates via a New Jersey-based server farm, allows for a high degree of automation.

As well as extracting new data from each news site every 15 minutes, the system converts the http data into XML, allowing articles to be filtered under such criteria as date, time, source name and source type.

Databases store the news as it arrives, a redundancy detector eliminates multiple postings and an output engine integrates outgoing data into the client's desired delivery format. However, the system would fall apart without input from Moreover's editorial team.

Comprising 10 members (a quarter of the company's 40-strong workforce), the editorial team plays a key role in two parts of the process. Firstly, editors locate, evaluate and select news sources based on the client's requirements. Moreover provides a range of sources from which the client can pick, and the client can also ask for additional sources to be included.

"We're very proud of the quality of the news sources we harvest," says Angus Bankes, co-founder, chief technology officer and general manager of UK operations at Moreover.

"The inclusion of any particular source is client-led, but sources have to pass editorial criteria - they have to be high quality."

Having established which sources to include in the client's package, the editors create customised templates for each selected news site. Because websites change constantly and the positioning of the headline varies from site to site, manually-created templates ensure that Moreover's technology harvests only what is needed from each source and identifies the fast-changing areas of a site.

Human intervention is also integral to another part of the process. Once http sources have been converted to XML and have been checked for redundancies, Moreover's editors define rules to filter content topically and add category XML tags.

Moreover currently filters news sources across at least 360 categories, from legal and clothing industry news to joke sites and Hollywood gossip.

Mr Bankes believes that there will always be room for an editorial team at Moreover. "If you only use software, you'll never get more than 85 per cent accuracy," he says. "Automation is key, but you need human intervention to increase that figure from 85 per cent to 98 per cent."

Mr Bankes says that although software is improving, people are needed to create profiles for the software.

Another reason for employing human editors is to keep up with the fast- changing face of the internet. "Most people would accept that purely automated systems can't adapt to change," says Mr Bankes. "Because the internet moves so fast, without human intervention, you wouldn't pick up on changes for weeks."

Some people disagree. Paola di Maio, a content analyst and editor of the web-based publication Content Wire, thinks that human editing teams are less than ideal. "Human editors are high cost and high error," comments Ms Di Maio.

Unlike Mr Bankes, Ms Di Maio believes that full automation is possible. "With a good enough algorithm and appropriate filters, automation is 100 per cent possible," she says. "However, humans must do systematic quality controls."

Moreover is not the only company working in the information management sector to consider humans an integral cog in the system.

"Professional editors will always be important in story selection," says David Scott, vice president of corporate marketing at NewsEdge, a Massachusetts-based information services company whose 280-strong staff includes 40 editors.

"There are certain things which machines simply cannot do. For example, machines cannot choose the better of two stories which come out at the same time from different media outlets on the same subject. Human judgment is important."

© Copyright The Financial Times Limited 2002 .