
    Web start-up unveils semantic Wikipedia search tool

    SAN FRANCISCO
    Mon May 12, 2008 10:48am EDT

    SAN FRANCISCO (Reuters) - Powerset on Sunday unveiled tools for searching Wikipedia that use conversational phrasing instead of keywords, marking the first step of its challenge to established Web search services such as Google.


    Powerset's technology breaks down the meaning of words and sentences into related concepts, freeing users from always needing to type the exact words they want to find.

    The closely watched Silicon Valley start-up is offering a way of searching millions of entries in Wikipedia's online encyclopedia, helping users find detailed answers to questions rather than isolated links that require further research.

    For example, a user who wants to know how many wives King Henry VIII had (six, or two, depending on your definition of marriage) can find an answer via Powerset's service at tinyurl.com/5qpcr9/.

    San Francisco-based Powerset is looking to leapfrog the current generation of search services that rely on keywords, such as Google Inc, Yahoo Inc, Microsoft Corp and IAC InterActiveCorp's Ask.com.

    "The Wikipedia is becoming a microcosm of the most useful parts of the Web," said Greg Sterling, an Internet analyst with Sterling Market Intelligence. "This offers a powerful way to find what you are looking for against this subset of the Web."

    While still a far cry from letting users search the World Wide Web, Powerset is using Wikipedia as a trial showcase for how its technology can be used to search a vast number of other websites using natural language phrases or questions.

    Over time, it aims to partner with other high-quality data sites where information can be organized in a question and answer form that lends itself to Powerset search techniques. Examples might include financial or patent filings, the CIA Factbook or Wikipedia-inspired clones, company officials said.

    Powerset, which can be found at www.powerset.com/, looks beyond words to try to understand conceptual relationships that get closer to what a user may be searching for. It analyzes each sentence and whole documents to do so.

    Powerset plans eventually to make money selling advertising alongside its search services. But for now, the 60-employee company consists almost entirely of computer scientists and linguists. It has no advertising staff and only a handful of marketing and support staff.

    Sterling said it is likely to take years for Powerset to be able to search the Web on the scale Google now does using statistical ranking techniques to find relevant Web links.

    "What I don't know is how Powerset will perform on the wide open Web. In a sense, this is a massive prototype using the relatively structured information of Wikipedia. It is difficult to compare to what Google has built," Sterling said.

    Sterling said a bigger danger to Google would be if rival Microsoft were to acquire Powerset and incorporate it into its other search technologies. Microsoft recently backed off a $44 billion bid for Yahoo that was intended to create a formidable rival to Google in Web search and online advertising.

    "This could become the basis of a Google-killer," Sterling said. "Someone like Microsoft might want to buy Powerset."

    Spokesmen for Microsoft and Powerset declined to comment on rumors of a potential tie-up between the two companies.

    FUN WITH "FACTZ"

    Powerset offers richly annotated ways for searching inside Wikipedia entries to find related concepts. Called "Factz", these related ideas generate outlines, summaries and automated answers to users' questions.

    "Our system is a little more forgiving," Scott Prevost, general manager of Powerset, said in an interview on Sunday. "It is not looking for hard-word matches. We are not searching for exact words, but concepts," he said.

    The 2-1/2-year-old start-up licensed natural language processing technology and related machine processing methods developed over three decades at the Xerox PARC research centre in Silicon Valley to create new consumer Web search services.

    With the tacit approval of the non-profit Wikimedia Foundation, the organization behind Wikipedia, Powerset officials said the company is hosting a copy of Wikipedia's 2.5 million English-language entries on its own computers. This lets Powerset make links across the breadth of Wikipedia data.

    "What Powerset is doing is offering readers a natural-language search interface, and we think that is an interesting experiment," Mike Godwin, Wikimedia Foundation's general counsel, said in response to an emailed question about how the two organizations would work together.

    In addition to Wikipedia, Powerset's new service also searches a related database called Freebase created by MetaWeb, another Web search start-up.

    After decades of research and debate, natural language processing is finally poised to go mainstream, predicted Barney Pell, Powerset's co-founder and chief technology officer.

    "2008 is the year that semantic and linguistic technologies cross over into widespread consumer use," he said.

    (Editing by Louise Ireland)


