Exclusive Interview Gaviri Founder
September 8, 2009
One of the most interesting aspects of the Beyond Search Web log in general and the Search Wizards Speak feature in particular is learning about those who invent search systems.
I had an opportunity to sepak with Emeka Akaezuwa in the spare student union at Drexell University not long ago. Dr. Akaezuwa struck me as a remarkable individual. First, he made the trek from his office near New York City to Philadelphia. Second, I learned that he used to work at Dow Jones & Co., laboring on the firm’s search and retrieval systems. Third, I found that he had a PhD and was contributing his time to help young students get their arms around computers and their potential.
You can read the full text of my interview with Dr. Akaezuwa in the Search Wizards Speak feature on ArnoldIT.com. Search Wizards Speak is the single most comprehensive collection of interviews with the movers and shakers in search and content processing.
Dr. Akaezuwa’s Universal SearchOS impressed me. I watched as he took his wristwatch, connected it to my laptop via a USB cable, and indexed my computer. I made the observation that some law enforcement and intelligence agencies might be intersted in the technology. The SearchOS indexed my netbook in a matter of minutes. He said, “Now you can use the data in the Gaviri index to search for documents on your PC.”
SearchOS is available in other “flavors” as well. I have been testing the desktop version since we met at Drexell. One of its most useful features is the ability to “point” the indexing system at an archive of Outlook or Outlook Express email. The system makes those messages and their attachments searchable. Very useful.
The system can be deployed like other enterprise search systems. I asked Dr. Akaezuwa if the SearchOS system could cluster and generate facets. He laughed and told me, “The system is very versatile. Yes, we can make those features available to you if you want them.”
One point Dr. Akaezuwa made in his intervew was:
SearchOS can index so many documents without additional servers because of its Sandbox, Distributed Indexing Architecture. Let us take a behind-the-firewall setting with one thousand users and maybe six servers. Each user indexes all the documents within their sphere of search influence (desktops, portable storage devices, etc.) using their PC or laptop. If we assume that each user has three million documents, we would index three billion documents. And if each of the six servers has fifty million documents, we would index 300 million documents on the servers. Indexing is distributed on each user’s machine. By design, SearchOS can index as many documents as are available on a device. We did not test on the specific hardware configuration you mentioned, but SearchOS’ processing throughput on a dual core processor is about 3,000 documents a minute. Keep in mind that SearchOS does full-text, not partial-text, indexing so the number may be less if a system has many large-size documents (over 500 MB). The software has a CPU utilization throttle that allows a user or a sys admin to power-up or decelerate content processing throughput to match available system resources. SearchOS not only morphs to user contexts but also can be scaled to a device’s content processing capabilities. Device-scaling – which I did not mention as one of what sets us apart – is necessary given the array of systems – from resource-challenged PDAs to high-powered servers that SearchOS must run on.
Stephen Arnold, September 8, 2009