Another View of the Search Market
November 3, 2008
I missed this September 27, 2008, analysis in Intelligent Enterprise. I am not surprised that my trusty correspondents did not forward the link to me. You must read “Enterprise Search: Microsoft, Google, Specialized Players Vie for Supremacy” by Andrew Conry-Murray” here. The article was interesting because it comes at a subject near and dear to my heart in a way that I would not have anticipated. This is a five part opus, so plan to spend some time analyzing the write up’s structure and assertions.
The thesis makes one key assumption; namely, enterprise search is alive and kicking and that it is a viable business sector for the hundreds of companies touting their search systems. First, Mr. Conry-Murray uses a segmentation developed by Information Week. That’s okay, but I am not certain it is 100 percent in line with my analysis of this complicated, confused, conflicted sector. Second, the article pops from finding stuff to finding stuff under the umbrella of eDiscovery. The leap doesn’t resonate with me, and it does not make much sense. eDiscovery can exist along with multiple search systems, and it involves some different issues that searching for stuff without threat of a fine or jail time. Think spoliation. Third, I was exposed to “the 17 databases problem”. Now, next generation data management systems can cope with heterogeneous types of structured and unstructured data. I quite like Google’s dataspace approach and the Exalead system works like a champ as well. Mark Logic and others are in this horse race as well. I could list more vendors but I don’t want to rehash my profiles in my Beyond Search study published by The Gilbane Group in April 2008. Finally, I learned about expert search.
I am not going to be able to recycle much of this article. Nor will I reference it in my lectures next week. What I learned is that a person who “reads up” about search and talks to some people can identify some of the issues. What’s missing is context. I do quite like the frequency with which the “beyond” preposition is turning up. There’s a “beyond Google” seminar. Even Attivio uses the “beyond” word in its newest white paper.
Here’s an interesting exercise. Navigate to Google. Run a query for “beyond search”. Start there.
Stephen Arnold, November 3, 2008
Herding SharePoint Content Sheep
November 2, 2008
Microsoft may be pushing Fast Search’s ESP into large SharePoint installations, but certified gold partners continue to find opportunities to make money from the 100 million SharePoint installations. Autonomy and Open Text recently rolled out systems that make it easier to keep control of SharePoint content. Why control SharePoint documents? You will learn quickly enough when you get caught in a legal matter and have to figure out which version of each document is the one that is the “right” one. SharePoint offers primitive and clunky controls for herding SharePoint content sheep; that is, the many bits and pieces of modular documents, emails with attachments, and PowerPoint decks with some relevant information but perhaps not the best and final version of the deck. When you get the invoice for collecting documents manually, you will understand why you have to have robust tools for governing information.
Let’s look briefly at two products that herd content sheep:
First, Autonomy has rolled out Controlpoint. You can read more about the product on the Autonomy Web site here and in the Marketwatch write up here. In a nutshell, you install Controlpoint, and you get policy-driven control of all SharePoint content. Autonomy includes a number of content processing functions with Controlpoint; for example, classification of documents.
Next, Open Text dubs its governance solution the sveltly named Open Text Content Lifecycle Management Services for Microsoft SharePoint, eDOCS Edition. The acronym is OTCLMSMSE. Between you and me Autonomy does a better job naming products. The Open Text solution delivers life cycle management, policies, and archiving functions. A licensee can hook SharePoint into Open Text’s other enterprise content management services as well. You can read CMSWire’s write up here.
Let’s step back and think about SharePoint. Autonomy and Open Text have identified a glaring weakness in the Frankenstein SharePoint. SharePoint is, according to some, Microsoft’s next generation operating system. I think that’s pretty wacky. SharePoint is a product that changes with each release. First it was content management. Then it was collaboration. Now it is knowledge management. The Microsoft sale pitch makes SharePoint seem easy, cheap, wonderful, and the cure for what ails a modern organization. In reality, SharePoint lacks the chops to deal with content when a lawyer shows up to collect information as part of the discovery process. SharePoint doesn’t do any single function particularly well. What it does is deliver stub functions that work okay when you have two or three people using a small number of documents. Increase the number of documents and the number of users, and you have a multi million dollar investment to get the system stable and running with acceptable performance.
I am willing to go out on a limb and say that Microsoft will introduce enhanced policy and information governance features. The Microsoft certified professionals will install these extensions, and the company will find that liberal injections of money and technical resources will be needed to get the amalgamation working in an acceptable way.
In the meantime, Autonomy and Open Text should be able to make sales. When Microsoft rolls out its own governance solutions, engineers at these companies will develop a fix for another void in SharePoint.
Stephen Arnold, November 2, 2008
Brainware: Nailing a Big Deal
October 31, 2008
Brainware won’t reveal the name of its client, but the “diversified energy company” bought $2.6 million worth of Brainware technology in October 2007. Brainware explains that this is a “follow on contract”. This size of this deal moves Brainware into Autonomy and Endeca territory, two vendors noted for their ability to land large search and content processing deals. Brainware describes itself as a vendor of “intelligent data capture and enterprise search solutions.” The firm offers what is called “end to end” solutions. Like OpenText and ZyLab, Brainware can put in place scanning and content capture and conversion systems, indexing, content processing, and information access tools. The idea is that paper or digital information goes in at one end of the system and the user can access that content at the other. You can read an interview with one of the Brainware executives here. I profiled the company in Beyond Search, published by the Gilbane Group. More information about that study is here.
According to the Brainware news release here:
This private sector oil company is deploying Brainware Distiller as a front-end data capture solution in conjunction with its global ERP rollout. The Brainware Distiller solution will process millions of invoice pages per year from more than a hundred different countries through their shared services centers located across Europe and the U.S.
A happy quack to the Brainware team. Now, what’s next for the Ashburn, Virginia, company? If you haven’t sold Halliburton, maybe you should aim for Shell or BP next? I found the announcement interesting because Brainware is associated with eDiscovery, not enterprise search, in my mind. Like Recommind, Brainware seems to be making an effort to penetrate new markets with its patented technology for information processing.
Stephen Arnold, October 30, 2008
Silobreaker: Two New Services Coming
October 24, 2008
I rarely come across real news. In London, England, last week I uncovered some information about Silobreaker‘s new services. I have written about Silobreaker before here and interviewed one of the company’s founders, Mats Bjore here. In the course of my chatting with some of the people I know in London, I garnered two useful pieces of intelligence. Keep in mind that the actual details of these forthcoming services may vary, but I am 99% certain that Silobreaker will introduce:
Contextualized Ad Retrieval in Silobreaker.com.
The idea is that Silobreaker’s “smart software” called a “contextualization engine” will be applied to advertising. The method understands concepts and topics, not just keywords. I expect to see Silobreaker offering this system to licensees and partners. What’s the implication of this technology? Obviously, for licensees, the system makes it possible to deliver context-based ads. Another use is for a governmental organization to blend a pool of content with a stream of news. In effect, when certain events occur in a news or content stream, an appropriate message or reminder can be displayed for the user. I can think of numerous police and intelligence applications for this blend of static and dynamic content in operational situations.
Enterprise Media Monitoring & Analysis Service
The other new service I learned about is a fully customizable online service that delivers a simple and effective way for enterprise customers to handle the entire work flow around their media monitoring and analysis needs. While today’s media monitoring and news clipping efforts remain resource intensive, Silobreaker Enterprise will be a subscription-based service that will automate much of the heavy lifting that either internal or external analysts must perform by hand. The Silobreaker approach is to blend–a key concept in the Silobreaker technical approach–in a single intuitive user interface disparate yet related information. The enterprise customers will be able to define monitoring targets, trigger content aggregation, perform analyses, and display results in a customized web-service. A single mouse click allows a user to generate a report or receive an auto-generated PDF report in response to an event of interest. Silobreaker has also teamed up with a partner company to add sentiment analysis to its already comprehensive suite of analytics. Currently in final testing phase with large multinational corporate test-users and due to be released at end of 2008/early 2009.
Silobreaker is a leader in search enabled intelligence applications. Check out the company at www.silobreaker.com. A happy quack to the reader who tipped me on these Silobreaker developments.
Stephen Arnold, October 23, 2008
Google: A Powerful Mental Eraser
October 23, 2008
Earlier today I learned that a person who listened to my 20 minute talk at a small conference in London, England, heard one thing only–Google. I won’t mention the name of this person, who has an advanced degree and is sufficiently motivated to attend a technical conference.
What amazed me were these points:
- The attendee thought I was selling Google’s eDiscovery services
- I did not explain that organizations require predictive services, not historical search services
- I failed to mention other products in my talk.
I looked at the PowerPoint deck I used to check my memory. At age 64, I have a tough time remembering where I parked my car. Here’s what I learned from my slide deck.
Mention Google and some people in the audience lose the ability to listen and “erase” any recollection of other companies mentioned or any suggestion that Google is not flawless. Source: http://i265.photobucket.com/albums/ii215/Katieluvr01/eraser-2.jpg.
First, I began with a chart created by an SAS Institute professional. I told the audience the source of the chart and pointed out the bright red portion of the chart. This segment of the chart identifies the emergence of the predictive analytics era. Yep, that’s the era we are now entering.
Second, I reviewed the excellent search enabled eDiscovery system from Clearwell Systems. I showed six screen shots of the service and its outputs. I pointed out that attorneys pay big sums for the Clearwell System because it creates an audit trail so queries can be rerun at any time. It generates an email thread so an attorney can see who wrote whom when and what was said. It creates outputs that can be submitted to a court without requiring a human to rekey data. In short, I gave Clearwell a grade of “A” and urged the audience to look at this system for competitive intelligence, not just eDiscovery. Oh, I pointed out that email comprises a larger percentage of content in eDiscovery than it has in previous years.
Recommind: Grabs Legal Hold
October 20, 2008
Recommind released its Insite Legal Hold solution today. This product bridges the gap between enterprise search and analytics.
Recommind’s Craig Carpenter states that Insite maps well with the current customer base of financial and professional service firms that are involved in heavily regulated, high knowledge users that are subject to mass litigation.
The release of this product during these financially strained times is viewed as a growth opportunity backed by a recently infusion of $7.5 million in private-equity funding.
So what makes Insite Legal Hold worth an investment in your company? First, it is an integrated solution – early risk assessment (ERA), preservation, hold/collection and processing. Second, you can reduce your litigation related costs and risks to some degree. Third, you can collect only what is needed and leave the rest to current company retention policy. Finally, you can proactively address retention and spoliation risks; that is, having an email changed.
Perhaps the most intriguing part of this product is the automated updates to current holds, though Mr. Carpenter said that in response to customer feedback, Recommind also included less sexy but still important features including filtering, deduping, near-duping, and e-mail-thread processing.
A few other benefits of Insite Legal Hold include:
- Collective selection based upon keyword, Boolean, and concept matching. This collective selection provides is more defensible than previous legal hold releases because the applied intelligence normalizes for related concepts and produces documents that yield more relevant data that is above and beyond reasonable as required by the Federal Rules of Civil Procedure.
- Explore in Place Technology allows the indexing an return of light results into html for a sampling review which can them be used to apply concept searches and more to the fuller data sets.
- Multi-platform flexibility: allows enterprises with a legacy review platform to enhance data analytics yet still use its current system for production
- Built-in processing: filter, dedupe, near-dupe, and thread documents, thereby saving 70-80% of processing and review costs.
- Manages Multiple Holds.
- Reduces IT costs by providing a forensically sound copy of perceived
relevant data and holds it in a separate data store.
When asked about pricing Mr. Carpenter provided an overview of Recommind’s three-tiered licensing module.
- Annual license fee bases upon the number of custodians
- System sold outright to customers with existing infrastructures
- A La Carte for those customers who don’t have a huge litigation load but need to manage 1 or 2 cases per year.
Insite Legal Hold has a huge potential to reduce the costs and risks involved in e-discovery endeavors. The pain points of high costs at the collection and review stage make the automation of updates and concept and near-concept bases selection an attractive solution.
Recommind’s investment of private equity funds to get the word out about their solution in a time when more potential customers are struggling with the fall-out from a global financial crisis bodes well for the profit stream of this company. What is apparent with this solution is that the developers are starting to pay attention to the less-sexy parts of e-discovery work and spending time and money to provide solutions that help reduce costs and the collection and production stages of the e-discovery cycle.
Constance Ard, Answer Maven for Beyond Search, October 20, 2008
Google Yahoo: Boo Hoo for GooHoo
October 16, 2008
I don’t pay much attention to online advertising or the search engine optimization games. Ads are part of the furniture of living in the Wild West of capitalism. SEO is little more than people without the ability to create compelling content working hard to outsmart indexing robots. Dull, uninteresting, and crass work from my point of view. The headline in The AM Law Daily here “Five Firms Push for Google Yahoo Antitrust Settlement” by Nate Raymond caught my attention. For me the key point was not the litigation that Google faces. I noted the legal eagles enlisted by GooHoo to make the objections go away: Cleary Gottlieb Steen & Hamilton, Wilson Sonsini Goodrich & Rosati, Skadden, Arps, Slate, Meagher & Flom, and Hunton & Williams. The price tag for the eDiscovery related to this matter is going to be a blockbuster just like the legal fees. No BS regarding the costs of this adventure. We have an aleph of eagles circling this GooHoo matter. Google and Yahoo do seem to be care about ads and SEO-related activities for traffic. In my opinion utilities are natural monopolies and will form one way or another. What’s your view?
Stephen Arnold, October 16, 2008
Recommind: A Cash Infusion to Go to eDiscovery 2.0
October 10, 2008
Most people don’t know what eDiscovery is. Believe me. You are better off not knowing. The term applies to information obtaining during the discovery process of a legal matter. The “e” part means that software does some of the heavy lifting. Human attorneys fall asleep when grinding through hundreds of thousands of pages of documents, email, and transcripts. Software never sleeps. The customers for eDiscovery services include some law firms. Keep in mind that although there are a quarter million attorneys running around the lower 48 states, the top 50 law firms constitute a small market. The bigger market for eDiscovery are corporations. With the cost of litigation rising more quickly than the price of diet soda in Harrod’s Creek, Kentucky, eDiscovery can save in house legal operations big money.
Recommind emerged as a challenger to Stratify, now a unit of Iron Mountain, a $2.8 billion records and information company known for putting paper in limestone caves. Recommind’s system is, to the uninitiated, similar to Autonomy’s. The company started in the legal market, made a run at the enterprise search market, and now with this infusion of $7.5 million seems to be nosing back into the eDiscovery space. You must read Anthony Ha’s story “Recommind Raises $7.5 million for eDiscovery 2.0” here. I’m baffled by the “2.0” part. Recommind has to come up with something beyond 2.0 because Brainware, Clearwell Systems, and a couple of other firms such as Autonomy have caught the fancy of some legal eagles. I think that the “2.0” is somewhat gratuitous, maybe meaningless. As I read the story by Mr. Ha, Recommind is going to use the money to grow.
Kennet Partners was the lead for the financing. Smart money knows where there’s a payoff. Recommind will have the cash to leap frog some of the competitors, but not necessarily the ones mentioned in the New York Times article. Once the technology moves Recommind ahead of the upstarts who are selling management systems, not just eDiscovery systems, then Recommind will have to adjust its marketing. My offer of visiting and conducting an interview with the firm was rejected. “Too busy” was the reason I was given by the firm’s top sales guru. With smart money watching $7.5 million, expecting its cash to grow, Recommind will have an exciting road to travel in the next six to nine months as it goes “2.0” as the economic thermometer drifts toward zero.
Stephen Arnold, October 10, 2008
Google and the NSA: An Alleged Document
September 30, 2008
I don’t have any comment about this document, which appears to contain information about Google and the puzzle palace. Oh, I’m sorry the National Security Agency or NSA. You can read Google Blogoscoped’s comments here. The addled goose doesn’t dip its beak into the se waters. If you are hungrier than this water fowl, have at it. My view on these alleged tie ups between information literate companies and government agencies is that I don’t want to be involved. If you really want to get the inside scoop, fly into BWI Airport, rent a car, and drive on down to the nerve center of the puzzle palace. Lots of friendly people around. I’m sure you will get some useful information. Just don’t push those goose feathers to me.
Stephen Arnold, September 30, 2008
IBM: Another New Search System from Big Blue
September 27, 2008
IBM announced its eDiscovery Analyzer. You can read the IBM news release on the MarketWatch news release aggregation page here. Alternatively you can put up with the sluggish response of IBM.com and read the more details here. You won’t be able to locate this page using IBM.com’s search function. The eDiscovery Analyzer had not been indexed when I ran the query at 7 30 pm on September 27, 2008. I * was * able to locate the page using Google.com. If I were the IBM person running site search, I would shift to Google, which works.
The eDiscovery Analyzer, according to Big Blue:
… provides conceptual search and analysis of cases created by IBM eDiscovery Manager.
Translating: eDiscovery Manager assists with legal discovery, a formal investigation governed by court rules and conducted before
trial, and internal investigations on possible violations of company policies, by enabling users to search e-mail documents that
were archived from multiple mailboxes or Mail Journaling databases into a central repository. You license eDiscovery Manager, the bits and pieces needed to make it go and then you license the brand new eDiscovery Analyzer component.
I believe that this is the current interface for the “new” IBM eDiscovery Analyzer. Source: IBM’s Information Management Software IBM eDiscovery Analyzer 2.1 marketing collateral.
You will need FileNet, IBM’s aging content management system. The phrase I liked best in the IBM write up was, “[eDiscovery Analyzer] is easy to deploy and use, Web 2.0 based interface requires minimal user training.” I’m not sure about the easy to deploy assertion. And the system has to be easy to use because the intended users are attorneys. In my experience, which is limited, legal eagles are not too excited about complicated technology unless it boosts their billable hours. You can run your FileNet add in on AIX (think IBM servers) or Windows (think lots of servers).
You can read about IBM’s search and discovery technology here. You can tap into such “easy to deploy” systems as classification, content analysis, OmniFind search, and, if you are truly fortunate, DB2, IBM’s user friendly enterprise database management system. You might want to have a certified database administrator, an expert in SQL, and an IBM-trained optimization engineer on hand in case you run into problems with these user friendly systems. If these systems leave you with an appetite for more sophisticated functions, click here to learn about other IBM search and discovery products. You can, for example, read about four different versions of OmniFind and learn how to buy these products.
Remember: look for IBM products by searching Google. IBM.com’s search system won’t do the job. Of course, IBM’s enterprise eDiscovery Analyzer is a different animal, and I assume it works. By the way, when you try to download the user guide, you get to answer a question about the usefulness of the information * before * you have received the file. I conclude that IBM prefers users who are able to read documents without actually having the document.
Stephen Arnold, September 27, 2008