Wake Up Time: IBM Watson and Real Journalists

August 11, 2018

I read “IBM Has a Watson Dilemma.” I am not sure the word “dilemma” embraces the mindless hyperbole about Vivisimo, home brew code, and open source search technology. The WSJ ran the Watson ads which presented this Lego collection of code parts one with a happy face. You can check out the Watson Dilemma in your dead tree edition of the WSJ on page B1 or pay for online access to the story at www.wsj.com.

The needle point of the story is that IBM Watson’s push to cure cancer ran into the mushy wall composed of cancerous cells. In short, the system did not deliver. In fact, the system created some exciting moments for those trying to handcraft rules to make Dr. Watson work like the TV show and its post production procedures. Why not put patients in jeopardy? That sounds like a great idea. Put experts in a room, write rules, gather training data, and keep it update. No problem, or so the received wisdom chants.

The WSJ reports in a “real” news way:

…Watson’s recommendations can be wrong.

Yep, hitting 85 percent accuracy may be wide of the mark for some cognitive applications.

From a practical standpoint, numerical recipes can perform some tasks to spin money. Google ads work this magic without too much human fiddling. (No, I won’t say how much is “too much.”)

But IBM believed librarians, uninformed consultants who get their expertise via a Gerson Lehrman phone session, and from search engine optimization wizards. IBM management did not look at what search centric systems can deliver in terms of revenue.

Over the last 10 years, I have pointed out case examples of spectacular search flops. Yet somehow IBM was going to be different.

Sorry, search is more difficult to convert to sustainable revenues than many people believe. I wonder if those firms which have pumped significant dollars into the next best things in information access look at the Watson case and ask themselves, “Do you think we will get our money back?”

My hunch is that the answer is, “No.”

For me, I will stick to humanoid doctors. Asking Watson for advice is not something I want to do.

But if you have cancer, why not give IBM Watson a whirl. Let me know how that works out.

Stephen E Arnold, August 11, 2018

Thoughtspot: Confused in Kentucky over AI for BI Plus Search Plus Analytics

August 3, 2018

i read “Nutanix Co-Founder Lures Away Its President to Be New CEO at ThoughtSpot.” The headline is a speed bump. But what puzzled me was this passage:

ThoughtSpot Inc. has hired Nutanix Inc.’s president as its new CEO. Sudheesh Nair joins ThoughtSpot about three months after the Palo Alto enterprise search business raised $145 million in a funding round that valued the company at more than $1 billion.

I added the emphasis on the phrase “enterprise search business.”

Search is not exactly the hottest buzzword around these days. After shock from the FAST Search & Transfer and IBM Watson adventures I hypothesize.

Now here’s the pothole: The ThoughtSpot Web site states:

Search & AI Driven analytics.

I noted the phrase “next generation analytics for the enterprise.” Plus, ThoughtSpot is a platform.

But what about artificial intelligence? Well, that’s part of the offering as well.

Remarkable: A Swiss Army knife. Many functions which may work in a pinch and certainly better than no knife at all.

But what’s the company do? Gartner suggests the firm has vision.

That helps. The first time around with FAST ESP and IBM Watson-like marketing the slow curves went right by the batters and the buyers. The billion dollar valuation is juicy as well. Another Autonomy? Worth watching.

Stephen E Arnold, August 3, 2018

Enterprise Search: Long Documents Work, Short Documents, Not So Much

July 16, 2018

Enterprise Search goals are notoriously wordy and complex. Is this just a symptom of a complicated system that cannot be explained any other way? Probably not, and it’s all one venture capitalist’s idea, according to Business Insider’s recent story: “One Simple Management Trick to Improve Performance, According to John Doer.”

According to the story which is about Doerr’s book “Measure What Matters.”

“[It] explains the thinking behind the Objectives and Key Results (OKR) goal-setting process famously used by companies like Google, MyFitness Pal, and Intel…. “The theory explains that hard goals “drive performance more effectively than easy goals,” and that “specific hard goals ‘produce a higher level of output’ than vaguely worded ones.”

According to Skyword, there are things you and your vendors can do to reverse this trend (Some may not want to reverse. Hey, it’s your world.). Mainly, it deals with understanding your audience and giving them what they crave.

However, short documents often make sense in context; that is, metadata, information about the sender / author and reader / person looking for information, category tags, and other useful information. Enterprise search, despite the wide availability of low cost or no cost solutions, struggle to make sense of short messages like:

“Doesn’t work.”

Videos, encrypted messages, audio, compound documents—Enterprise search systems struggle and often fail. More OPUD.

Patrick Roland, July 16, 2018

The Future of Enterprise Search: The View of a WeWork World

July 13, 2018

I read “The Future of Enterprise Search: Visual, Voice & Vertical.” My reaction was, “This approach to enterprise search describes a WeWork world.” My view of enterprise search is much, much different. Let me point out that I am okay with voice interfaces, but I am struggling to come to grips with the idea of visual search in an enterprise as one of the three pillars of enterprise search as i understand the function. The “vertical” angle is another way of saying, “Enterprise search does not work as a one size fits all solution. Therefore, let’s embrace a search engine for the legal unit, one for the computational chemists, one for marketers, and so on.

The write up points out that organizations, needs, and marketing are in flux. Uncertainty is the name of the game. That’s why there are employees who aren’t really full time equivalents working in Starbuck’s and WeWork offices. Who has a desk, assistants, and a regular nine to five job? Darned few people today. If we recognize the medieval set up of most organizations, the traditional definition of a job has more in common with the world of the Willy Loman (low man on totem pole, get it?). Life today has kings, court staff, and peasants. The difference is that the staff and peasants have mobile phones; otherwise, we’re back in the 7th century CE.

Skipping over the copy and paste of an Economist chart, the guts of the expository essay explains visual, voice, and vertical. The Vs reminded me of IBM’s alleged insight about Big Data’s volume, velocity, and variety. A mnemonic with alliteration. Okay, just not enterprise search as I have defined it in a number of my books; for example, The New Landscape of Search, published by Panda, years ago.

Enterprise search makes it possible for an employee to obtain the information needed to complete a business task. I pointed out that an employee cannot perform some work without locating digital information and data needed to answer a question. My examples included locating the most recent version of a CEO’s PowerPoint presentation, a list of the suppliers for a particular component in a product, information about an alleged personnel matter which violated the terms of an agreement with a customer, and the lab notes relative to a new compound developed by a chemical engineer with a structure diagram.

Now it would be wonderful if I could speak to a mobile device and have the data for any one of these enterprise search tasks delivered to me. But there are a couple of problems; namely, the screen size and capabilities of most mobile devices. For these information tasks, I personally prefer a multi monitor set up, a printer, plus old fashioned paper and pencil for notes.

The visual search angle is useful when looking for engineering drawings or chemical structures. But the visual component is only a part of the information I needed. That lab notebook is important, particularly if the product is going to be commercialized or patented or used as a bargaining chip in a deal with a potential partner or acquisition.

The vertical part is, as I have said, the reason that the typical organization has dozens of information access systems. A decade ago, according to our research, a Fortune 500 company licensed most of the available enterprise search systems, one or more legal search systems, and the specialist tools for those working with engineering drawings, specifications, vendor profiles, etc.

I don’t want to suggest that the discussion of visual, voice, and vertical search in the Search Engine Journal is with value. The information is simply not on point for today’s organization information access requirements.

For those in the top tier of workers — that is, those with senior positions and staff — the tools needed are more diverse and must be more robust. For those laboring away in WeWork offices, voice and visual search may be the go to ways to get information. The vertical search systems are useful, but for many workers, the expertise required to make a chemical structure search system deliver useful outputs is outside those workers’ skill set without some work and midnight oil.

To sum up, enterprise search is a difficult concept. Simplifying it to the three Vs understates the challenge. Explaining enterprise search in terms of semantic technology, natural language processing, and the other difficult to define jargon sprints to the far end of the complexity spectrum.

That’s why enterprise search is problematic. The vendors hope for a buyer and then head for the beach or a new job. The customers end up like Robinson Caruso, stranded and alone with tools that usually fall into disrepair quickly. Enterprise search itself is jargon, but it is jargon which has been marginalized by systems which over promised and under delivered.

That’s a mnemonic and acronym for you: OPUD.

Stephen E Arnold, July 13, 2018

Sinequa Review: Questions Go Unanswered

July 12, 2018

I read “Sinequa Review” by an outfit called Finances Online. I think the idea is an interesting one. Navigate to a Web page and get a snapshot about a product. In this case, the vendor is Sinequa, and its product is described as “an integrated search platform that can extract information from your interconnected applications and environments.  Aside from letting you draw data rapidly, it affords you actionable and intelligent insights that it gains through its deep content analyses.”

That suggests that Sinequa is more than a search engine. In my book CyberOSINT, I pointed out that next generation information access systems represent the path forward for making sense of digital information. I did not include Sinequa is that book’s profiles of vendors to watch.

Like many vendors of keyword search, Sinequa has been working hard to find a way to describe basic search in terms of higher value functions. The Finances Online write up about Sinequa illustrates the difficulty a company like Sinequa has in describing its various functions; for example, “extract information from your interconnected applications and environments.” I am not sure what that means.

The listing of benefits strikes me as different from what I identified in CyberOSINT. In that monograph, I focused on a system’s ability to identify high value or potentially high interest data automatically, interfaces which move beyond Google style lists of results which create more work for the analyst because relevance and a document’s inclusion of a specific item of data are impossible without directly reading a document, and analytic functions designed to present data in the context of the user.

Contrast CyberOSINT’s key features with Sinequa’s:

  • In depth analytics
  • Connectors
  • Mobile strategy

We have congruence with analytics; however, misses on the other features.

The features of Sinequa are a listing of buzzwords. In CyberOSINT, the idea is that next generation information access systems emphasize outputs, not the mechanisms “under the hood.”

The cost of an enterprise search or NGIA system is a difficult issue. The big expenses for enterprise search or NGIA systems are planning, administration, training, set up, optimization, customization, content preparation, and remediation (most of these systems don’t work “out of the box.”)

Here’s how pricing of Sinequa is explained:

Sinequa is a platform that transforms your organization into an information driven one. If you are interested in powering your company or institution with the solution, you can request custom enterprise pricing from the sales team by phone, email, web form, or chat.

I think that inclusion of downsides about a product is important. Perhaps Finances Online will include:

    • Pricing information from verified customers; for example, last year, the total cost of ownership was …
    • Details about issues with the Sinequa system which may date from 2002
    • A more informed listing of competitors
    • Less jargon and fewer undefined buzzwords; for example, “deep content analytics” and “semantics”.

The trajectory of old school search vendors has been similar to the approach taken to extend the life and usefulness of DEC 10s and 20s. Wrappers of software can keep somethings youthful—at least at first glance. Don’t get me wrong. Quick looks can be useful.

Stephen E Arnold, July 12, 2018

Will Cognitive Search (Whatever That Is) Change Because of Squrro?

July 2, 2018

We are not exactly sure what cognitive search is. That’s a plus. Changing cognitive search should be easy. Tweak technology, do some wordsmithing, and the landscape is different. I think this works for some search wizards in today’s fluid environment for information access.,

Cognitive search is ready to undergo a major shift and a few companies seem to be leading the pack. The way that we not only process search, but the questions we ask, could see a drastic change quite soon. We were alerted to this move from a recent product called Squrro, with their post “Add Context, Accuracy, and Speed to Your Enterprise Searches.”

According to the piece:

“Cognitive search is the new generation of enterprise search that uses artificial intelligence (AI) to return results that are more relevant to the user or embedded in an application issuing the search query. The benefits of understanding and extracting more insights from content changes search from, “This document has the keyword in the title so it must be more relevant” to understanding user intent…”

According to reports, a wide variety of businesses are responding to this more detailed ability of cognitive search to serve customers. For example, one Swiss insurance company recently adopted Squrro technology to provide customers with better search options. The company, Helevetia, claims its five million customers are benefiting from this change.

We will have to wait until industry fonts of wisdom like LinkedIn and CMSWatch provide their views. LinkedIn is, however, a Microsoft entity and there is that Fast Search ESP stub.  CMSWatch often views the world from the perspective of content management or CMS. (Is there such a thing as cognitive CMS?)

Clarity will be forthcoming in step with the anticipated Elastic IPO. That financial event makes enterprise search more interesting again. Frankly making IBM Watson and Coveo equivalents did not do it for our research team.

Here in Harrod’s Creek, we want our old fashioned Boolean queries to return relevant results. We like the old school precision and recall approach.

Patrick Roland, July 2, 2018

What Has Happened to Enterprise Search?

June 28, 2018

I read “Enterprise Search in 2018: What a Long Strange Trip It’s Been.” I found the information presented interesting. The thesis is that enterprise search has been on a journey almost like a “Wizard of Oz” experience.

The idea of consolidation, from my point of view, boils down to executives who want to cash in, get out, and move on. The reasons are not far to seek: Over promising and under delivering, financial manipulations, and positioning a nuts and bolts utility as something that solves information problems.

lava flow fixed

Some, maybe many, licensees of proprietary enterprise search systems may have viewed their investment as an opportunity that delivered an unexpected but inevitable outcome. Where is that lush scenery? Where’s the beach?

The reality is that enterprise search vendors were aced by Shay Banon. His Act II of Compass: A Finding Story was Elasticsearch and the company Elastic. Why not use free and open source software. At least the code gets some bugs fixed unlike old school proprietary enterprise search systems. Bug fixes? Yep, good luck with your Fast Search & Retrieval ESP platform idiosyncrasies.

The landscape today is a bit like the volcanic transformation of Hawaii’s Vacationland. Real estate agents will be explaining that the lava flows have created new beach views, promising that eruptions are a low probability event.

The write up does not highlight one simple fact: Enterprise search has given way to “roll up” services or what I call “meta-plays.” The idea is that search is tucked inside systems like Diffeo, Palantir Gotham, and other “intelligence” platforms.

Why aren’t these enterprise grade solutions sold as “enterprise search” or “enterprise business intelligence and discovery solutions”?

The answer is that the information retrieval nest has been marginalized by the actions of vendors stretching back to the Smart system and to the present with “proprietary” solutions which actually include open source technology. These systems are anchored in the past.

Consider Diffeo?

Why offer enterprise search when one can provide a solution that delivers information in context, provides collaboration tools, and displays information in different ways with a single mouse click?

Read more

Coveo Positions Itself to Fend off Enemies

June 4, 2018

Coveo, one of the numerous players in the race for AI supremacy, took a massive leap forward recently. By securing some substantial investments, the company is poised to make a big splash in the field. However, we are not certain money is the answer to all their concerns, after reading a recent press release on their site, “Coveo Announces $100 Million Investment Led By Evergreen Coast Capital.”

According to the story:

Coveo, a recognized leader in AI-powered insight, recommendations and search engines, has secured a $100 million investment from Elliott Management for a 27% stake in the company. The investment was led by Elliott’s Menlo Park, California-based private equity affiliate, Evergreen Coast Capital.”

Nice work if you can get it, to be sure. However, we will be curious whether or not this money makes much of a dent in the market. For instance, competition like Elastic have been gaining ground and Algolia are actually acquiring other companies in an effort to better position themselves. Keep an eye on this fight, because we suspect the company that comes out on top will begin making a major impact on our daily lives through their AI offerings.

One final thought: Will Coveo and companies like Attivio and LucidWorks be able to generate sufficient revenue to pay off the investors and generate a sustainable revenue stream? From our vantage point 45 minutes from Churchill Downs where gambling is a way of life, we think the odds are long, very long.

Perhaps a larger company will buy one of these three firms, allowing the senior managers to have a big payday and retire. Dassault Systèmes, Hewlett Packard, IBM, and Oracle have expensive search stallions in their stable. We assume there will be other prospects if the revenue race stumbles.

Patrick Roland, June 1, 2018

Search Bias a Big Topic Across the Board

June 1, 2018

Manipulating bias in search is a tricky business that is often left in the hands of contractors or third parties. While that may be good enough for others, you need search results manipulated the way they were intended. Luckily, those options are becoming more common than ever as we discovered when we saw the Thunderstone blog that “Thunderstone releases Version 20 and that includes some beefy upgrades.

According to the post:

“Parametric profiles gain new capabilities to set a bias on a per document field using data from field rules.  This allow documents to be biased up or down in the search results, for example PDF results could be biased down, or documents with “Important” in a meta tag could be ranked higher.”

Ranking PDFs isn’t the only way in which search bias is a tricky business. Often, we are hearing about this factor for al the wrong reasons, like when Google’s search results are biased one way or another. However, the search giant uses similar technology, but on a much more grand scale, to eliminate these issues as they recently pointed out: “Google is committed to making products that work well for everyone, and are actively researching unintended bias and mitigation strategies.” Clearly, bias is an issue on the high and low end of everyone’s spectrum.

Patrick Roland, June 1, 2018


Enterprise Search: Not Even a Bridesmaid Now

June 1, 2018

By this point in our history, you would think that most enterprise search would be created equally. However, there are still some stalwarts that are only now stepping into the golden age of search, as it were. One odd entry into this club was discovered after reading a white paper by Hyland on “Enterprise Search”.

According to the paper on Hyland’s new product:

“Enterprise Search allows you to provide access to the right information even if users don’t know exactly where to find it. Enable anyone to build powerful queries without expertise using intuitive, menu-assisted and natural language search options. You can also find answers when there are misspelled search terms, inexact queries, substandard data, document errors and other faults.”

Seems a little elementary, right? We suspect this has a little to do with playing catchup in the field. Hyland recently acquired Lexmark’s enterprise software arm, which itself was never exactly leading the field. (Lexmark bought ISYS Search Software, an outfit with technology from the mid 1980s if the founder spoke the truth when I interviewed him in 2008.) This will be interesting to see where Hyland takes this technology, though. The company has been around for a while and seems intent on wedging itself into the enterprise data conversation. However, it’ll have to make some light speed advances in order to go toe-to-toe with the big dogs in this battle.

Patrick Roland, June 1, 2018

« Previous PageNext Page »

  • Archives

  • Recent Posts

  • Meta