dtSearch Offers Three Search Tips

October 26, 2015

Most users key in 2.8 words and let the system do the heavy lifting. What millennial has time to do old fashioned research? Apparently lots. I read “Top 3 Tips to Improve Document Search.” The source of the information is a wizard at dtSearch, a vendor which offers a solution to SharePoint search woes.

Here are the tips. Note that these are useful when using the dtSearch system.

Use Boolean AND, OR, and NOT operators. Yes, even millennials can learn about George Boole’s better idea
Use quotation marks around phrases. Note: This still sort of works on Google too.
“Store OCR output in searchable PDF format.” Raise your hand if you know how to create a searchable PDF. The remainder of the folks answering this question may find this link helpful.

See how easy it is to improve one’s ability to search.

Stephen E Arnold, October 26, 2015

Written by Stephen E. Arnold · Filed Under Enterprise search, News | Comments Off on dtSearch Offers Three Search Tips

Technical Debt: Maybe Real but Let It Go

October 22, 2015

I read “Letting Go Of Technical Debt.” The argument makes sense to a person developing software or other high technology product or service. The conclusion to the write up is, “Let it go.”

Is the conclusion rock solid?

The answer depends on one’s point of view. Google has a handful of staff who think about technical debt in terms of high interest credit card charges. The idea is that if you don’t accelerate your payments, the debt just keeps on growing. Bad for the debtor. Bad for the credit card company.

There is an interesting variant to the notion of non financial debt. Consider intentional debt. The idea is that a marketer hypes a product or service. The reality is different. The customer and the company end up in the hole.

When I think of search and content processing, a number of case examples come to mind. These range from HP’s and IBM’s assumption that by betting big on information access, these ageing businesses can generate substantial, sustainable, organic revenue to search start ups funded by friends and family.

The challenge is to produce enough money from whatever sources are available to:

Create, bug fix, enhance, and support the search product or service. The costs incur what I believe should be called technical debt. Do search vendors get out of the red? The trials and tribulations of Fast Search and HP Autonomy warrant some study. I think this is technical debt.
Market the product so that sales and revenues flow so that continuous transfusions of cash are not required for the search vendor to operate. I call these and related money issues financial debt.
Build a consistent message for prospects, customers, employees, and stakeholders. Search vendors flip flop from keyword search to Big Data search to customer support services. The lack of a consistent, meaningful presentation of the company and its products or services underscores what I call credibility debt.

Net net: Search and content processing companies incur several types of debt. It is easy to say, “Let it go.” Is that approach fair to investors, ethical in the sense of trying to build a sustainable business, and practical in today’s somewhat soft economy?

For me, I think dealing with debt is essential. Obviously some folks do not agree.

Stephen E Arnold, October 22, 2015

Written by Stephen E. Arnold · Filed Under Business strategy, Enterprise search, News | Comments Off on Technical Debt: Maybe Real but Let It Go

Genentech Joins the Google Enterprise Crew

October 22, 2015

Enterprise search offers customizable solutions for organizations to locate and organize their data. Most of the time organizations purchase a search solution is to become more efficient, comply with procedures for quality compliance, and or to further their business development. The latter usually revolves around sales operation planning, program research, customer service, contracts, and tech sales collateral.

Life sciences companies are but one of the few that can benefit from enterprise search solutions. Genentech recently deployed the Google Search Application to improve the three areas listed above. Perficient explains the benefits of enterprise search for a life science company in the video, “Why Life Sciences Leader Genentech Adopted Google Enterprise Search.”

“‘…we explore why life sciences leader Genentech executed Google Search Appliance. “No company is or should ever be static. You have to evolve,’ said CEO Ian Clark.”

Perficient helps companies like Genentech by customizing a search solution by evaluating the company and identifying the areas where it can be improved the most. They host workshops to evaluate where people in different areas must stop to search for information before returning to the task. From the workshops, Perficient can create a business prototype to take their existing business process and improve upon it. Perficient follows this procedure when it deploys enterprise search in new companies.

The video only explains a short version of the process Perficient deployed at Genentech to improve their business operations with search. A full webinar was posted on their Web site: “Google Search For Life Sciences Companies.”

Whitney Grace, October 22, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph

Written by Stephen E. Arnold · Filed Under Acquisition, Data, Enterprise search, Google, News, Search | Comments Off on Genentech Joins the Google Enterprise Crew

Enterprise Search Is a Growth Industry: No, Really

October 16, 2015

I noticed two things when we were working through the Overflight news about proprietary vendors of enterprise search systems on October 14, 2015.

First, a number of enterprise search vendors which the Overflight system monitors, are not producing substantive news. Aerotext, Dieselpoint, and even Polyspot are just three firms with no buzz in social media or in traditional public relations channels. Either these outfits are so busy that the marketers have no time to disseminate information or there is not too much to report.

Second, no proprietary enterprise search vendor is marketing search and retrieval in the way Autonomy and the now defunct Convera used to market. There were ads, news releases, and conference presentations. Now specialist vendors talk about webinars, business intelligence, Big Data, and customer support solutions. These outfits are mostly selling consulting firms. Enterprise search as a concept is not generating much buzz based on the Overflight data.

Imagine my surprise when I read “Enterprise Search Market Expanding at a 12.2% CAGR by 2019.” What a delicious counterpoint to the effective squishing of the market sector which husbanded the Autonomy and Fast Search & Transfer brouhahas. These high profile enterprise search vendors found themselves mired in legal hassles. In fact, the attention given to these once high profile search vendors has made it difficult for today’s vendors to enjoy the apparent success that Autonomy and Fast Search enjoyed prior to their highly publicized challenges.

Open source search solutions have become the popular and rational solution to information access. Companies offering Lucene, Solr, and other non proprietary information access systems have made it difficult for vendors of proprietary solutions to generate Autonomy-scale revenue. The money seems to be in consulting and add ons. The Microsoft SharePoint system supports a hot house of third party components which improve the SharePoint experience. The problem is that none of the add in and component vendors are likely to reach Endeca-scale revenues.

Even IBM with its Watson play seems to be struggling to craft a sustainable, big money revenue stream. Scratch the surface of Watson and you have an open source system complemented with home brew code and technology from acquired companies.

The write up reporting the double digit comp9ound growth rate states:

According to a recent market study published by Transparency Market Research (TMR), titled “Enterprise Search Market – Global Industry Analysis, Size, Share, Growth, Trends and Forecast 2013 – 2019”, the global enterprise search market is expected to reach US$3,993.7 million by 2019, increasing from US$1,777.5 million in 2012 and expanding at a 12.2% CAGR from 2013 to 2019. Enterprise search system makes content from databases, intranets, data management systems, email, and other sources searchable. Such systems enhance the productivity and efficiency of business processes and can save as much as 30% of the time spent by employees searching information.The need to obtain relevant information quickly and the availability of technological applications to obtain it are the main factors set to drive the global enterprise search market.

TMR, like other mid tier consulting firms, will sell some reports to enterprise search vendors who need some good news about the future of the market for their products.

The write up also contains a passage which I found quite remarkable:

To capitalize on opportunities present in the European regional markets, major market players in the U.S. are tying up with European vendors to provide enterprise search solutions.

Interesting. I do not agree. I don’t see to many US outfits tying up with Antidot, Intrafind, or Sinequa and their compatriots. Folks are using Elasticsearch, but I don’t categorize these relationships as tie ups like the no cash merger between Lexalytics and its European partner.

Furthermore, we have the Overflight data and evidence that enterprise search is a utility function increasingly dominated by open source options and niche players. Where are the big brands of a decade ago: Acquired, out of business, discredited, and adorned with jargon.

The problems include sustainable revenue, the on going costs of customer support, and the appeal of open source solutions.

Transparency Market Research seems to know more than I do about enterprise search and its growth rate. That’s good. Positive. Happy.

Stephen E Arnold, October 16, 2015

Written by Stephen E. Arnold · Filed Under Consulting, Enterprise search, Marketing, News | Comments Off on Enterprise Search Is a Growth Industry: No, Really

Web Site Search Goes Camping

October 12, 2015

It is a common fact that if you are a major retailer and your Web site’s search function is horrible, you are losing millions of dollars in sales. Cabela’s is the world’s largest marketer of hunting, fishing, camping, and other outdoor merchandise decided to upgrade their Web site with GroupBy says PR Newswire in the press release, “Cabela’s And GroupBy Partner To Improve Site Search.”

With GroupBy’s advice, Cabela’s has made a good choice:

“After careful evaluation, Cabela’s selected Searchandiser to replace their Oracle Endeca site search, as they required a robust solution that would deliver accurate search results and an improved user experience for their customers. ‘At Cabela’s we strive to continually improve our customer experience and search relevance is an opportunity area we have identified,’ said Scott Johnstone, Cabela’s Technology Partner Relationship Manager. ‘To that end, we are partnering with GroupBy Inc. to leverage their merchandising tools, search expertise and the underlying technology.’”

As Cabela’s market expands, with Searchandiser creates a better online shopping experience for users with more secure transactions. Any outdoor enthusiast with tell you that equipment is vital for a good adventure. As more people are heading outside to experience the great outdoors, they rely on a decent Web site to order their supplies and gear. Cabela’s is set to meet the new surge with better searching functionalities.

Whitney Grace, October 12, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph

Written by Stephen E. Arnold · Filed Under Enterprise search, Management, Marketing, News, Search, Technology | Comments Off on Web Site Search Goes Camping

Innovation: Not Slowing, Stopped in Search

October 11, 2015

I read “You Call this Progress?” Good article.

The write up points out that some fans of progress may be annoyed at the notion that today’s whiz kids are painting rooms, not building houses. I highlighted this statement:

I think we should admit that our hypothetical 1885 person would be more bewildered by the passage of 65 years than the 1950 “modern” human. I think we should admit that the breathtaking pace of major breakthroughs has actually declined.

I am in agreement with this pace of innovation thing. Years ago when I was working in the thrilling field of big time investing, I remember a discussion among some of my colleagues. The point of the argument was the notion that innovation was ripping right along. I suggested that innovation was more like breeding hamsters than creating a new order of furry friends.

No one at the financial outfit really cared. The focus was making money on things that would create investment opportunities. Do not confuse making money with creating a jet engine.

I can offer one possible market sector which illustrates that innovation has slowed to a crawl in the last 50 years.

Consider search and retrieval. If we look at the early systems, there were things like string matching and counting stuff. There was lemmatization. In fact, most of today’s search and content processing companies are not offering products dramatically different from what was available decades ago.

Sure, today’s products sport graphical interfaces, exploit fast and cheap hardware, and coding methods which allow a stream of content to be moved through an information factory. These are interesting developments, but the underlying procedures are look for strings, alert software or a person when an anomaly occurs, and convert counts for an entity into a graph. Toss in some geo coordinates and ring investors’ door bells.

The problem with search is that humans often have a tough time expressing exactly what they want. That’s why looking at search histories and asking questions like, “What’s the signal for a person’s looking for a pizza joint?” work pretty well.

Also, humans may not know what the heck they want. Armed with partial or incomplete information, the poor human has to look through books from a library or browse a list or colorful icons until something appears meaningful.

I would suggest that once the marketing hoo-hah is stripped from the descriptions of search and retrieval systems, what’s left over reveals the paucity of innovation in information access. Google’s biggest recent search innovation is providing a pointer to content available within an app. Interesting but not discovering penicillin.

Perhaps that’s why it is easier to ask friends or colleagues than use Fancy Dan tools?

Stephen E Arnold, October 11, 2015

Written by Stephen E. Arnold · Filed Under Enterprise search, News | Comments Off on Innovation: Not Slowing, Stopped in Search

Attivio Does Data Dexterity

October 9, 2015

Enterprise search company Attivio has an interesting post in their Data Dexterity Blog titled “3 Questions for the CEO.” We tend to keep a close eye on industry leader Attivio, and for good reason. In this post, the company’s senior director of product marketing Jane Zupan posed a few questions to her CEO, Stephen Baker, about their role in the enterprise search market. Her first question has Baker explaining his vision for the field’s future, “search-based data discovery”; he states:

“With search-based data discovery, you would simply type a question in your natural language like you do when you perform a search in Google and get an answer. This type of search doesn’t require a visualization tool. So, for example, you could ask a question like ‘tell me what type of weather conditions which exist most of the time when I see a reduction in productivity in my oil wells.’ The answer that comes back, such as ‘snow,’ or ‘sleet,’ gives you insights into how weather patterns affect productivity. Right now, search can’t infer what a question means. They match the words in a query, or keywords, with words in a document. But [research firm] Gartner says that there is an increasing importance for an interface in BI tools that extend BI content creation, analysis and data discovery to non-skilled users. You don’t need to be familiar with the data or be a business analyst or data scientist. You can be anyone and simply ask a question in your words and have the search engine deliver the relevant set of documents.”

Yes, many of us are looking forward to that day. Will Attivio be the first to deliver? The interview goes on to discuss the meaning of the company’s slogan, “the data dexterity company.” Part of the answer involves gaining access to “dark data” buried within organizations’ data silos. Finally, Zupan asks what “sets Attivio apart?” Baker’s answers: the ability to quickly access data from more sources; deriving structure from and analyzing unstructured data; and friendliness to “non-technical” users.

Launched in 2008, Attivio is headquartered in Newton, Massachusetts. Their team includes folks with an advantageous combination of backgrounds: in search, database, and business intelligence companies.

Cynthia Murrell, October 9, 2015

Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph

Written by Stephen E. Arnold · Filed Under Analytics, Enterprise search, Marketing, News, Real time search | Comments Off on Attivio Does Data Dexterity

Another Categorical Affirmative: Nobody Wants to Invest in Search

October 8, 2015

Gentle readers, I read “Autonomy Poisoned the Well for Businesses Seeking VC Cash.” Keep in mind that I am capturing information which appeared in a UK publication. I find this type of essay interesting and entertaining. Will you? Beats me. One thing is certain. This topic will not be fodder for the LinkedIn discussion groups, the marketers hawking search and retrieval at conferences to several dozen fellow travelers, or in consultant reports promoting the almost unknown laborers in the information access vineyards.

Why not?

The problem with search reaches back a few years, but I will add a bit of historical commentary after I highlight what strikes me as the main point of the write up:

Nobody wants to invest in enterprise search, says startup head. Patrick White, Synata

Many enterprise search systems are a bit like the USS United States, once the slickest ocean liner in the world. The ship looks like a ship, but the effort involved in making it seaworthy is going to be project with a hefty price tag. Implementing enterprise search solutions are similar to this type of ocean-going effort.

There you go. “Nobody.” A categorical in the “category” of logic like “All men are mortal.” Remarkable because outfits like Attivio, Coveo, and Digital Reasoning, among others have received hefty injections of venture capital in recent memory.

The write up makes this interesting point:

“I think Autonomy really messed up [the space]”, and when investors hear ‘enterprise search for the cloud’ it “scares the crap out of them”, he added. “Autonomy has poisoned the well for search companies.” However, White added that Autonomy was just the most high profile example of cases that have scared off investors. “It is unfair just to blame Autonomy. Most VCs have at least one enterprise search in their portfolio. So VCs tend to be skittish about it,” he [added.

I am not sure I agree. Before there was Autonomy, there was Fulcrum Technologies. The company’s marketing literature is a fresh today as it was in the 1990s. The company was up, down, bought, and merged. The story of Fulcrum, at least up to 2009 or so is available at this link.

The hot and cold nature of search and content processing may be traced through the adventures of Convera (formerly Excalibur Technologies) and its relationships with Intel and the NBA, Delphes (a Canadian flame out), Entopia (a we can do it all), and, of course, Fast Search & Transfer.

Now Fast Search, like most old school search technology, is very much with us. For a dose of excitement one can have Search Technologies (founded by some Convera wizards) implement Fast Search (now owned by Microsoft).

Where Are the Former Big Six in Enterprise Search Vendors: 2004 and 2015

Autonomy, now owned by HP and mired in litigation over allegations of financial fraud

Convera, after struggles with Intel and NBA engagements, portions of the company were sold off. Essentially out of business. Alums are consultants.

Endeca, owned by Oracle and sold as an eCommerce and business intelligence service. Oracle gives away its own enterprise search system.

Exalead, owned by Dassault Systèmes and now marketed as a product component system. No visibility in the US.

Fast Search, owned by Microsoft and still available as a utility for SharePoint. The technology dates from the late 1990s. Brand is essentially low profiled at this time.

Verity, Autonomy purchased Verity and used its customer list for upsales and used the K2 technology as part of the sprawling IDOL suite.

Fast Search reported revenues which after an investigation and court procedure were found to be a bit enthusiastic. The founder of Fast Search was the subject of the Norwegian authorities’ attention. You can check out the news reports about the prohibition on work and the sentence handed down for the issues the authorities concluded warranted a slap on the wrist and a tap on the head.

The story of enterprise search has been efforts—sometimes Herculean—to sell information access companies. When a company sells like Vivisimo for about one year’s revenues or an estimated $20 million, there is a sense of getting that mythic task accomplished. IBM, like most of the other acquirers of search technology, try valiantly to convert a utility into something with revenue lift. As I watch the evolution of the lucky exits, my overall impression is that the purchasers realize that search is a utility function. Search can generate consulting and engineering fees, but the customers want more.

That realization leads to the wild and crazy hyper marketing for products like Hewlett Packard’s cloud version of Autonomy’s IDOL and DRE technology or IBM’s embrace of open source search and the wisdom of wrapping that core with functions.

Enterprise search, therefore, is alive and well within applications or solutions that are more directly related to something that speaks to senior managers; namely, making sales and reducing costs.

What’s the cost of making sure the controls for an enterprise search system are working and doing the job the licensee wants done?

The problem is the credit card debt load which Googlers explained quite clearly. Technology outfits, particularly information access players, need more money than it is possible for most firms to generate. This contributes to the crazy flips from search to police analysis, from looking up an entry in a data base to an assertion that customer support is enabled, hunting for an article in this blog is now real time, active business intelligence, or indexing by proper noun like White House morphs into natural language understanding of unstructured text.

Investments are flowing to firms which could be easily positioned as old school search and retrieval operations. Consider Lexmark, a former unit of IBM, and an employer of note not far from my pond filled with mine run off in Kentucky. The company, like Hewlett Packard, wants to find a way to replace its traditional business which was not working as planned as a unit of IBM. Lexmark bought Brainware, a company with patents on trigram methods and a good business for processing content related to legal matters. Lexmark is doing its best to make that into a Trump scale back office content processing business. Lexmark then bought a technology dating from the 1980s (ISYS Search Software once officed in Crow’s Nest I believe) and has made search a cornerstone of the Lexmark next generation health care money spinning machine. Oracle has a number of search properties. Most of these are unknown to Oracle DBAs; for example, Artificial Linguistics, TripleHop, InQuira’s shotgun NLP technology, etc. The point is that the “brands” have not had enough magnetism to pull revenues on a stand alone basis.

Successes measured in investment dollars is not revenue. Palantir is, in effect, a search and retrieval outfit packaged as a super stealthy smart intelligence system. Recorded Future, funded by Google and In-Q-Tel, is doing a bang up job with specialized content processing. There are, remember, search and retrieval companies.

The money in search appears to be made in these plays:

The Fast Search model. Short cuts until an investigator puts a stop to the activities.
Creating a company and then selling it to a larger firm with a firm conviction that it can turn search into a big time money machine
Buying a search vendor to get its customers and opportunities to sell other enterprise software to those customers
Creating a super technology play and going after venture funding until a convenient time arrives to cash out
Pursue a dream for intelligent software and survive on research grants.

This list does not exhaust what is possible. There are me-too plays. There are mobile niche plays. There are apps which are thinly disguised selective dissemination of information services.

The point is that Autonomy is a member of the search and retrieval club. The company’s revenues came from two principal sources:

Autonomy bought companies like Verity and video indexing and management vendor Virage and then sold other products to these firm’s clients and incorporated some of the acquired technology into products and services which allowed Autonomy to enter a new market. Remember Autonomy and enhanced video ads?
Autonomy managed well. If one takes the time to speak with former Autonomy sales professionals, the message is that life was demanding. Sales professionals including partners had to produce revenue or some face time with the delightful Dr. Michael Lynch or other senior Autonomy executives was arranged.

That’s it. Upselling and intense management for revenues. Hewlett Packard was surprised at the simplicity of the Autonomy model and apparently uncomfortable with the management policies and procedures that Autonomy had been using in highly visible activities for more than a decade as a publicly traded company.

Perhaps some sources of funding will disagree with my view of Autonomy. That is definitely okay. I am retired. My house is paid for. I have no charming children in a private school or university.

The focus should be on what the method for generating revenue is. The technology is of secondary importance. When IBM uses “good enough” open source search, there is a message there, gentle reader. Why reinvent the wheel?

The trick is to ask the right questions. If one does not ask the right questions, the person doing the querying is likely to draw incorrect conclusions and make mistakes. Where does the responsibility rest? When one makes a bad decision?

The other point of interest should be making sales. Stated in different terms, the key question for a search vendor, regardless of camouflage, what problem are you solving? Then ask, “Will people pay money for this solution?”

If the search vendor cannot or will not answer these questions and provide data to be verified, the questioner runs the risk of taking the USS United States for a cruise as soon as you have refurbed the ship, made it seaworthy, and hired a crew.

The enterprise search sector is guilty of making a utility function appear to be a solution to business uncertainty. Why? To make sales. Caveat emptor.

Stephen E Arnold, October 8, 2015

Written by Stephen E. Arnold · Filed Under Enterprise, Enterprise search, News, Text processing | 1 Comment

IBM Defines Information Access the Madison Avenue Way

October 7, 2015

Yesterday (October 6, 2015) I wrote a little dialogue about the positioning of IBM as the cognitive computing company. I had a lively discussion at lunch after the story appeared about my suggesting that IBM was making a grand stand play influenced by Madison Avenue thinking, not nuts and bolts realities of making sales and generating revenue.

Well, let’s let IBM rejiggle the line items in its financial statements. That should allow the critics of the company to see that Watson (which is the new IBM) account for IBM revenues. I am okay with that, but for me, the important numbers are the top line revenue and profit. Hey, call me old fashioned.

In the midst of the Gartner talk about IBM, the CNBC exclusive with IBM’s Big Blue dog (maybe just like the Gartner talk and thus not really “exclusive”?), and the wall paper scale ads in the New York Times and Wall Street Journal, there was something important. I don’t think IBM recognizes what it has done for the drifting, financially challenged, and incredibly fragmented search and content processing market. Even the LinkedIn enterprise search discussion group which bristles when I quote Latin phrases to the members of the group will be revivified.

Indexing and groupoiing are useful functions. When applied with judgment, an earthworm of unrelated words and phrases may communicate more effectively.

To wit, this is IBM’s definition of Watson which is search based on Lucene, home brew code, and IBM acquisitions’ software:

Author extraction—Lots of “extraction” functions
Concept expansion
Concept insights—I am not sure I understand the concept functions
Concept tagging—Another concept function
Dialog—Part of NLP maybe
Entity extraction—Extraction
Face detection with the charming acronym F****d—Were the Mad Ave folks having a bit of fun?
Feed detection—Aha, image related
Image Link extraction—Aha, keeping track of urls
Image tagging—Aha, image indexing. I wonder is this is recognition or using information in the file or a caption
Keyword extraction
Language detection
Language translation
Message resonance—No clue here in Harrod’s Creek
Natural language classifier—NLP again
Personality insights—Maybe figuring out what the personality of the author of a processed file means?
Question and answer (I think this is natural language processing which incorporates many other functions in this list)—More NLP
Relationship extraction—IBM has technology from its purchase of i2 which performs this function. How does this work on disparate streams of unstructured content? I have some thoughts
Review and rank—Does this mean relevance ranking?
Sentiment analysis—Yes, is a document with the word F****d in it positive or negative
Speech to text—Seems similar to text to speech
Taxonomy—Ah, ha. A system to generate a list of controlled terms. No humans needed? Nah, humans can be billable and it is an IBM function
Text extraction—Another extraction function
Text to speech
Tone analyzer—So what is the tone of a document containing the string F****d?
Tradeoff analytics—Hmm. Now Watson is doing a type of analytics presumably performed on text? What are the thresholds in the numerical recipe? Do the outputs make sense to a normal human?
Visual recognition—Baffller
Watson news—Is this news about Watson or news presented in Watson via a feed-type mechanism. Phrase does not even sound cool to me.

Now that’s a heck of a list. Notice that the word “search” does not appear in the list. I did not spot the word “semantics” either. Perhaps I was asleep at the switch.

When I was in freshman biology class in 1962, Dr. Daphne Swartz, a very traditional cut ‘em up and study ‘em scientist, lectured for 90 minutes about classification. I remember learning about Aristotle and this dividing organizations into two groups: Plants and animal. I know this is rocket science, but bear with me. There was the charmingly named Carolus Linnaeus, a fan of herring I believe, who cooked up the kingdom, genus, species thing. Then there was, much later, the wild and crazy library crowd which spawned Dewey or, as I named him, Mr. Decimal.

Why is this germane?

It seems to me that IBM’s list of Watson functions needs a bit of organization. In fact, some of the items appear to below to other items; for example: language detection and language translation. More egregious is the broad concept of natural language processing. One could, if one were motivated, argue that entity extraction, text extraction, and keyword extraction might look similar to a non-Watsonian intellect. Dr. Swartz would probably have some constructive criticism to offer.

What’s the purpose of this earthworm list?

Beats me. Makes IBM Watson seem more than Lucene with add ons?

Stephen E Arnold, October 7, 2015

Written by Stephen E. Arnold · Filed Under Enterprise search, Marketing, News, Text processing | Comments Off on IBM Defines Information Access the Madison Avenue Way

Legacy Servers: Upgrade Excitement

October 2, 2015

Enterprise management systems (ECM) were supposed to provide an end all solution for storing and organizing digital data. Data needs to be stored for several purposes: taxes, historical record, research, and audits. Government agencies deployed ECM solutions to manage their huge data loads, but the old information silos are not performing up to modern standards. GCN discusses government agencies face upgrading their systems in “Migrating Your Legacy ECM Solution.”

When ECMs first came online, information was stored in silos programmed to support even older legacy solutions with niche applications. The repositories are so convoluted that users cannot find any information and do not even mention upgrading the beasts:

“Aging ECM systems are incapable of fitting into the new world of consumer-friendly software that both employees and citizens expect. Yet, modernizing legacy systems raises issues of security, cost, governance and complexity of business rules — all obstacles to a smooth transition. Further, legacy systems simply cannot keep up with the demands of today’s dynamic workforce.”

Two solutions present themselves: data can be moved from an old legacy system to a new one or simply moving the content from the silo. The barriers are cost and time, but the users will reap the benefits of upgrades, especially connectivity, cloud, mobile, and social features. There is the possibility of leaving the content in place using interoperability standards or cloud-based management to make the data searchable and accessible.

The biggest problem is actually convincing people to upgrade. Why fix what is not broken? Then there is the justification of using taxpayers’ money for the upgrade when the money can be used elsewhere. Round and round the argument goes.

Whitney Grace, October 2, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph

Written by Stephen E. Arnold · Filed Under Cloud computing, Data, Enterprise search, Mobile, News, Search, Security, Technology | Comments Off on Legacy Servers: Upgrade Excitement

« Previous Page — Next Page »

Search the site
Subscribe to Beyond Search
Feature archive
News archive

Stephen E. Arnold monitors search, content processing, text mining and related topics from his high-tech nerve center in rural Kentucky. He tries to winnow the goose feathers from the giblets. He works with colleagues worldwide to make this Web log useful to those who want to go "beyond search". Contact him at sa [at] arnoldit.com. His Web site with additional information about search is arnoldit.com.

Categories
- 3D-Printing
- Acquisition
- Advertising
- Aggregation
- AI
- Alexa
- algorithms
- Amazon
- Amazonia
- Analytics
- Appliance
- Applications
- Audio
- Augmented Reality
- Big data
- Bing
- Bitcoin
- Bitext
- Book review
- Business intelligence
- Business process
- Business strategy
- Censorship
- Cloud computing
- Company Profile
- Conferences
- Connectors
- Consulting
- Consumer
- Content processing
- Copyright
- Corporate Concerns
- Cost
- Crawl
- Crowdfunding
- cryptocurrency
- Customer support
- Cyber OSINT
- cybercrime
- cybersecurity
- Dark Web
- DarkCyber
- Data
- Data mining
- Database
- Deepfakes
- Digital Assistant
- Digital Library
- E2EE
- ECommerce
- EDiscovery
- Editorial opinion
- Education
- Emoticons
- Employment
- Enterprise
- Enterprise search
- Entity extraction
- Ethics
- Facebook
- Faceted search
- Factualities
- Feature
- Federated search
- Financial
- Fogint
- Google
- Governance
- Government
- Hackers
- healthcare
- IBM Watson
- Image search
- Indexing
- Infrastructure
- Innovation
- Integration
- intelware
- Interface
- Internet
- Interview
- Investment
- law enforcement
- Legal matters
- Library automation
- Management
- Marketing
- Mathematics
- Metadata
- Microsoft
- Mobile
- Natural language processing
- News
- NGIA
- Online (general)
- Open Access
- Open source
- OSINT
- Osint Radar
- Overflight
- Palantir
- Patents
- Personnel
- Podcast
- Policeware
- Portals
- Predictive coding
- Privacy
- Profile
- Publishing
- Quotation
- Real time search
- Reference tool
- Rich media
- Robot Writer
- Search
- Search enabled applications
- search engine
- Search quality
- Security
- Semantic
- Sentiment analysis
- SEO
- SharePoint
- Short Honks
- Smart Technology
- Social
- Social Media
- software
- Statistics
- Taxonomy
- Technology
- Telegram
- Text analytics
- Text processing
- Tools
- Tor
- Training
- Translation
- Twitter
- Uncategorized
- Unstructured Data
- User experience
- User Interface
- Vertical search
- Video
- visualization
- Voice search
- Voice technology
- Web 3
- Web Services
- Webinar
- Windows
- Work flow
- XML
- Yahoo

Beyond Search

dtSearch Offers Three Search Tips

Technical Debt: Maybe Real but Let It Go

Genentech Joins the Google Enterprise Crew

Enterprise Search Is a Growth Industry: No, Really

Web Site Search Goes Camping

Innovation: Not Slowing, Stopped in Search

Attivio Does Data Dexterity

Another Categorical Affirmative: Nobody Wants to Invest in Search

IBM Defines Information Access the Madison Avenue Way

Legacy Servers: Upgrade Excitement

Search the site

Categories

Archives

Recent Posts

Meta

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Search the site

Categories

Archives

Recent Posts

Meta