Browse >
Home / Archive by category 'Enterprise search'
It seems that Microsoft and Yahoo are friends again, at least for the time being. Search Engine Watch announces, “Yahoo and Microsoft Amend Search Agreement.” The two companies have been trying to partner on search for the past six years, but it has not always gone smoothly. Writer Emily Alford tells us what will be different this time around:
“First, Yahoo will have greater freedom to explore other search platforms. In the past, Yahoo was rumored to be seeking a partnership with Google, and under the new terms, Microsoft and Yahoo’s partnership will no longer be exclusive for mobile and desktop. Under the new agreement, Yahoo will continue to serve Bing ads on desktop and mobile, as well as use Bing search results for the majority of its desktop search traffic, though the exact number was undisclosed.
“Microsoft and Yahoo are also making changes to the way that ads are served. Microsoft will now maintain control of the Bing ads salesforce, while Yahoo will take full control of its Gemini ads salesforce, which will leave Bing free to serve its own ads side by side with Yahoo search results.”
Yahoo CEO Marissa Mayer painted a hopeful picture in a prepared statement. She and Microsoft CEO Satya Nadella have been working together, she reports, to revamp the search deal. She is “very excited to explore” the fresh possibilities. Will the happy relationship hold up this time around?
Cynthia Murrell, May 7, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
A new enterprise search system startup is leveraging the SAP HANA Cloud Platform, we learn from “EnterpriseJungle Tames Enterprise Search” at SAP’s News Center. The company states that their goal is to make collaboration easier and more effective with a feature they’re calling “deep people search.” Writer Susn Galer cites EnterpriseJungle Principal James Sinclair when she tells us:
“Using advanced algorithms to analyze data from internal and external sources, including SAP Jam, SuccessFactors, wikis, and LinkedIn, the applications help companies understand the make-up of its workforce and connect people quickly….
“Who Can Help Me is a pre-populated search tool allowing employees to find internal experts by skills, location, project requirements and other criteria which companies can also configure, if needed. The Enterprise Q&A tool lets employees enter any text into the search bar, and find experts internally or outside company walls. Most companies use the prepackaged EnterpriseJungle solutions as is for Human Resources (HR), recruitment, sales and other departments. However, Sinclair said companies can easily modify search queries to meet any organization’s unique needs.”
EnterpriseJungle users manage their company’s data through SAP’s Lumira dashboard. Galer shares Sinclair’s example of one company in Germany, which used EnterpriseJungle to match employees to appropriate new positions when it made a whopping 3,000 jobs obsolete. Though the software is now designed primarily for HR and data-management departments, Sinclair hopes the collaboration tool will permeate the entire enterprise.
Cynthia Murrell, May 4, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
Funnelback, as I have mentioned, has lost some of its marketing oomph. I think some staff shuffling took place. The company is now stepping up its effort to remain visible in a darned tough, crowded, and struggling market sector: Enterprise search.
I read “How Do You Solve a Problem Like Enterprise Search?” My answer is and has been, “One does not. One solves specific information access problems.” The wreckage of Convera, Delphes, Entopia, Fast Search, et al is evidence that enterprise search is a sticky wicket. The howls of pain on the LinkedIn forums and the odd collection of content in the Paper.Li round up about enterprise search make the challenges quite visible.
Read the article. Here are two points I found interesting.
According to the founder of Funnelback:
A holistic enterprise search solution should include:
- Bird’s-eye view metrics of all content, showing where it’s stored (e.g. web vs. enterprise vs. social media), how much exists in each repository, how old it is, missing metadata, poor quality titles, duplication, accessibility metrics, and the link graph. This provides information managers with a means to prioritize organizational investment in managing information, and thereby enhancing search effectiveness.
- Intelligent guidance on how to make content more visible/findable. Search engines generally attempt to hide the internals of their ranking systems and this makes it difficult for customers to learn how to make content more findable. An enterprise search engine should use its internal ranking knowledge to show content authors why pages rank the way they do and provide guidance on how to increase each page’s findability.
- The ability to surface and promote content based on user context with simple rules such as “User is in Department A”, “User is located in New Zealand”, “User is in the finance industry”, “User works for LexisNexis”. These rules can then be overlaid to form more sophisticated rules, without the need to create rules for every distinct possibility. Funnelback goes even further by allowing these rules to be applied to anonymous users by looking up their IP address in an internal database and inferring information based on the organization that owns the IP address.
These are darned interesting “shoulds.” The problem of access controls, contractual and regulatory constraints, and the human practice of creating silos of information are tough nuts to crack. “Shoulds” are easy. Delivering is tough, and Funnelback is neither more or less well equipped than open source or proprietary information retrieval solutions.
The second point illustrates the flawed logic that many champions of enterprise search as a grand solution make. Here’s the passage:
The first question every organization should ask is: Who are the stakeholders affecting the success of our organization and what information do they need to maximize our success?
At a more practical level, this includes questions like:
- What are the personas in our organization? (i.e. the archetypes that represent the different roles)
- What information do they need in order to maximize productivity and make better decisions?
- What are our customer personas?
- What information do they need in order to maximize engagement and have a positive customer experience?
Without asking these questions, organizations sometimes assume that searching everything with a single query (access controls permitting) is the answer. Sometimes it is the answer, but it can be a more complicated and costly exercise than necessary. For example, do users want to use an enterprise search tool to search their own email, or would they prefer to use the search on their mail client?
Sorry, Funnelback. Asking the questions is the first step. The work is to answer the questions and then use that information to tailor a solution that does not anger the users, lead to litigation, or just not work.
Today’s flagship enterprise search vendors seem to include Coveo, dtSearch, Elastic, Funnelback, and a handful of other firms with low profiles. The present crisis in information access has been created by the actions of previous industry leaders in enterprise search.
The fix is to focus on solving a problem for a specific group of users. Lawyers have specialized search tools. Chemists have specialized search tools. Regular employees have Google and whatever findabiliy solution is available within specific applications.
Want to get in a pickle? Sell a clueless senior executive a solution that solves the information access challenges for the entire organization. Didn’t work for STAIRS and won’t work for today’s systems.
The history of search is a painful one. There are options, but these are next generation systems, not yesterday’s systems wrapped with shoulds.
Stephen E Arnold, May 1, 2015
Navigate to http://hits1k.com/fast-enterprise-search/. Here’s the Web page I see:
The spelling of the word “fast” with caps and the reference to artificial intelligence are interesting. Is this company surfing on the “reputation” of the late, much loved, and not forgotten Fast Search & Transfer? Believe it or not the company is integrating
Yes, it is a reseller of EMC and Microsoft explaining search using Microsoft Surface. The video and the marketing suggest that enterprise search is a most interesting discipline. I watched the video, and I would give the solution a whirl if I had a Surface and some interest in standing up and looking down as I dragged stuff from place to place.
Don’t try this app on your mobile phone.
Stephen E Arnold, May 1, 2015
Written by Stephen E. Arnold · Filed Under Enterprise search, News | Comments Off on FAST Marketing: Will It Work?
Enterprise search is limited to how well users tag their content and the preloaded taxonomies. According Tech Target’s Search Content Management blog, text analytics might be the key to turning around poor enterprise search performance: “How Analytics Engines Could Finally-Relieve Enterprise Pain.” Text analytics turns out to only be part of the solution. Someone had the brilliant idea to use text analytics to classification issues in enterprise search, making search reactive to user input to proactive to search queries.
In general, analytics search engines work like this:
“The first is that analytics engines don’t create two buckets of content, where the goal is to identify documents that are deemed responsive. Instead, analytics engines identify documents that fall into each category and apply the respective metadata tags to the documents. Second, people don’t use these engines to search for content. The engines apply metadata to documents to allow search engines to find the correct information when people search for it. Text analytics provides the correct metadata to finally make search work within the enterprise.”
Supposedly, they are fixing the tagging issue by removing the biggest cause for error: humans. Microsoft caught onto how much this could generate profit, so they purchased Equivio in 2014 and integrated the FAST Search platform into SharePoint. Since Microsoft is doing it, every other tech company will copy and paste their actions in time. Enterprise search is gull of faults, but it has improved greatly. Big data trends have improved search quality, but tagging continues to be an issue. Text analytics search engines will probably be the newest big data field for development. Hint for developers: work on an analytics search product, launch it, and then it might be bought out.
Whitney Grace, May 1 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
A new enterprise search system startup is leveraging the SAP HANA Cloud Platform, we learn from “EnterpriseJungle Tames Enterprise Search” at SAP’s News Center. The company states that their goal is to make collaboration easier and more effective with a feature they’re calling “deep people search.” Writer Susn Galer cites EnterpriseJungle Principal James Sinclair when she tells us:
“Using advanced algorithms to analyze data from internal and external sources, including SAP Jam, SuccessFactors, wikis, and LinkedIn, the applications help companies understand the make-up of its workforce and connect people quickly….
“Who Can Help Me is a pre-populated search tool allowing employees to find internal experts by skills, location, project requirements and other criteria which companies can also configure, if needed. The Enterprise Q&A tool lets employees enter any text into the search bar, and find experts internally or outside company walls. Most companies use the prepackaged EnterpriseJungle solutions as is for Human Resources (HR), recruitment, sales and other departments. However, Sinclair said companies can easily modify search queries to meet any organization’s unique needs.”
EnterpriseJungle users manage their company’s data through SAP’s Lumira dashboard. Galer shares Sinclair’s example of one company in Germany, which used EnterpriseJungle to match employees to appropriate new positions when it made a whopping 3,000 jobs obsolete. Though the software is now designed primarily for HR and data-management departments, Sinclair hopes the collaboration tool will permeate the entire enterprise.
Cynthia Murrell, April 27, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
In 2014, we noted that Scientel’s Norman Kutemperor was a leader in NoSQL data management. We learned that Scientel is beating the drum for an integrated, user friendly content management, search, and analytics system. Kutemperor has been described as the father of NoSQL.
According to “Scientel Releases EZContent Content Management and Search System for the Small Enterprise,”
Scientel’s EZContent™ Content Management and Search system operating under GENSONIX® NoSQL DB is an advanced ECM solution for “Big Data” content for the smaller enterprise. Scientel’s EZContent is derived from Scientel’s primary Enterprise Content Management & Search System (ECMS). It is the ideal, most cost-effective, and simple to operate tool for organizing, managing, and retrieving your Big Data contents at all organizational levels. Powerful, yet comprehensive and fun to use, it can start small and is highly scalable. The system can be configured for various system requirements. This makes it ideal for use in small offices/organizations as well as medium and large enterprises.
The company asserts that it has a search system which displays an information object thumbnail. The user drags a document to the system. EZContent processes 40 different file types, including images and video clips. Kutemperor explained the search system this way:
With ECMS, we are able to move the contents of that CD into our ECMS system, and all 100+ people can access that all at the same time. They can also do searches from within what we call textual documents – PDFs, Microsoft documents mostly are all textual documents, whereas clips, videos and pictures are not. By being able to search inside the textual document, we can actually locate what we are looking for and get to the right page that we want to read. Content management is a very valuable tool for all of us, and it is a very helpful tool for all organizations, whether it is non-profit or profit, commercial, corporate, scientific, medical, city government, small businesses or large enterprises. Everybody needs it and now can have it cost-effectively. The basic offering that we can start with is a very small appliance that is turnkey and virtually maintenance free. It is easily installed into the network and pretty much goes to work without having to do too much in the way of setup. For larger organizations, we offer appliances that can scale to very large configurations, that can store very large numbers of documents efficiently, and that are able to locate these documents rapidly.
According to CIOReview:
Scientel’s Gensonix DB is an all in one SQL. Gensonix based solutions can take the place of SQL, NoSQL and storage systems and can process large data sets in real time. Its massive core based parallel solutions deliver performance in range with in memory systems. thus performance of Gensonix on Scientel LDWA hardware matches the performance of in memory systems and with higher reliability.
In 2014 the database was described as “polymorphic.” One explanation is:
Polymorphism is the ability of an entity to behave like more than 1 of its counter parts given a set of circumstances or criteria; or, the provision of a single interface (a shared boundary across which separate components of a computer system exchange information) to entities of different types. In other words, in a polymorphic DB, you can use a relational approach when that is appropriate, hierarchical when that is, and so on. No one paradigm is fully implemented, but the DB uses enough of the features/capabilities needed to provide a reasonable solution to a problem.
These are envelope stretching assertions. The Manta entry for the company reports, perhaps erroneously, that the company has three employees. Another Manta entry asserts that the company employs five to nine people and has revenues of $1.0 million to $2.5 million. For more information about Scientel, navigate to the company’s Web site at www.scientel.com.
Should MarkLogic and other vendors offering similar products up their game? Worth monitoring this Swiss Army knife approach to information access.
Stephen E Arnold, April 25, 2015
Sponsored by CyberOSINT: Next Generation Information Access
Searching is an essential function for basic Internet use and it is a vital function in enterprise systems. While searching on the Internet with a search engine might not seem like a security risk, the comparable action on enterprise search could be potentially dangerous. Security Enterprises points out the potential security risks in the article, “SearchBlox Vulnerabilities Underscore Importance Of Updating Enterprise Search Tools.”
Recently the Carnegie Mellon Software Engineering Institute CERT Division compiled a list of all the security risks from SearchBlox’s software. They included ways for hackers to view private information, upload files, cross-site (XSS) scripting, and cross-site request forgeries. Enterprise security developers can learn from SearchBlox’s vulnerabilities by being aware and repairing them before a hacker discovers the information leak.
The problem, however, might come from within an organization rather than out:
“Of all the possible threats, the ability for cybercriminals to conduct XSS attacks from within the product’s default search box is likely the most concerning, Threatpost reported. On the other hand, anyone trying to take advantage of such SearchBlox vulnerabilities would need to be an authenticated user, though there is no shortage of stories about insider threats within the enterprise.”
The article alludes that SearchBlox’s vulnerabilities came from day-to-day activities that keep an organization running. Using SearchBlox as an example, other organizations with enterprise systems will be able to learn where their own products need patches so the same issues don’t happen with them. So what do you take away: most hackers are probably insiders and look for holes in the ordinary, everyday routines.
Whitney Grace, April 23, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
The article titled Microsoft Beefs Up Office 365’s Delve, Aims To Complete Its Rollout By May on Computerworld discusses the improvements to the enterprise search and discovery app Delve. Delve was built for Office 365’s Office Graph machine learning engine, and helps create and analyze detailed data on users by linking to content through card icons. The article states,
“Based on what it learns about the user’s work, it determines which files, colleagues, documents and data are most relevant and important at any given point, and displays links to them in a graphically rich, card-based dashboard. Delve provides this assistance in real time, so that users can prioritize their work and find the information they need as they participate in whatever work projects and tasks they’re involved in.”
This means that Delve can figure that a user’s upcoming meeting will be about a particular topic with particular colleagues, and then collect information that is relevant in a timely manner for display in the dashboard. Microsoft is currently working to make Delve capable of analyzing email content within Exchange Online attachments. Yammer actions will also be performable in the near future from the Delve interface. It can also, of course, be used more traditionally as a search engine, but Microsoft has big plans for more dynamic and innovative capabilities.
Chelsea Kerwin, April 20, 2014
Stephen E Arnold, Publisher of CyberOSINT at www.xenky.com
I read “Moore’s Law Is Dead, Long Live Moore’s Law.” The “law” cooked up by a chip company suggests that in technology stuff gets better, faster, and cheaper.” With electronic brains getting, better, faster, cheaper, it follows that phones are more wonderful every few weeks. The logic applies to laptops, intelligence in automobiles, and airline related functions.
The article focuses on the Intel-like world of computer parts. The write up makes this point which I highlighted:
From 2005 through 2014, Moore’s Law continued — but the emphasis was on improving cost by driving down the expense of each additional transistor. Those transistors might not run more quickly than their predecessors, but they were often more power-efficient and less expensive to build.
Yep, the cheaper point is significant. The article then tracks to a point that warranted a yellow highlight:
After 50 years, Moore’s Law has become cultural shorthand for innovation itself. When Intel, or Nvidia, or Samsung refer to Moore’s Law in this context, they’re referring to the continuous application of decades of knowledge and ingenuity across hundreds of products. It’s a way of acknowledging the tremendous collaboration that continues to occur from the fab line to the living room, the result of painstaking research aimed to bring a platform’s capabilities a little more in line with what users want. Is that marketing? You bet. But it’s not just marketing.
These two points sparked my thinking about the discipline of enterprise information access. Enterprise search relies on a wide range of computing operations. If these operations are indeed getting better, faster, and cheaper, does it make sense to assume that information retrieval is also getting better, faster, and cheaper?
What is happening from my point of view is that the basic design of enterprise information access systems has not changed significantly in the last decade, maybe longer. There is the content acquisition module, the normalization or transformation module, the indexing module, the query processing module, the administrative module, and other bits and pieces.
The outputs from today’s information access systems do not vary much from the outputs available from systems on offer a decade ago. Endeca generated visual reports by 2003. Relationship maps were available from Inxight and Semio (remember that outfit) even earlier. Smart software like the long forgotten Inference system winnowed results on what the user sought in his or her query. Linguistic functions were the heart and soul of Delphes. Statistical procedures were the backbone of PLS, based on Cornell wizardry.
Search and retrieval has benefited from faster hardware. But the computational burdens piled on available resources have made it possible to layer on function after function. The ability to make layers of content processing and filtering work has done little to ameliorate the grousing about many enterprise search systems.
The fix has not been to deliver a solution significantly different from what Autonomy and Fast Search offered in 2001. The fix has been to shift from what users’ need to deal with business questions to:
- Business intelligence
- Semantics
- Natural language processing
- Cognitive computing
- Metadata
- Visualization
- Text analytics.
I know I am missing some of the chestnuts. The point is that information access may be lagging behind certain other sectors; for example, voice search via a mobile device. When I review a “new” search solution, I often find myself with the same sense of wonder I had when I first walked through the Smithsonian Museum: Interesting but mostly old stuff.
Just a thought that enterprise search is delivering less, not “Moore.”
Stephen E Arnold, April 17, 2015
Written by Stephen E. Arnold · Filed Under Enterprise search, News, Search | Comments Off on The Law of Moore: Is Information Retrieval an Exception?
« Previous Page — Next Page »