
Elastic Links Search and Social Through Graph Capabilities

September 13, 2016

The article titled Confused About Relationships? Elasticsearch Gets Graphic on The Register communicates the latest offering from Elasticsearch, the open-source search server based on Apache’s Lucene. Graph capabilities are an exciting new twist on search that enables users to map out relationships through the search engine and the Kibana data visualization plug-in. The article explains,

By fusing graph with search, Elastic hopes to combine the power of social with that earlier great online revolution, the revolution that gave us Google: search. Graph in Elasticsearch establishes relevance by establishing the significance of each relationship versus the global average to return important results. That’s different to what Elastic called “traditional” relationship mapping, which is based on a count of the frequency of a given relationship.
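The distinction in the quoted passage, counting how often a relationship appears versus asking how much it stands out from the global average, can be sketched in a few lines of Python. This is a minimal illustration and not Elastic's implementation: the function names, the sample shopping data, and the scoring heuristic (absolute lift multiplied by relative lift, with add-one smoothing) are all assumptions made for the example.

```python
from collections import Counter


def frequency_rank(foreground):
    """Rank terms by raw count in the foreground set (the "traditional" view)."""
    return [term for term, _ in Counter(foreground).most_common()]


def significance_rank(foreground, background):
    """Rank terms by how over-represented they are relative to the background.

    The score is an assumed heuristic: (fg_rate - bg_rate) * (fg_rate / bg_rate).
    """
    fg_counts = Counter(foreground)
    bg_counts = Counter(background)
    fg_total = len(foreground)
    bg_total = len(background)
    scores = {}
    for term, count in fg_counts.items():
        fg_rate = count / fg_total
        # Add-one smoothing so a term missing from the background
        # does not cause a division by zero.
        bg_rate = (bg_counts[term] + 1) / (bg_total + 1)
        scores[term] = (fg_rate - bg_rate) * (fg_rate / bg_rate)
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical data: items bought alongside one product (foreground)
# versus items bought across all orders (background).
foreground = ["milk"] * 5 + ["bread"] * 4 + ["saffron"] * 3
background = ["milk"] * 500 + ["bread"] * 400 + ["saffron"] * 5 + ["other"] * 95

by_frequency = frequency_rank(foreground)
by_significance = significance_rank(foreground, background)
```

In the made-up data, milk tops the frequency ranking simply because nearly everyone buys milk, while saffron tops the significance ranking because it is rare overall but common in this particular basket.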

Elasticsearch sees potential for their Graph capabilities in behavioral analysis, particularly in areas such as drug discovery, fraud detection, and customized medicine and recommendations. When it comes to identifying business opportunities, Graph databases have already proven their value. Discovering connections and trimming degrees of separation are all of vital importance in social media. Social networks like Twitter have been using them since the beginning of NoSQL. Indeed, Facebook is a customer of Elastic, the business version of Elasticsearch that was founded in 2012. Other users of Elasticsearch include Netflix, StumbleUpon, and Mozilla.

Chelsea Kerwin, September 13, 2016
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
There is a Louisville, Kentucky Hidden Web/Dark Web meet up on September 27, 2016.
Information is at this link: https://www.meetup.com/Louisville-Hidden-Dark-Web-Meetup/events/233599645/

Autonomy Back Home in Merrie Olde England

September 12, 2016

I read “Hewlett Packard Offloads Last Autonomy Assets in Software Deal.” I think that Autonomy is now going back home. Blood pudding, the derbies, and Indian take aways—yes, the verdant isle.

The union of Hewlett Packard (once an ink outfit) and the love child of Bayesian and Laplacian methods is burst asunder. HPE (the kissin’ cousin of the ink outfit) fabricated a deal only lawyers, MBAs, and accountants can conjure.

There is an $8 billion deal, cash to HPE, and a fresh swath of lush pasture for Micro Focus to cultivate.

I learned:

“Autonomy doesn’t really exist as an entity, just the products,” said Kevin Loosemore, executive chairman of Micro Focus. Loosemore said the Newbury-based business conducted due diligence across all of the products included in the deal, with no different approach taken for the Autonomy assets. No legal liabilities from Autonomy will be transferred to Micro Focus.

Integration is what Micro Focus does. Autonomy embodied in products was once a goal for some senior Autonomy executives. The golden sun is rising over the mid 1990s technology.

We wish Micro Focus well. We wish HPE well as it moves toward the resolution of its claims against Autonomy for assorted misdeeds.

Without search, HPE ceases to interest me. While HPE was involved in search, there was some excitement generated, but that is winding down and, for some I imagine, has long since vaporized.

I will have fond memories of HP blaming Autonomy for HP’s decision to buy Autonomy. Amazing. One of the great comedic moments in search and fading technology management.

Autonomy is dead. Long live Autonomy. Bayes lasted 60 years; Autonomy may have some legs even if embodied in other products. IDOL hands are the devil’s playthings I think. PS. I will miss the chipper emails from BM.com. Substantive stuff.

Stephen E Arnold, September 12, 2016

Ads Appear Here, There, and Everywhere Across Google Landscape

September 12, 2016

The article on CNN Money titled Google Is Going to Start Showing You More Ads discusses the surge in ads that users can expect to barely notice over the coming weeks and months. In efforts to ramp up mobile ad revenue to match the increasing emphasis on mobile search, Google is making mobile ads bigger, more numerous, and just more. The article explains,

Google will be simplifying the work flow for businesses to create display ads with images. The company says advertisers need to “simply provide headlines, a description, an image, and a URL,” and Google will automatically design ads for the business. Location-based ads will start showing up on Google too. If you search for “shoe store” or “car repair near me,” ads for local businesses will populate the search results… The changes come as Google is trying to stay ahead of customers’ changing demands.

Google claims in the article that the increase is already showing strong results for advertisers, with click-through rates (CTR) up 20%. But it is hard to believe. As ads flood the space between articles, search results, and even Google Map directions, they seem to be no more significant than an increase in white noise. If Google really wants to revolutionize marketing, they are going to need to dig deeper than just squeezing more ads in between the lines.

Chelsea Kerwin, September 12, 2016


Google Springboard: Diving into Familiar Water

September 10, 2016

In June 2016, we learned that Google, the creator of the late Google Search Appliance, was bouncing up and down on the enterprise search diving board with a replacement for the champion. Springboard, GOOG’s latest “new” search product, was, like the GSA, designed to put the right information at one’s fingertips. After the announcement in the Google for Work Official Blog, the product has done shallow dives in kiddie pools. Three laps later, Google is checking out more competitive indoor swimming facilities.

We learned this in “Box Teams Up with Google for Docs and Springboard Integration.” The announcement reveals a different approach to enterprise search for the GOOG. In the good old days, one could pony up hefty sums to license the Google Search Appliance. Google had determined more than a decade ago that on premises enterprise search systems like Autonomy IDOL (RIP) or Fast Search & Transfer ESP were too difficult for mere mortals to deploy in a cost effective manner. Google figured a search appliance, a finding toaster if I may craft a metaphor, was the solution. It really wasn’t. Google backed away from the expensive servers. From the get go, Google’s use of on premises, old fashioned hardware seemed to run counter to the Google cloud ad search business.

We noted this statement in the “Box Teams Up” write up:

It may seem a little odd for Google to be collaborating with Box on cloud storage when Google has its own offering there, which is also a revenue driver for the search giant. But the partnership is actually only really likely to benefit customers of both groups, without really biting into the customer base of either, given the distinctions between what Box and Google Drive can provide.

The major features of Springboard from what we can see from our cabin in Harrod’s Creek are:

  • Connectors to federate content
  • Quick and easy searching across the content
  • Assistance with “useful and actionable information throughout the day.”

For more than six years the savvy Alphabet Google thing watched Amazon, Elastic, SearchBlox, Yippy and other vendors roll out cloud search solutions. As surprising as it is to some people, Google’s slow response to cloud based enterprise search underscores the malaise which seems to be emerging around the volleyball court. Will Googlers execute perfectly an arm stand back double somersault tuck into the pool from its Springboard?

Google’s marketing reminded me that one spends 19 percent of one’s time looking for information. If I owned a GSA (which I no longer possess), that device did not really help me out if Google’s data are correct. Will Springboard?

We will have to wait for an enterprise search competition before we know if Google wins a medal. One hopes Springboard will have that Elastic bounce.

Stephen E Arnold, September 10, 2016

Enterprise Search: Pool Party and Philosophy 101

September 8, 2016

I noted this catchphrase: “An enterprise without a semantic layer is like a country without a map.” I immediately thought of this statement made by Polish-American scientist and philosopher Alfred Korzybski:

The map is not the territory.

When I think about enterprise search, I am thrilled to have an opportunity to do the type of thinking demanded in my college class in philosophy and logic. Great fun. I am confident that any procurement team will be invigorated by an animated discussion about representations of reality.

I did a bit of digging and located “Introducing a Graph-based Semantic Layer in Enterprises” as the source of the “country without a map” statement.

What is interesting about the article is that the payload appears at the end of the write up. The magic of information representation as a way to make enterprise search finally work is technology from a company called Pool Party.

Pool Party describes itself this way:

Pool Party is a semantic technology platform developed, owned and licensed by the Semantic Web Company. The company is also involved in international R&D projects, which continuously impact the product development. The EU-based company has been a pioneer in the Semantic Web for over a decade.

From my reading of the article and the company’s marketing collateral it strikes me that this is a 12 year old semantic software and consulting company.

The idea is that there is a pool of structured and unstructured information. The company performs content processing and offers such features as:

  • Taxonomy editor and maintenance
  • A controlled vocabulary management component
  • An audit trail to see who changed what and when
  • Link analysis
  • User role management
  • Workflows.

The write up with the catchphrase provides an informational foundation for the company’s semantic approach to enterprise search and retrieval; for example, the company’s four layered architecture:


The base is the content layer. There is a metadata layer which in Harrod’s Creek is called “indexing”. There is the “semantic layer”. At the top is the interface layer. The “semantic” layer seems to be the secret sauce in the recipe for information access. The phrase used to describe the value added content processing is “semantic knowledge graphs.” These, according to the article:

let you find out unknown linkages or even non-obvious patterns to give you new insights into your data.

The system performs entity extraction, supports custom ontologies (a concept designed to make subject matter experts quiver), text analysis, and “graph search.”

Graph search is, according to the company’s Web site:

Semantic search at the highest level: Pool Party Graph Search Server combines the power of graph databases and SPARQL engines with features of ‘traditional’ search engines. Document search and visual  analytics: Benefit from additional  insights through interactive visualizations of reports and search results derived from your data lake by executing sophisticated SPARQL queries.

To make this more clear, the company offers a number of videos via YouTube.

The idea reminded us of the approach taken in BAE NetReveal and Palantir Gotham products.

Pool Party emphasizes, as does Palantir, that humans play an important role in the system. Instead of “augmented intelligence,” the article describes methods which “combine machine learning and human intelligence.”

The company’s annual growth rate is more than 20 percent. The firm has customers in more than 20 countries. Customers include Pearson, Credit Suisse, the European Commission, Springer Nature, Wolters Kluwer, and the World Bank and “many other customers.” The firm’s projected “Euro R&D project volume” is 17 million (although I am not sure what this 17,000,000 number means). The company’s partners include Accenture, Complexible, Digirati, and EPAM, among others.

I noted that the company uses the catchphrase: “Semantic Web Company” and the catchphrase “Linking data to knowledge.”

The catchphrases, I assume, make it easier for some to understand the firm’s graph based semantic approach. I am still mired in figuring out that the map is not the territory.

Stephen E Arnold, September 8, 2016

HonkinNews for September 6, 2016, Now Available

September 6, 2016

If you visit Zimbabwe, what risks do you face when you use Facebook? Is the CIA’s investment arm too secretive? Whom do you consult to get the inside scoop about legacy code running on the mainframe in the basement? For the answers to these questions, invest six minutes in the September 6, 2016, edition of HonkinNews, a round up of stories from Beyond Search. You can view this week’s program at this link or click on the embedded viewer on the Beyond Search blog.

Kenny Toth, September 6, 2016

Watson Ads for Branded Answers to the Little Questions of Life

September 6, 2016

Here is a potent new way for brands to worm their way into every aspect of consumers’ lives. “IBM Watson Is Now Offering AI-Powered Digital Ads That Answer Consumers’ Questions,” we learn from AdWeek. Watson Ads will hook users up with answers to their everyday questions—answers supplied by advertisers. Apparently, IBM’s Weather-Company acquisition supplied the tools behind this product. Writer Christopher Heine explains:

IBM’s relatively new ownership of The Weather Company’s digital properties is coming into play in a serious fashion: Watson Ads will first appear on Weather.com, the Weather mobile app and the company’s data-driven WeatherFX platform. Later, IBM plans to allow them to appear on third-party properties.

Campbell Soup Company, Unilever and GSK Consumer Healthcare are some of the brands that will run the ads in the coming days. Watson Ads’ pricing details were not disclosed.

Jeremy Steinberg, global head of sales, The Weather Company, described how they work, stating that ‘machine learning and natural-language capabilities will allow it to provide accurate responses. What we’re doing is moving away from keyword searches and towards more natural language and well-reasoned answers.’

Heine outlines Campbell’s plan as an example—their Watson Ads will present “Chef Watson,” the helpful AI which suggests recipes based on criteria like available ingredients, the time of day, and what the weather is like. Those recipes will be pulled from Campbell’s existing site Campbell’s Kitchen. Not surprisingly, their ingredient lists rely heavily on Campbell’s product line (which goes well beyond soup these days).

Another Watson Ads client is GSK Consumer Healthcare, which plans to use the tech to help users make better real-time health decisions—a worthy project, I’ll admit. I am curious to see how Unilever, and other companies down the line, will leverage their digital voices of authority. See the article for more details on the project.

Cynthia Murrell, September 6, 2016

Verizon Strategizes to Get Paid for Installing Big Brand Apps That You Will Probably Never Open

September 5, 2016

The article titled Verizon Offered to Install Marketers’ Apps Directly on Subscribers’ Phones on AdAge discusses the next phase in Verizon’s marketing strategy, a seeming inheritance of product placement: automatic installations for big brands onto your phone. Next time you notice an app that you didn’t download on your phone, look no further. Verizon has been in talks with both retail and finance brands about charging between $1 and $2 per device, which sounds small until you multiply it by 75 million Verizon smartphone subscribers (roughly $75 million to $150 million). The article discusses some of the potential drawbacks.

Verizon has stoked some user frustration in the past with “bloatware,” as have many carriers and phone manufacturers. Bloatware comprises the often irrelevant apps that arrive pre-installed on phones, though they’re less often major brands’ apps and more often small, proprietary services from the carriers and manufacturers…There is no guarantee, however, that Verizon subscribers open the apps they find pre-installed on their phones. “If a user is not interested, they just delete it without activating.”

Sara Choi, COO of AirFox, is quoted in the article making a great point about how important it is for carriers to innovate new strategies for profit growth. Ultimately, the best use for this marketing technique is a huge number of immediate downloads. How to engage users once you have gotten into their phones is the next question. If this goes through, there will be no need to search to get an ad, which could mean bad news for online ad search.

Chelsea Kerwin, September 5, 2016

The Zen of More Tabs from Yandex

September 5, 2016

Serendipitous information discovery has been attempted through many apps, browsers and more. Attempting a solution, Russia’s giant in online search, Yandex, launched a new feature for its browser. A news release from PR Newswire that appeared on 4 Traders, entitled Yandex Adds AI-based Personal Recommendations to Browser, tells us more. Fueling this feature is Yandex’s personalized content recommendation technology called Zen, which selects articles, videos, images and more for its infinite content stream. This is the first time personally targeted content will appear in new tabs for the user. The press release offers a description of the new feature,

The intelligent content discovery feed in Yandex Browser delivers personal recommendations based on the user’s location, browsing history, their viewing history and preferences in Zen, among hundreds of other factors. Zen uses natural language processing and computer vision to understand the verbal and visual content on the pages the user has viewed, liked or disliked, to offer them the content they are likely to like. To start exploring this new internet experience, all one needs to do is download Yandex Browser and give Zen some browsing history to work with. Alternatively, liking or disliking a few websites on Zen’s start up page will help it understand your preferences on the outset.

The world of online search and information discovery is ever-evolving. For a preview of the new Yandex feature, go to their demo. This service works on all platforms in 24 different countries and in 15 different languages. The design of this feature implies people want to actually read all of their recommended content. Whether that’s the case or not, whether Zen is accurate enough for the design to be effective, time will tell.

Megan Feil, September 5, 2016

The Wheel of Search: Leidos

September 3, 2016

I know that most experts in search and content processing do not know too much about TeraText, the search system once owned by SAIC, a services firm. TeraText is described in this free profile. I read “Leidos Closes Lockheed Merger.” What I wanted to point out is that Lockheed Martin is “back into” the search business. The company sold its AeroText system, which is similar in some ways to Leidos TeraText, to Rocket Software. With this deal, one services firm has moved search from its core business to a subsidiary and then sold that entity (Leidos) to a major US government contractor. Now Lockheed Martin is back in the search and content processing business. I find this brokering of search and content systems interesting because the technology is becoming dated and the systems require substantial professional support to install, optimize, operate, maintain, and extend. The wheel of search keeps on turning on axles of decades old technology. There is money in search, particularly some massive, complex systems.

Stephen E Arnold, September 3, 2016
