Quantum Search: Consultants, Rev Your Engines

April 18, 2019

Search is a utility function. A number of companies have tried to make it into a platform upon which a business or a government agency’s mission rests. Nope.

In fact, for a decade I published “Beyond Search” and just got tired of repeating myself. Search works if one has a bounded domain, controlled vocabularies, consistent indexing, and technology which embraces precision and recall.

Today, not so much. People talk about search and lose their grip on the accuracy, relevance, and verifiability of the information retrieved. It’s not just wonky psycho-economic studies which cannot be replicated. Just try running the same query on two different mobile phones owned by two different people.

Against this background, please, read “How the Quantum Search Algorithm Works.” The paper contains some interesting ideas; for example:

It’s incredible that you need only examine an NN-item search space on the order of \sqrt{N}N?times in order to find what you’re looking for. And, from a practical point of view, we so often use brute search algorithms that it’s exciting we can get this quadratic speedup. It seems almost like a free lunch. Of course, quantum computers still being theoretical, it’s not quite a free lunch – more like a multi-billion dollar, multi-decade lunch!

Yes, incredible.

However, the real impact of this quantum search write up will be upon the search engine optimization crowd. How quickly will methods for undermining relevance be found.

Net net: Quantum or not, search seems destined to repeat its 50 year history in a more technically sophisticated computational environment. Consultants, abandon your tired explanations of federated search. Forget mere geo-tagging. Drill right into the heart of quantum possibilities. I am eagerly awaiting a Forrester wave report on quantum search and a Gartner magic quadrant, filled with subjective possibilities.

Stephen E Arnold, April 18, 2019

Expert System: Interesting Financials

April 6, 2019

Expert System SpA is a firm providing semantic software that extracts knowledge from text by replicating human processes. I noticed information on the company’s Web site which informed me:

  • The company had sales revenues of 28.7 million euros for 2018
  • The company’s growth was 343 percent compared to 2017
  • The net financial position was 12.4 million euros up from 8.8 million euros in March 2017.

Remarkable financial performance.

Out of curiosity I navigated to Google Finance and plugged in Expert System Spa to see what data the GOOG could offer.

Here’s the chart displayed on April 6, 2019:

image

The firm’s stock does not seem to be responding as we enter the second quarter of 2019.

Read more

Netwrix Buys Concept Searching

April 5, 2019

Late last year we learned that Concept Searching was selling itself to Netwrix. I don’t pay much attention to “finding” solutions. I thought of Concept Searching in the context of the delay in awarding the JEDI contract. Concept Searching might be a nifty add on if Microsoft gets the $10 billion deal.

Concept Searching had positioned itself as an indexing outfit and taxonomy management tool. The company struck me as having a Microsoft-centric focus and dabbled in enterprise search and jousted with Smartlogic.

According to the company’s founder Martin Garland:

Concept Searching is excited about becoming a part of Netwrix. Merging our unique technology with its exceptional Netwrix Auditor product delivers a new level of protection to organizations concerned about data security, with the ability to identify and remediate personal or organizationally defined sensitive information, regardless of where it is stored or how it was ingested. The expanded team will enable us to be even more agile, increasingly responsive to our clients’ needs, and to deliver a platform for growth to both client bases and ensure we maintain our leadership position in delivering world-class metadata-driven solutions.

Netwrix is a software company focused exclusively on providing IT security and operations teams with pervasive visibility into user behavior, system configurations and data sensitivity across hybrid IT infrastructures to protect data regardless of its location. The company has 10,000 customers.

DarkCyber believes that like Exalead’s acquisition by Dassault or OpenText’s purchase of assorted search and retrieval systems, it will be interesting to watch how this acquisition works out.

Stephen E Arnold, April 5, 2019

Thoughts about Search: A Word That Means Almost Anything

April 3, 2019

We have long been frustrated that search technology has not come very far since its early days. Sure, Google has made tweaks over the years, but even many of those incremental changes are designed to maximize that company’s ad revenue. Now, though, AI technology may fundamentally change how we find information online. Forbes asks, “Might AI Spell the Death of Search?” Writer Michael Ashley observes:

“‘This is the first time since 1994 when the search paradigm has changed,’ says David Seuss, CEO of Northern Light, a Boston-based strategic research portal provider I consult with that offers a cloud-based SaaS to global enterprises. ‘In 1994, you went to a search box, filled in a query, hit the search button, and received a list of documents. You manually reviewed these, picking the most relevant item to download. Fast forward to 2019 and it’s still the same thing. Find me one other part of the tech landscape that has not changed since the ’90s, whether it be broadband, wireless, mobile cloud computing, artificial intelligence—everything has changed. Everything except search.’”

The man has a point. He also claims it is Millennials that are pushing for change. Older users were just so happy to search from their desks instead of in the library stacks, he posits, that most of them remain satisfied with 90s-style online search. The younger generation, though, find manually reviewing search results inefficient, and they recognize that a lot of good information tends to get buried later in the search results—especially as paid listings claim the top spots. Ashley writes:

“With the help of A.I., tasks once relegated to flesh and blood researchers can be now accomplished by computers. Drawing on the latter’s pattern-forming and predictive abilities, it can observe users’ actions, discerning their interests based on what they download, share, comment on or bookmark. Informed by this knowledge, an A.I. can proactively—and without manual prompting—recommend relevant content to users. Disrupting the traditional search model to its page ranking core, content can seek out the user instead of the other way around.”

Ah, Northern Light sails again with the AI flag whipping in the marketing breeze with help from puffs of insight in “Why Are So Many People Wasting Their Time with Web Search?” Trim the jib!

Cynthia Murrell, April 3, 2019

Echosec: Dark Web Search for Those Who Qualify

April 2, 2019

A Canadian company has devised a way to search the Dark Web without the hassle of the Tor browser or proxy servers. HotHardware reports: “Beacon, a Dark Web Search Engine Can Be Your Eyes in the Internet Underworld.” The catch—one must prove to the company behind Beacon, Echosec, that they have a legitimate reason to use the “Google of the Dark Web.” The intention, we’re told, is for organizations to monitor whether any of their sensitive data has made it onto a Dark Web marketplace. Reporter Rod Scher writes:

“This could include stolen corporate emails, company documents, personal info, or other such data that could be detrimental to a company, its brand, or its customers. After all, if your data has been compromised, it’s always better to know than not to know. …

We noted this statement:

“While [CTO Mike] Raypold notes that it is possible to misuse Beacon, since the tool makes it easier for users to locate data they might otherwise have difficulty finding, he says that the company has taken steps to mitigate that danger. ‘First, every Echosec customer must go through a use-case approval process to determine how the customer is using the application and to make sure they are in compliance with the vendors from whom the data Is sourced,’ says Raypold. ‘If a potential customer cannot pass the use-case approval process, they do not get access to the system.’ Second, the company has built automated tools and manual processes into its platform and into the company workflows to notify the Echosec team if users attempt to run searches that are in violation of their approved use case.”

Not only will Echosec know if a user violates their agreement, certain queries simply cannot be run through Beacon. The company shares their acceptable-use policy here, and it is thorough. Founded in 2013, Echosec is based in Vancouver, British Columbia. If you want to see selected screenshots of the system’s output, check out the Dark Cyber video for March 26, 2019, at this link.

Stephen E Arnold, February 27, 2019

Audio Search: Google Gets with the Program

March 27, 2019

Searching audio files has been difficult. Exalead, before Dassault bought the company, dabbled in audio search. One could key in a key word and jump to the segment of a file which contained the word or phrase. That was in 2006, maybe 2007. That was, despite my advanced age and inability to recall the innovations from search and retrieval wizards, more than a decade ago.

I read “Google Podcast in Episode Search Is Coming, Shows Now Being Fully Transcribed.” The write up reports:

Google Podcasts is now automatically generating transcripts of episodes and is using them as metadata to help listeners search for shows, even if they don’t know the title or when it was published.

I spoke with a person who translates audio recordings from one language into English. Here are some highlights from that chat:

  • “Even though I am a native speaker and fluent in English, it is very, very difficult to make out what some people are saying. I slow down the recording. I listen several times. I fiddle with the sound.”
  • “Accents pose a problem. For example, if a person is speaking one language but learned that language by osmosis, the pronunciation is often strange. In some cases, I have no idea what the person speaking is trying to communicate. Some people do not articulate or put the stresses where a native speaker puts them.
  • “Muddled sounds pose big challenges. I am not sure why but even modern recording equipment drops sounds. In some cases, rustling or tapping fuzzes what the person is saying.”

Net net: How accurate will the transcripts be? The answer is going to be like the accuracy scores for facial recognition? Maybe 50 percent to 75 percent accurate out of the gate. But better than nothing, when one wants to sell ads which match the translated key words, right? Will Steve Gibson stop creating transcripts of Security Now? Probably not.

Stephen E Arnold, March 27, 2019

Apple News: Another Search Fail?

March 26, 2019

Apple is a bit of a mystery to me. Example: Navigate to the Apple app store. Type in a word like “disc recovery”? What do you get? Which app does what? I need to recover now, not conduct a day long click, read, and compare. Now try: “bootable iso”? Helpful, right? A suggestion or a link to Apple help might be useful? Next what app is best for a particular task like hotel reservation? Give up yet? Now go to Garageband and enter in the help or search box, “no audio for the microphone”? Get any help from the help system? Tip: Look for audio HDMI in utilities on a Mac laptop. The volume sliders get reset. How? Who knows? What about finding a book in Apple iTunes’ audiobook section? Type in an author’s name but misspell it by omitting a letter? Learn to spell, gentle reader.

I thought of these example when I read “Apple News Plus Is a Fine Way to Read Magazines, but a Disappointment to Anyone Wishing for a Real Boost for the News Business.” I noted this statement in the article:

It’s actually a little hard to even find L.A. Times and Journal content in Apple News Plus because they don’t fit into the magazine UX it’s dependent on. Tap “Browse the Catalog” in Plus and you can scroll all day, but you’ll never find either paper, because they’re not contained in “issues.”

Findability. Stated another way, Apple is not particularly good at search and retrieval. Gloss and PR are covered. Finding information? Not on the radar in my opinion.

Stephen E Arnold, March 26, 2019

A Chat Search Feature for WhatsApp

March 21, 2019

Computers and anything with a hard drive or indexable database can be searched. Chat and text messages are stored are such devices, but good luck trying to find information with a quick search. WhatsApp is a popular message service and it might be on the verge of a breakthrough in chat technology: search. Mirror shares the story, “WhatsApp’s New Feature Will Make Searching Through Chats MUCH Easier.”

WhatsApp and other message services already have basic search functions that allow users to find specific messages, some keywords, and other information. The basic search still requires users to search through their messages to find information they cannot pull up with the function. That takes a long time and often does not yield successful results. WhatsApp will debut an advanced search mode that will outpace its basic search by a long run.

What will the new Whatsapp advanced search do?

“Advanced Search will let you filter your search, whether you’re looking for photos, links, audio, documents, GIFs or videos. The feature will also show your Search History. WABetaInfo explained: “If you tap a media file, for example Photos, WhatsApp will show all messages that contain an image!” Thankfully, your history an also be easily deleted, using a ‘Clear’ button. Your search results will also include a preview, meaning there’s no need to open the search result in order to see it.”

We believe this development could be an important one.

Whitney Grace, March 21, 2019

Has Search Evolved or Is It Spinning in Circles?

February 28, 2019

We have long been frustrated that search technology has changed into “we will tell you what you need to know.” Search is asking. Providing answers based on behavior is manipulation or a digital version of “mother knows best.”

Smart software or “AI” technology may fundamentally change how we find information online. Forbes asks, “Might AI Spell the Death of Search?” Writer Michael Ashley observes:

“‘This is the first time since 1994 when the search paradigm has changed,’ says David Seuss, CEO of Northern Light, a Boston-based strategic research portal provider I consult with that offers a cloud-based SaaS to global enterprises. ‘In 1994, you went to a search box, filled in a query, hit the search button, and received a list of documents. You manually reviewed these, picking the most relevant item to download. Fast forward to 2019 and it’s still the same thing. Find me one other part of the tech landscape that has not changed since the ’90s, whether it be broadband, wireless, mobile cloud computing, artificial intelligence—everything has changed. Everything except search.’”

The former consultant is doing consultant type thinking. There is a problem, and the consultant can ride to the rescue. A digital Lone Ranger can kill the useless system outputs. Well, that’s the story line.

Seuss claims it is Millennials that are pushing for change. Older users were just so happy to search from their desks instead of in the library stacks, he posits, that most of them remain satisfied with 90s-style online search.

The younger generation, though, find manually reviewing search results inefficient, and they recognize that a lot of good information tends to get buried later in the search results—especially as paid listings claim the top spots. Ashley writes:

“With the help of A.I., tasks once relegated to flesh and blood researchers can be now accomplished by computers. Drawing on the latter’s pattern-forming and predictive abilities, it can observe users’ actions, discerning their interests based on what they download, share, comment on or bookmark. Informed by this knowledge, an A.I. can proactively—and without manual prompting—recommend relevant content to users. Disrupting the traditional search model to its page ranking core, content can seek out the user instead of the other way around.”

Not surprisingly, the piece cites Northern Light’s platform as an example of the new, AI-powered possibilities: it quickly examines documents relevant to a query and presents a summary of pertinent information. The author ponders a time, close at hand, when the information we need finds us when we need it.

That sounds good, but I wonder—how can one be sure the algorithms are choosing wisely? What’s the old adage about consultants? Keep your hand on your wallet?

Cynthia Murrell, February 28, 2019

Free Web Search and Objective Results

February 8, 2019

I spotted a story from the Moscow Times called “Google Began Censoring Search Results in Russia, Reports Say.” I read:

Google began complying with Russian requirements and has deleted around 70 percent of the websites blacklisted by authorities, an unnamed Google employee told Russia’s Vedomosti business daily Wednesday. An unnamed Roskomnadzor source reportedly confirmed the information to the paper. On Thursday, a Roskomnadzor spokesman told the state-run RIA Novosti news agency that the regulator had established a “constructive dialogue” with Google over filtering content.

Let’s assume the report is accurate.

Is this the model for filtering content in online indexes which Google developed to comply with different countries’ laws and regulations?

If the Russian regulatory authority is “fully satisfied”, the Google system appears to be working.

Several questions crossed my mind; to wit:

  1. Has Google used this system to filter content in other countries; for example, the US, Brazil, or Iran?
  2. Does the system work with acceptable reliability? Some potentially objectionable can be located via a Google image query to cite one example?
  3. What is the economic payoff of Google find a solution to its pre-filtering disputes with Russia?

Interesting, particularly when one asks the question, “Am I getting accurate information when running a query on Google, regardless of the country in which the query appears to have been launched?”

If search results are shaped, what does one do to locate potentially useful information? One answer, I suppose, is to pay for commercial online access. Another may be to assume that what’s online IS the correct data set? One could ask those in one’s social network, but that too may be filtered.

But free services are free. Free services may have other characteristics as well. What does “free” mean? Hmmm.

Stephen E Arnold, February 8, 2019

« Previous PageNext Page »

  • Archives

  • Recent Posts

  • Meta