Whois: A Frustrating Search Experience

February 3, 2018

It’s a complicated moment to be starting any business, especially if you have a domain finding WHOIS site. These tools for tracking down owners of web sites might be under fire, but that hasn’t stopped Zeit from trying. We discovered this startup at its website, under the title: “Domains Search for Web: Instant, Serverless, And Global.”

According to the site:

“We also knew we wanted the user’s search to feel instant and effortless. As the user types, we want to start dispatching queries to get the data from our services as soon as possible, and similarly render them as soon as they become available to the user. With this in mind, we would have to embrace asynchrony at every level of the stack.”

We love Zeit’s user-friendly approach. We just worry how long a business like this can stay afloat. This stems from the recent news that WHOIS sites are under threat of going dark. The General Data Protection Regulation could drastically change or shutter this niche online industry. We are certain that where regulation closes a door, innovators open a window, but we’d be sweating a little if we started a WHOIS site today.

Patrick Roland, February 3, 2018

Google Has Its Own Browser History

February 2, 2018

Have you ever wanted to look at your past Google searches, but did not want to go through your browser history?  Google has a new feature that will allow users to see their recent searches.  Search Engine Land reports that “Google Home Page Search Box Now Shows You Recent Searches By Default” and it is a super option.  Super annoying, that is.  Whenever you use Google search, a default dropdown appears before you even enter text into the search box.

Even former Google search executive Matt Cutts said this “new feature” is super annoying.   He tried to opt out of it, but could not find the opt-out option.  Search Engine Land sent Google an email to see what the scoop was.  They discovered that even Google found the automatic browser history box annoying.  Here is Google’s official response:

Google has confirmed with Search Engine Land that this is not the behavior they want and it was likely a bug. “We launched the ability to see past searches by clicking the search box earlier this year. However, past searches should not be appearing immediately on page load, so we are working to fix this issue,” a Google spokesperson told Search Engine Land.

All right, Google!  You admitted a mistake and provided a solution.  Now can you do something about the fake news stories that are plaguing Google News?

Whitney Grace, February 2, 2018

Sinequa Continues Its Repositioning Effort

January 30, 2018

Cognitive search is a pretty murky pool in terms of public understanding. Recently, Laurent Fanichet attempted to bridge that gap and also stake a claim for his company, Sinequa, as a leader in the field. However, we have questions. His editorial was featured in the French magazine, Journal Du Net called “Cognitive Search: A Valuable Tool to Fight Terrorism.”

According to the story (translated into English):

“Access to the right information at the right time is the promise of cognitive search. This artificial intelligence (AI) treats the “mega-data” of big data, to extract, practically in real time, relevant and contextualized information, from a multitude of sources and complex data. Cognitive search intelligent data processing combines with machine learning…”

It is exciting that France is attempting to use AI to solve its terror threats. While the plans laid out by Fanichet sound wonderful, it will still take quite a bit of work to surpass the other global leaders in the field. Namely, IBM’s Watson, which recently declared it was actually surpassing human beings in reading comprehension. Power like that seems like it would be a huge advantage to counter terror organizations wading through files and reports on suspects. We hope startups see this as a challenge to improve their power and that we all win in the end.

Sinequa continues to associate itself with the hot trends in content processing. Its roots in search and retrieval are now a secondary or tertiary part of the company’s 2018 marketing it appears.

Patrick Roland, January 30, 2018

Big Data, Search, and Artificial Intelligence: Quite a Mash Up

January 29, 2018

I read a consultant-technology mash up marketing essay. The write up is “Big Data and Search: The Time for Artificial Intelligence Is Now.” The write up is mostly jargon. I wonder if the engineer driving the word processing train pushed the wrong button.

Image result for train wreck

Here are the “keywords” I noted in the write up:

Analytics
Artificial intelligence
Big Data
Blockchain
Business action and business use cases
Chatbots
Cognitive (presumably not the IBM which maybe doesn’t work as advertised)
Consumer services
Customer / citizen facing (some government speak too)
Digital assistants
False or biased results (yes, fake news)
Keywords
Machine learning
Natural language processing
Platforms
Real time results
Resources
SQL databases
Search
Transparency
Trust
Video

Read more

Google Takes On Russia In Epic Fight

January 26, 2018

It is foolish to challenge Russia to a fight.  Napoleon lost his throne because he tried to invade Russia during winter.  Hitler pissed off Stalin during World War II, so Russia switched sides, then the Nazis invaded in winter.  It is a really bad idea to invade Russia, especially in winter.  Google is duking it out with Russia, but this war is digital so maybe Alphabet stands a chance.  The Washington Report discusses the wired Cold war in, “Google Is Getting Pulled Into A Fight With Russia Over RT And Sputnik.”  The real battle is with two Russian news outlets RT and Sputnik, but they are owned by the government.

The reason for battle is due to Russians apparent and supposed influence on US politics-most notably, the 2016 presidential election.  Russia is accused of spreading fake news through RT and Sputnik.  News outlets like Google News picks these up and are pushed to US readers.  Russia is threatening to retaliate if Google pushes RT and Sputniks’ ratings lower in search rankings.  Google decided to curb fake news stories that could be weaponized information against the US.  Russia’s RT and Sputnik are amongst those that distribute fake news.

When asked why Russian-backed sites enjoy favorable placement on Google’s platforms, Schmidt said, ‘We are working on detecting this kind of scenario … de-ranking those kinds of sites. It’s basically RT and Sputnik are the two.’ He added that the company does not want to ban the outlets. And according to Google, the company does not re-rank individual websites.

Russia is, of course, is not happy.  They claim that Google is being discriminatory and are demanding that Eric Schmidt explain himself.  Google just wants to curb fake news and also make sure their platform is not used for nefarious purposes.  Good luck, Google.  Russia is hard to defeat, but how do they stand on the digital front?

Whitney Grace, January 26, 2018

SEO Relevance Killer: Semantic Search

January 22, 2018

I am not sure if this Forbes’ write up is “real” journalism or just a pay-to-play story. Either way, it makes clear that the trajectory of search has been to destroy the once useful methods for determining precision and recall as part of an effort to explain or define relevance.

The write up which made me reach for my bottle of Tum’s is “Why And How Semantic Search Transformed SEO For The Better.”

Here’s a passage I highlighted in bilious yellow:

instead of finding exact matches for keywords, Google looks at the language used by a searcher and analyzes the searcher’s intent. It then uses that intent to find the most relevant search results for that user’s intent. It’s a subtle distinction, but one that demanded a new approach to SEO; rather than focusing on specific, exact-match keywords, you had to start creating content that addressed a user’s needs, using more semantic phrases and synonyms for your primary targets.

So what’s this mean in actual practice.

Navigate to Google and run this query with zero quotes and no additional words or phrases: 4iq Madrid.

Now look at the results:

image

The information is about the firm’s US office. The company was founded in Madrid and has some R&D facilities in the high-tech section of that city across from what used to be a hunting preserve for a former government leader. No address is Las Rozas, no LinkedIn listings of staff in Madrid, zip.

The world of search as described in the Forbes’ flag waving prose is great for expanding a user’s query. The purpose is not relevance, providing answers, or delivering on point results.

The purpose is to make it possible to broaden a query so more and usually less relevant ads can be displayed.

If you want relevance in search, you have to work very hard.

For example, to get the Spanish information related to 4iq, you set up a proxy in Spain. Google no longer makes it easy to query its index for content in a language different from the one Google decides you speak based on where you are in the world the moment you run your query. Then you enter the query and peruse the Spanish Google index results.

Yeah, that’s something the average eighth grader will do when writing an essay about Madrid. I know lots of adults who cannot perform this workaround.

The Forbes’ essay states:

The SEO community is better off focusing on semantic search optimization, rather than keyword-specific optimization. It’s forcing content producers to produce better, more user-serving content, and relieving some of the pressure of keyword research (which at times is downright annoying).

Why even bother providing results even marginally related to the user’s query. Do what the NFL Sirius Radio Network does. Run ads all the time. Football is a bit of distraction to the real business of pay-to-play information.

Ads, ads, ads.

Stephen E Arnold, January 22, 2018

Google Busts Fake News

January 22, 2018

Ever since fake news stories swamped the 2016 election and other events in 2017, the powers that be have pressured Google and other news sources to stop all the fake headlines.  Soyacincau shares how Google plans to make take down the fall information in the article, “This Is How Google’s Clamping Down On Fake News.”  Google has partnered with the International Fact-Checking Network (IFCN) to ensure that its search results and news stories are accurate.

What exactly is the IFCN?

The IFCN is a nonpartisan organization run by The Poynter Institute that advocates accuracy in online articles, and they hold an annual fact-checking conference, alongside funding fellowships and training for would-be fake news busters. Google has plans to work with them in a handful of ways to help fix the fake news epidemic, one being to offer free fact-checking tools, expanding their code of principles into new regions and generally increasing the number of verified fact-checkers worldwide.

Google is really taking the lead in quashing fake news. They will host workshops, coaching, and financial assistance for new fact-checking organizations.  They also plan to translate IFCN’s code of principles into ten languages, provide training sessions, and access to an engineering time bank.  Google partnered with Snopes and Politifact in the past, but the fake news keeps coming.  Hopefully, this will have an impact.

Whitney Grace, January 22, 2018

The Consequences of an Echo Chamber for Google Search

January 19, 2018

I read “Google Memory Loss.” The author is a fellow who created a text search engine, helped found OpenText, did some time at the GOOG, and swam in the Semantic Web pond.

The write up provides useful information to anyone wondering why a Google query for a company name goes off the rails or why the Google suggestions have zero relevance to the user’s query.

There were some important points in the write up; for example:

  1. Search is “crushingly expensive”. This means that when Google needs to cut costs and maximize revenue, the company will make business decisions. The decisions may favor advertising revenues. Maybe.
  2. Archival information is not popular. The reasoning may be, “Why index this stuff or revisit the archive to figure out if there is “new information” in the old archive? If old information is not important, what about unpopular sites the National Railway Retirement Board Web content?
  3. Google is into the timely, not the research-centric type of query.
  4. Dr. Bray uses Google but supplements the look up by using very un-Googley search systems.

Here in Harrod’s Creek, we love the Google. Filtered, ad-tailored results are perfect for looking up KY Fry or the NCAA rules committee’s favorite team, the Louisville Cardinals.

A search for Cardinals returns this results page this morning:

image

Lots of Googlers love March Madness. Too bad if a 7th grader has to look up information about cardinals with feathers.

Stephen E Arnold, January 19, 2018

We Are Without a Paddle on Growing Data Lakes

January 18, 2018

The pooling of big data is commonly known as a “data lake.” While this technique was first met with excitement, it is beginning to look like a problem, as we learned in a recent Info World story, “Use the Cloud to Create Open, Connected Data Lakes for AI, Not Data Swamps.”

According to the story:

A data scientist will quickly tell you that the data lake approach is a recipe for a data swamp, and there are a few reasons why. First, a good amount of data is often hastily stored, without a consistent strategy in place around how to organize, govern and maintain it. Think of your junk drawer at home: Various items get thrown in at random over time, until it’s often impossible to find something you’re looking for in the drawer, as it’s gotten buried.

This disorganization leads to the second problem: users are often not able to find the dataset once ingested into the data lake.

So, how does one take aggregate data from a stagnant swamp to a lake one can traverse? According to Scientific Computing, the secret lies in separating the search function into two pieces, finding and searching. When you combine this thinking with Info World’s logic of using the cloud, suddenly these massive swamps are drained.

Patrick Roland, January 18, 2018

 

 

Some Think Google Is No Longer the King of Search

January 18, 2018

Google is much more than a search engine, it’s a verb. Like Xerox and Kleenex before it, that says something about the hierarchy of their business. However, some are claiming it’s time for alternatives (In search…not in copy-making or nose blowing). This, according to a recent Eyerys story, “Searching Beyond Google: When The Internet is Too Big for a Single Search Engine.”

According to the story:

[T]he information you need might be hidden from the tools you use. Either because the webmasters wanted that to happen by blocking search engines’ access, or inaccessible by search engine because they are behind paywalls or login forms, or lies inside the deep web.

To access them, you need more specific tools other than search engines, and look at the right place, with the right privilege.

If and only if you still can’t find the information you’re looking for, it’s either not available on the internet, or doesn’t exist in the first place.

Or, they could be hidden inside database, encrypted, lies deeper and accessible to only using certain IPs, classified methods or privilege. In this case, it’s not publicly available though it is there. You need to be a hacker to get yourself into that, and that is certainly illegal by any means.

While the story has its heart in the right place, recommending alternative engines, like DuckDuckGo, and giving tips on using social media for search, it’s not really too believable. For one, humans are creatures of habit and they are stuck on the single search engine method. This is wishful thinking, and actually makes sense in places, but we can’t see it happening.

Patrick Roland, January 18, 2018

« Previous PageNext Page »

  • Archives

  • Recent Posts

  • Meta