Basho Enters Ring With New Data Platform
June 18, 2015
When it comes to enterprise technology these days, it is all about making software compliant for a variety of platforms and needs. Compliancy is the name of the game for Basho, says Diginomica’s article, “Basho Aims For Enterprise Operational Simplicity With New Data Platform.” Basho’s upgrade to its Riak Data Platform makes it more integration with related tools and to make complex operational environments simpler. Data management and automation tools are another big seller for NoSQL enterprise databases, which Basho also added to the Riak upgrade. Basho is not the only company that is trying to improve NoSQL enterprise platforms, these include MongoDB and DataStax. Basho’s advantage is delivering a solution using the Riak data platform.
Basho’s data platform already offers a variety of functions that people try to get to work with a NoSQL database and they are nearly automated: Riak Search with Apache Solr, orchestration services, Apache Spark Connector, integrated caching with Redis, and simplified development using data replication and synchronization.
“CEO Adam Wray released some canned comment along with the announcement, which indicates that this is a big leap for Basho, but also is just the start of further broadening of the platform. He said:
‘This is a true turning point for the database industry, consolidating a variety of critical but previously disparate services to greatly simplify the operational requirements for IT teams working to scale applications with active workloads. The impact it will have on our users, and on the use of integrated data services more broadly, will be significant. We look forward to working closely with our community and the broader industry to further develop the Basho Data Platform.’”
The article explains that NoSQL market continues to grow and enterprises need management as well as automation to manage the growing number of tasks databases are used for. While a complete solution for all NoSQL needs has been developed, Basho comes fairly close.
Whitney Grace, June 18, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
Jargon: Chief Digital Officer
June 17, 2015
Navigate to “‘I Can Imagine a World Where We Have CDOs Instead of CIOs’ – Says Atkins CIO and CDO.” Sounds like a coinage designed to remove an important corporate function from reality.
I visited a company without a chief information officer. Hold that. The company did not have an information technology manager. Instead several individuals did different computery things. One of the senior managers told me, “We need to move everything to the cloud.”
Okay. The statement is easy to make, but without someone who knows about technology options, migration methods, work processes, and making technology do what users need to do their jobs.
The write up assumes that a company will have a CDO or chief digital officer. Well, many some companies have the funds to add a full time equivalent to manage “digital” zeros and ones. For me, the notion of a CDO is just a bit of a priority and responsibility problem.
The write up does not agree, stating:
The CIO and CDO of engineering juggernaut Atkins, Richard Cross, has weighed into the chief digital officer debate, suggesting that CDOs could one day replace CIOs.
The write up adds:
Cross, who told Computing yesterday that his job is to ensure that the firm is “digital by default” by 2020, said that despite some CIOs believing that the CDO role is just a ‘fad’, it could in fact replace the CIO position.
So, one person wants to have a more lofty title? I understand that, but it may be useful for the article to define the acronyms, provide some facts or semi facts about why the phrase “information technology manager” is not useful. I understand that there are some people who believe they know about social media, programming, marketing strategy and tactics, system architecture, and computing infrastructure. I have met a couple of people with this span of expertise.
The problem is that most companies today have pretty basic and quite difficult technical problems. These range from the Office of Personnel Management type of security issues to figuring out why the cost of the company’s Web site is rising more rapidly than any other information technology expense item. Search probably is a burr under employees’ saddles. The company is clueless when it comes to dealing with analytic methods which can be used by an employee quickly and easily.
The propensity to generate dorky buzzwords is blurring the very real technical work that organizations must do.
CDO? Forget that. Focus on the basics. Can employees locate the final version of the CEO’s PowerPoint used three days ago? How many organizations will have today’s staff on the payroll in five years? Heck, how many companies in business today will be viable in five years? My hunch is that doing the work is more important than printing a business card with a different title.
Stephen E Arnold, June 17, 2015
Video and Image Search In the News
June 17, 2015
There’s been much activity around video and image search lately. Is it all public-relations hype, or is there really progress to celebrate? Here are a few examples that we’ve noticed recently.
Fast Company reports on real-time video-stream search service Dextro in, “This Startup’s Side Project Scans Every Periscope Video to Help You Find the Best Streams.” Writer Rose Pastore tells us:
“Dextro’s new tool, called Stream, launches today as a mobile-optimized site that sorts Periscope videos by their content: Cats, computers, swimming pools, and talking heads, to name a few popular categories. The system does not analyze stream text titles, which are often non-descriptive; instead, it groups videos based only on how its algorithms interpret the visual scene being filmed. Dextro already uses this technology to analyze pre-recorded videos for companies … but this is the first time the two-year-old startup has applied its algorithms to live streams.”
Meanwhile, ScienceDaily reveals an interesting development in, “System Designed to Label Visual Scenes Turns Out to Detect Particular Objects Too.” While working on their very successful scene-classification tool, researchers at MIT discovered a side effect. The article explains that, at an upcoming conference:
“The researchers will present a new paper demonstrating that, en route to learning how to recognize scenes, their system also learned how to recognize objects. The work implies that at the very least, scene-recognition and object-recognition systems could work in concert. But it also holds out the possibility that they could prove to be mutually reinforcing.”
Then we have an article from MIT’s Technology Review, “The Machine Vision Algorithm Beating Art Historians at Their Own Game.” Yes, even in the highly-nuanced field of art history, the AI seems to have become the master. We learn:
“The challenge of analyzing paintings, recognizing their artists, and identifying their style and content has always been beyond the capability of even the most advanced algorithms. That is now changing thanks to recent advances in machine learning based on approaches such as deep convolutional neural networks. In just a few years, computer scientists have created machines capable of matching and sometimes outperforming humans in all kinds of pattern recognition tasks.”
Each of these articles is an interesting read, so check them out for more information. It may be a good time to work in the area of image and video search.
Cynthia Murrell, June 17, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
Magic May Not Come From Pre-Made Taxonomies
June 17, 2015
There are hundreds of companies that advertise they can increase information access, retrieval and accuracy for enterprise search by selling prefabricated taxonomies. These taxonomies are industry specific and are generated by using an all-or-nothing approach, rather than individualizing them for each enterprise search client. It turns out that the prefabricated taxonomies are not guaranteed to help enterprise search; in fact, they might be a waste of money. The APQC Blog posted “Make Enterprise Search Magical Without Money” that uses an infographic to explain how organizations can improve their enterprise search without spending a cent.
APQC found that “best-practice organizations don’t have significantly better search technology. Instead, they meet employees’ search needs with superior processes and approaches the content management.”
How can it be done?
The three steps are quite simple:
- Build taxonomies that reflect how people actually think and work-this can be done with focus groups and periodically reviewing taxonomies and metadata. This contributes to better and more effective content management.
- Use scope, metadata, and manual curation to ensure search returns the most relevant results-constantly the taxonomies for ways to improve and how users are actually users search.
- Clear out outdated, irrelevant, and duplicate content that’s cluttering up your search results-keep taxonomies updated so they continue to deliver accurate results.
These are really simple editing steps, but the main problem organizations might have is actually implementing the steps. Will they assign the taxonomy creation task to the IT department or information professionals? Who will be responsible for setting up focus groups and monitoring usage? Yes, it is easy to do, but it takes a lot of time.
Whitney Grace, June 17, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
Apple: Search Not Important. The Mom Approach Is.
June 16, 2015
Imagine the difficulty of pitching a six figure information retrieval service today. On one hand, there are suitable open source alternatives for search. On the other hand, there are users who prefer to do the 7-11 thing; that is, find a convenient source of good enough stuff.
That’s the Apple approach, which in my opinion, denigrates the importance of proactive online information access. Take what’s provided. Want a peach iced tea. The 7-11 has a lemon iced tea. Take what’s available. Good enough.
Navigate to “Apple News Curation Will Have Human Editors and That Will Raise Important Questions.” I am not too concerned about publishers. That crowd lost my interest decades ago. What does interest me is that the article does not put the notion of humans making choices in the context of customers just accepting what an Apple, Facebook, or Google provides.
I did highlight this passage:
Curating lists of apps is one thing, but having Apple employees curate the news flow could be more controversial depending on how the company ultimately works with select publishers and surfaces content from its preferred sources.
But associating the loss of interest in or the ability to look for information, figure out what is accurate or more nearly accurate, is not associated with the research function. Search vendors take another whack on the snoot.
It is tough to sell search when the customers prefer to drop by a convenience store and let that experience dictate the set of things from which one chooses.
Stephen E Arnold, June 16, 2015
Instagram Promises Improved Search
June 15, 2015
Frustrated with the abysmal search functionality at Instagram? Rejoice, for Wired tells us that, soon, “Better Search Will Transform How You Use Instagram.” Instagram’s cofounder Mike Krieger admitted that it is currently difficult for users to discover many photos that would interest them, but also asserted the company knows it must do better. Why, then, wasn’t search a priority earlier in the company’s history, and why are they talking about this now? Writer Julia Greenberg informs us:
“All that could soon change, given that Instagram has Facebook on its team. The social media titan, which acquired Instagram in 2012, is targeting Google itself as it develops a robust search system to make both its own platform and the whole web searchable through its own app. But while Facebook users post links, status updates, news, opinions, and photos, Instagram is almost completely visual. That means Instagram needs to teach its search engine to see. Krieger said his team has worked on a project to better understand how to automate sight. ‘Computer vision and machine learning have really started to take off, but for most people the whole idea of what is a computer seeing when it’s looking at an image is relatively obscure,’ Krieger said.”
Ah, prodded by their Facebook overlords; makes sense. Instagram isn’t ready to hand the site over to algorithms entirely, though. Their human editorial team still works to help users find the best images. Apparently, they feel humans are more qualified to choose photos with the most emotional impact (go figure). Krieger sees Instagram developing into a “storytelling” destination, the place users to go connect with world events through images: “the real-time view into the world,” as Krieger puts it. We agree that implementing an effective search system should help toward that goal.
Cynthia Murrell, June 15, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
Solcara Is The Best! Ra Ra Ra!
June 15, 2015
Thomson-Reuters is a world renowned news syndication, but the company also has its own line of search software called Solcara Federated Search also known as Solcara SolSearch.” In a cheerleading press release, Q-resolve highlights Solcara’s features and benefits: “Solcara Legal Search, Federated Search And Know How.” Solcara allows users to search multiple information resources, including intranets, databases, Knowledge Management, and library and document management systems. It returns accurate results according to the inputted search terms or keywords. In other words, it acts like an RSS feed combined with Google.
Solcara also has a search product specially designed for those in the legal profession and the press release uses a smooth reading product description to sell it:
“Solcara legal Search is as easy to use as your favorite search engine. With just one search you can reference internal documents and approved legal information resources simultaneously without the need for large scale content indexing, downloading or restructuring. What’s more, you can rely on up-to-date content because all searches are carried out in real time.”
The press release also mentions some other tools, case studies, and references the semantic Web. While Solcara does sound like a good product and comes from a reliable new aggregator like Thomson-Reuters, the description and organization of the press release makes it hard to understand all the features and who the target consumer group is. Do they want to sell to the legal profession and only that group or do they want to demonstrate how Solcara can be adapted to all industries that digest huge information amounts? The importance of advertising is focusing the potential buyer’s attention. This one jumps all over the place.
Whitney Grace, June 15, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
More Semantic Search and Search Engine Optimization Chatter
June 10, 2015
I read “Understanding Semantic Search.” I had high hopes. The notion of Semantic Search as set forth by Tim Bray, Ramanathan Guha, and some other wizards years ago continues to intrigue me. The challenge has been to deliver high value outputs that generate sufficient revenue to pay for the plumbing, storage, and development good ideas can require.
I spent considerable time exploring one of the better known semantic search systems before the company turned off the lights and locked its doors. Siderean Software offered its Seamark system which could munch on triples and output some quite remarkable results. I am not sure why the company was not able to generate more revenue.
The company emphasized “discovery searching.” Vivisimo later imitated Siderean’s user input feature. The idea is that if a document required an additional key word, the system accepted the user input and added the term to the index. Siderean was one of the first search vendors to suggest that “graph search” or relationships would allow users to pinpoint content processed by the system. In the 2006-2007 period, Siderean indexed Oracle text content as a demonstration. (At the time, Oracle had the original Artificial Linguistics’ technology, the Oracle Text function, Triple Hop, and PL/SQL queries. Not surprisingly, Oracle did not show the search acquisition appetite the company demonstrated a few years later when Oracle bought Endeca’s ageing technology, the RightNow Netherlands-originated technology, or the shotgun marriage search vendor InQuira.)
I also invested some time on behalf of the client in the semantic inventions of Dr. Ramanathan Guha. This work was summarized in Google Version 2.0, now out of print. Love those print publishers, folks.
Dr. Guha applied the features of the Semantic Web to plumbing which, if fully implemented, would have allowed Google to build a universal database of knowledge, serve up snippets from a special semantic server, and perform a number of useful functions. This work was done by Dr. Guha when he was at IBM Almaden and at Google. My analysis of Dr. Guha’s work suggests that Google has more semantic plumbing than most observers of the search giant notice. The reason, I concluded, was that semantic technology works behind the scenes. Dragging the user into OWL, RDF, and other semantic nuances does not pay off as well as embedding certain semantic functions behind the scenes.
In the “Understanding Semantic Search” write up, I learned that my understanding of semantic search is pretty much a wild and crazy collection of half truths. Let me illustrate what the article presents as the “understanding” function for addled geese like me.
- Searches have a context
- Results can be local or national
- Entities are important; for example, the White House is different from a white house
So far, none of this helps me understand semantic search as embodied in the 3WC standard nor in the implementation of companies like Siderean or the Google-Guha patent documents from 2007 forward.
The write up makes a leap from context to the question, “Are key words still important?”
From that question, the article informs me that I need to utilize schema mark up. These are additional code behinds which provide information to crawlers and other software about the content which the user sees on a rendering device.
And that’s it.
So let’s recap. I learned that context is important via illustrations which show Google using different methods to localize or personalize content. The write up does not enumerate the different methods which use browser histories, geolocation, and other signals. The write up then urges me to use additional mark up.
I think I will stick with my understanding of semantics. My work with Siderean and my research for an investment bank provided a richer base of knowledge about the real world applications of semantic technology. Technology, I wish to point out, which can be computationally demanding unless one has sufficient resources to perform the work.
What is happening in this “Understanding Semantic Search” article is an attempt to generate business for search engine optimization experts. Key word stuffing and doorway pages no longer work very well. In fact, SEO itself is a problem because it undermines precision and recall. Spoofing relevance is not my idea of a useful activity.
For those looking to semantics to deliver Google traffic, you might want to invest the time and effort in creating content which pulls users to you.
Stephen E Arnold, June 9, 2015
Online Shopping Is Too Hard
June 10, 2015
Online shopping is supposed to drive physical stores out of business, but that might not be the case if online shopping is too difficult. The Ragtrader article, “Why They Abandon” explains that 45 percent of Australian consumers will not make an online purchase if they experience Web site difficulties. The consumers, instead, are returning to physical stores to make the purchase. The article mentions that 44 percent believe that traditional shopping is quicker if they know what to look for and 43 percent as prefer in-store service.
The research comes from a Rackspace survey to determine shopping habits in New Zealand and Australia. The survey also asked participants what other problems they experienced shopping online:
“42 percent said that there were too many pop-up advertisements, 34 percent said that online service is not the same as in-store and 28 percent said it was too time consuming to narrow down options available.”
These are understandable issues. People don’t want to be hounded to purchase other products when they have a specific item in mind and thousands of options are overwhelming to search through. Then a digital wall is often daunting if people prefer interpersonal relationships when they shop. The survey may pinpoint online shopping weaknesses, but it also helps online stores determine the best ways for improvement.
“ ‘This survey shows that not enough retailers are leveraging powerful and available site search and navigation solutions that give consumers a rewarding shopping experience.’ ”
People shop online for convenience, variety, lower prices, and deals. Search is vital for consumers to narrow down their needs, but if they can’t navigate a Web site then search proves as useless as an expired coupon.
Whitney Grace, June 10, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph
Free Version of InetSoft Style Scope Agile Edition Available
June 10, 2015
The article titled InetSoft Launches Style Scope Agile Edition for Dashboarding and Visual Analytics on PRWeb tells of a free version of InetSoft’s application for visualizing analysis. Business users will gain access to an interactive dashboard with an easy-to-use drag and drop sensibility. The article offers more details about the launch:
“Advanced visualization types ideal for multi-dimensional charting and point-and-click controls like selection lists and ranger sliders give greater abilities for data exploration and performance monitoring than a simple spreadsheet offers. Any dashboard or analysis can be privately shared with others using just a browser or a mobile device, setting the application apart from other free BI tools… Setting up the software will be straightforward for anyone with power spreadsheet skills or basic knowledge of their database.”
Drawbacks to the free version are mentioned, such as being limited to two concurrent users. Of course, the free version is meant to “showcase” the company’s technology according to CMO Mark Flaherty. There is a demo available, to check out the features of the free application. InetSoft has been working since 1996 to bring users intuitive solutions to business problems. This free version is specifically targeted at smaller businesses who might be unable to afford the full application.
Chelsea Kerwin, June 10, 2015
Sponsored by ArnoldIT.com, publisher of the CyberOSINT monograph