MarkLogic and the New York Times
December 2, 2013
On Saturday, November 30, 2013, The New York Times published “Health Care Site Rushing to Make Fixes by Sunday.” As I now know, mission accomplished. But there was no aircraft carrier, brass band, or flag. (Here’s the link to the online story, but like so many “real” journalistic efforts, the link can go dead and you will have to hunt for a November 30, 2013 Times and look on pages A 1 with a jump to page A 12. Penguin, there is nothing I care to do about the link. Sorry.)
I wanted to document this passage from the Times’ story about MarkLogic. What’s interesting is that the company gets little attention from other “real” journalists. I suppose if I were curious, I would attempt to answer the question, “Why?”
I am not curious. Here’s what snagged my attention on the 30th:
Gary C. Boom, the chief executive officer of another vendor, MarkLogic, said his firm is also moving its software to differently configured servers.
The idea is from MarkLogic’s neighbor in Silicon Valley, Oracle. A few years ago, Oracle wrote a white paper banging on MarkLogic’s technology. You can find a copy of that analysis in “Mark Logic XML Server 4.1.” I wrote about the tempest in “A Coming Dust Up between Oracle and MarkLogic?”
The Times’ story continued:
MarkLogic provided the technology for the database that serves as the system’s internal filing cabinet and index.
The story does not make clear whether MarkLogic is an XML server that acts like a junction box among the moving parts of the HealthCare.gov site, a data management system interacting with Oracle’s technology, or a search engine for the Web site. MarkLogic positions its technology as doing each of these functions plus analytics, business intelligence, customer relationship management, publishing, and probably some other functions as well.
the Times quotes Mr. Bloom as having said:
I am picking up my house and moving it to a better foundation next door,” he [Mr. Bloom] said in an interview. He said MarkLogic is performing up to standard, but “the network and the storage systems are not properly sized and not properly run.”
It is not clear to me which vendor is providing the storage systems. Is it MarkLogic or is it another vendor such as Oracle, a company apparently unimpressed with some of MarkLogic’s technology if I understand the Oracle white paper.
The Times added:
“Another critical problem involved the specifications for a major computer switch that connects the computer services through a security firewall to the Internet. Mr. Bloom said it has been upgraded from four gigabytes a second to 60 [gigabytes a second]. He said the earlier speed was the equivalent of employing four security staffers to screen Heathrow Airport’s passengers. “The line to get through,” he said, “would go back to the city of London.”
I am not sure how these issues did not become known to the vendors pushing data through the system, but apparently, the 15X shortfall was not noticed. I wonder how many home builders move a completed house to a new foundation. Also, what if the security folks at Heathrow are more or maybe less efficient than those located where HealthCare.gov is?
I will keep my eye on this issue because MarkLogic has been emphasizing that it offers a search system. Where there is a search vendor, there seems to be some activity of interest. And where there are MarkLogic and Oracle, there may be some interesting discussion between the parties.
Stephen E Arnold, December 2, 2013
IT Policies Encompass Ergonomics Along with Help Desks and Google Glass
November 28, 2013
The collection of policies on ZDNet is titled 90+ IT Policies At Your Fingertips, Ready For Download. With a subscription the article offers templates for many of the policies necessary to govern IT and other workplace necessities. Of course, the pitch is a bit of a circular argument (instead of paying someone hundreds of dollars for a policy, pay us hundreds of dollars for many policies) but more important is the implied state of governance.
The article explains in the blurb provided for the IT Consultant Conduct Policy:
“By the very nature of their business, IT consultants–who have both access and exposure to a company’s most sensitive data–must be held to the highest ethical standard. Ethics are critical, not only to the consultant’s company, but to the client organization. In addition to ethical behavior, a consultant must maintain appropriate behavior at all times. This IT Consultant Conduct Policy outlines an example code of conduct and a code of conduct for consultants.”
Looking ahead, there is even a policy available for the day when employees start bringing Google Glass to work. This is a technology that provides for sneaky recording of audio and visual, (or perhaps not so sneaky, more of a hiding in plain sight maneuver) and so employers might do well to think ahead. Not included in the long list is any sort of editorial policy, which we thought might be making a comeback. Maybe not.
Chelsea Kerwin, November 28, 2013
Sponsored by ArnoldIT.com, developer of Augmentext
A New Approach to Enterprise Search – Information Governance
September 2, 2013
The article titled Solving the Inadequacies and Failures in Enterprise Search on AIIM addresses the ongoing problems with metadata infrastructure. This problem translates into a major hurdle for all those organizations hoping to utilize enterprise search. According to the author, with an appropriately stringent framework in place, progress is possible. This includes the ability to use search in order to implement actual helpful tools like records management, migration and data privacy. The article states,
“An information governance approach that creates the metadata infrastructure framework to encompass automated intelligent metadata generation, auto-classification, and the use of goal and mission aligned taxonomies is required. From this framework, intelligent metadata enabled solutions can be rapidly developed and implemented. Only then can organizations leverage their knowledge assets to support search, litigation, eDiscovery, text mining, sentiment analysis, and business intelligence. The need for organizations to access and fully exploit the use of their unstructured content won’t happen overnight. “
With these improvements in place, the article suggests that enterprise search will be fixed. Decades of floundering will come to an end if this assertion is proven correct. Features and “bells and whistles” will no longer reside in the place of actual information and wisdom gleaned from content, what search is actually supposed to provide.
Chelsea Kerwin, September 02, 2013
Sponsored by ArnoldIT.com, developer of Augmentext
Keep It Separated
November 14, 2012
Respect data decentralization. That is the key to “The Challenge of Defensible Deletion of Distributed Legacy Data,” according to the eDiscovery Law & Tech Blog at X1 Discovery.
Blogger John Patzakis submits that, for large enterprises with data subject to governance requirements, centralization can make it hard to create a defensible retention schedule. Most archiving systems require that bits of data be pulled from their cozy homes on group and departmental silos and dumped into a central system before any retention and management process can even begin. He writes:
“Forcing centralization on these many pockets of productivity is highly disruptive and rarely effective due to scalability, network bandwidth and other logistical challenges. So what this leaves is the reality that for any information remediation process to be effective, it must be executed within these departmentalized information silos.”
Not surprisingly, Patzakis recommends one of X1’s own products, X1 Rapid Discovery, to do just that. The company has produced an hour-long webinar outlining their method. According to the article:
“X1 Rapid Discovery represents game-changing technology to effectuate the remediation of distributed legacy data due to its ability to install on demand virtually anywhere in the enterprise, including remote data silos, its light footprint web browser access, and intuitive interface. X1 Rapid Discovery enables for effective assessment, reporting, categorization and migration and remediation of distributed information assets by accessing, searching and managing the subject data in place without the need for migration to the appliance or a central repository.”
Sounds good. It may well be that X1 Rapid Discovery is the best solution for this process, or maybe not. Either way, the webinar could be worth a gander.X1 Discovery makes e-discovery tools and enterprise search solutions for IT and legal professionals. Founded by Idealab, the company is located in Pasadena, CA.
Cynthia Murrell, November 14, 2012
Sponsored by ArnoldIT.com, developer of Augmentext
The Huber Affair: Demining Now Underway
June 26, 2012
Google is working overtime to keep attention focused on technical issues. You can wallow in the smart software encomium in the New York Times. (See “How Many Computers to Identify a Cat? 16,000” in the June 27, 2012, environmentally unfriendly newspaper or you can give the newspaper’s maybe here, maybe gone link at http://goo.gl/Twl9I.) The Google I/O Conference fast approaches, so there are the concomitant write ups about a Google hardware and news in “Google’s I/O Conference: New Operating System, Tablet”.
But there are two personnel stories which seem to haunt the company at what is the apex of the Google techno-promo machine: Larry Page’s minor voice problem and a person few people outside of Google have heard about. Both of these are potential “information minefields.” Google does not, as far as I know, have an effective demining system in place.
I have avoided commenting directly on the health thing. You can get the story or what passes for a story in “Google CEO Larry Page and the Healthy Way to Answer, ‘What’s Wrong?’” But I do have an opinion about the wizard responsible for Local Search, Maps, Earth, Travel, Payments, Wallet, Offers, and Shopping. I read more about about one Google executive than I expected in “This Exec May Have The Hardest Job At Google, And His Colleagues Are Tired Of Seeing Him Get Trashed In The Press.”
The basic idea, as I understand it, is:
Last week, we [Business Insider] published a story headlined: “Depending On Whom You Ask, This Google Exec Is Either ‘Weak’ Or He Just Drew The Short Straw?
The publication did some digging and learned from “senior sources”:
Their view is that Huber is a top-notch Google executive who asked for the hardest challenge his boss could give him and he got it – in the form of nascent, unproven products and an executive reporting to him that ended up being vastly under-qualified for her job.
The weak link in the Google brain mesh was a person from PayPal. Yikes. A female goofed with some PayPal type projects. The story wraps up:
Yahoo and Governance: Semel to Scott Thompson
May 4, 2012
Short honk: I don’t much, if any, attention to Yahoo. My last big analysis of Yahoo was shortly after its then Chief Technology Officer tried to explain to a financial services client of mine that Yahoo was ahead of Google in search. Crazy assertion from a crazy outfit. In my report, I included an image of Terry Semel as the captain of the Titanic. Got a laugh. Yahoo got zero positives from me. (By the way is that Wikipedia profile of Mr. Semel accurate? Check it out between conference calls and SMS texting.)
Navigate to “Scott Thompson Resume Scandal Is Not an Inadvertent Mistake—He Also Claimed Comp Sci Degree as CTO of PayPay.” I want to comment on the spelling of résumé but who cares? That’s my attitude to the coverage of an executive fudging a biography. Furthermore, in my analysis Yahoo is the type of outfit which lives in a world of illusion, silos, and confusion.
The fact that a senior executive would take the time to do a little digging is absolutely no surprise to me. I hear the phrase “I’m too busy” from people whom I know are not too busy. Some of these people ask me for work and then tell me, “We have a spring vacation.” I heard this phrase from a company president who is guiding a company which is losing millions of dollars each quarter. Right. Vacation. Spring break.
I think we have plenty of solid evidence of a core governance problem at Yahoo, but the same issue exists in many US organizations. Whether it is the confusion about the actions of US government employees or the unfortunate Google Street View incident, governance is not a core competency in many US organizations. Enron, Lehman Brothers, Tyco—remember these executive edifices?
Furthermore I don’t think governance can be fixed quickly, if at all.
When an individual professional does not do the basics like checking key facts, the egregious mistakes will continue and most likely increase.
Governance problems are not black swans.
Governance problems are a direct outcome of people who do not focus, gather information, analyze, and reflect.
Rushing to meetings, asking for others to collect information, and staring at mobile devices—these are flashing signals of trouble at Yahoo and elsewhere.
Fiddling with a biography is either effective public relations, impactful marketing, or the shortest distance between a person and the top of Maslow’s hierarchy. For me, Yahoo and fake credentials are no big deal.
Baloney is the business of many businesses.
Desperation marketing is the new normal marketing.
Stephen E Arnold, May 4, 2012
Sponsored by no one but me.
Google Search Saves the Day
March 1, 2012
According to the Nextgov article “Solyndra Investigation Led to New Search Tool at Energy” Google Search has joined the fight against crime. A “congressional investigation into alleged mishandling of a $528 million Energy Department loan guarantee” to Solyndra caused a data overload for Energy’s Loan Department. Energy workers were trying to provide congressional investigators with the information they needed but their current document search tools simply created even bigger problems. According to Energy Chief Technology Officer Peter Tseronis the searches “resulted in “GS-15s standing at printers hitting print, print, print, copy, copy, copy for emails, attachments, PDFs — information that was just voluminous.” In response to this mounting problem the CTO office joined forces with an outside vendor to help modify their existing Google search system. Users throughout the department then had the ability to index as well as sort through various emails, Word documents etc. Looks like Google Search was ready and saved the day.
April Holmes, March 1, 2012
Sponsored by Pandia.com
Autonomy Serves US Government with Governance
February 2, 2012
It seems our nation is finally getting its records in digital order. The Sacramento Bee reports, “Autonomy Empowers U.S. to Meet President Obama’s New Memorandum on Government Records.” According to the Memorandum, government agencies must standardize their content policies and transfer relevant files to the U.S. National Archives and Records Administration. The press release presents Autonomy as the tool for the job:
Autonomy delivers a comprehensive suite of information governance solutions that addresses the broad and varying needs of enterprises and government agencies. With support for over 150 languages and access to over 1,000 file types through 400 pre-built connectors to disparate content sources, repositories and legacy systems, the Autonomy solutions can apply policy consistently to every information source in the enterprise simultaneously, while managing content in place and reducing duplicates across all enterprise repositories.
Autonomy, owned by HP, is a leader in the field of unstructured data management and serves prominent public and private organizations around the globe. The company was founded in 1996, and has made its fortune on the fruit of research originally performed at Cambridge University.
Cynthia Murrell, February 2, 2012
Sponsored by Pandia.com
United Nations and Its Tech Challenges
December 20, 2011
From the “Why Am I Not Surprised” Department. News Flash.
UN Computer System Failure
A flub at the United Nations— an estimated nearly $400 million flub– has been made public as UN officials are scrambling to get the botched project back on track. Perhaps “flub” is too strong? Maybe in UN speak, the error was an administrative concern. Yes, that’s it. Administrative concern.
The United Nations’ project, known as Umoja, is a computer and software system that promised to reform the organization but has been at a standstill since June. Umoja, which was intended to be an administrative system to cut down on waste and fraud, was led by Secretary General Ban Ki-moon. Fox News’ article, “UN’s Botched Computer-System Overhaul: A Major ‘Failure’ of Ban Ki-Moon’s Management” tells us more:
Ban’s officials are scrambling to get the jinxed project known as Umoja (Swahili for unity) back on track after a key UN budget committee heard from Ban’s office last week that the sweeping information technology overhaul, already a year behind schedule, won’t be finished until 2015, three years beyond the original target date. The committee also said it was “deeply disturbed and dismayed” by the UN’s “apparent lack of awareness and foreknowledge” about the sputtering status of the project.”
This is entropy from top to bottom. Is this the UN’s approach to information management? It appears that guessing about technology may not work and the organization should probably make more solidified plans before pushing such a large and costly project forward. From peacekeeping to computing, the UN is rowing against the current of competence in my opinion.
Andrea Hayden, December 20, 2011
Sponsored by Pandia.com
Business Process Is Not Information Management
November 15, 2011
We continue to run across some interesting stories about Enterprise Data. This one from Catherine Lamsfuss caused quite a debate at lunch. Here’s what we read:
As the amount of data within a business or industry grows the question of what to do with it arises. The article, “Business Process Management and Mastering Data in the Enterprise“, on Capgemini’s website explains how Business Process Management (BPM) is not the ideal means for managing data.
According the article as more and more operations are used to store data the process of synchronizing the data becomes increasingly difficult.
As for using BPM to do the job, the article explains,
While BPM tools have the infrastructure to do hold a data model and integrate to multiple core systems, the process of mastering the data can become complex and, as the program expands across ever more systems, the challenges can become unmanageable. In my view, BPMS solutions with a few exceptions are not the right place to be managing core data[i]. At the enterprise level MDM solutions are for more elegant solutions designed specifically for this purpose.
The answer to this ever-growing problem was happened upon by combining knowledge from both a data perspective and a process perspective. The article suggests that a Target Operating Model (TOM) would act as a rudder for the projects aimed at synchronizing data. After that was in place a common information model be created with enterprise definitions of the data entities which then would be populated by general attributes fed by a single process project.
While this is just one man’s answer to the problem of data, it is a start. Regardless of how businesses approach the problem it remains constant–process management alone is not efficient enough to meet the demands of data management.
“It’s not the process its the people that implement and use the process that matter” stated Jasmine Ashton in a final summary of the lunch debate. We had to agree. However, as we looked through the Polyspot data management description that Ms. Lamsfuss’ article pointed us to we had to agree that starting with a good technology implementation could go a long way towards helping the people follow the processes.
Constance Ard November 15, 2011