May 25, 2016
I lost track of MarkLogic when the company hit about $51 million in revenue and changed CEOs in 2006. In 2012, another CEO changed took place Since Gary Bloom, a former Oracle executive took over, the company, according to “Gary Bloom Interview: Big Data Driving Sales Boom at MarkLogic,” the company is now “topping” $100 million in annual revenue.
MarkLogic is one of the outfits laboring in the DCGX / DI2E vineyard. The company may be butting heads with outfits like Palantir Technologies as the US Army’s plan to federate its systems and data move forward.
MarkLogic opened for business in 2003 and has ingested, according to Crunchbase, $175 million in venture funding. With a timeline equivalent to Palantir Technologies’, there may be some value in comparing these two “startups” and their performance. That is an exercise better left to the feisty young MBAs who have to produce a return for the Sequoia and Wellington experts.
The interview contained two interesting statements which I found surprising:
The driver is Big Data: large corporations are convinced there is an El Dorado of untapped commercial opportunities — if only they can run their reports across all their data sources. But integrating all that data is too costly, and takes too long with relational databases. The future will be full of data in many forms, formats, and sources and how that data is used will be the differentiator in many competitive battles. If that data can’t be searched it can’t be used.
That is indeed the belief and the challenge. Based on what I have learned via open sources about the DCGS project, the reality is different from the “all” notions which fill the heads of some of the vendors delivering a comprehensive intelligence system to US government clients. In fact, the reality today seems to me to be similar to the hope for the Convera system when it was doing the “all” approach to some US government information. That, as you may recall, did not work out as some had hoped.
The second statement I highlighted is:
Although MarkLogic is tiny compared to Oracle there are some interesting parallels. “MarkLogic is at about the same size as Oracle was when I began working there. It took a long time for Oracle to get security and other enterprise features right, but when it did, that was when company really took off.”
The stakeholders hope that MarkLogic does “take off.” With more than 12 years of performance history under its belt, MarkLogic could be the next big thing. The only hitch in the git along is that normalization of information and data have to take place. Then there is the challenge of the query language. One cannot overlook the competitors which continue to bedevil those in the data management game.
With Oracle also involved in some US government work, there might be a bit of push back as the future of MarkLogic rolls forward. What happens if IBM’s data management systems group decide to acquire MarkLogic? Excitement? Perhaps.
Stephen E Arnold, May 25, 2016
July 30, 2015
JSON-LD is designed around the concept of a “context” to provide additional mappings from JSON to an RDF model.
Yes, the much loved RDF model.
When I read “JSON-LD and Why I Hate the Semantic Web,” I noticed a bit of friskiness in the word choice; for example, misguided souls, cryptic, complicated, market share, “kick RDF in the nuts,” and similar rhetorical arabesques. I do like the active verb “kick” however.
The passage I highlighted with my bright orange marker was this one:
The problem with getting a room full of smart people together is that the group’s world view gets skewed. There are many reasons that a working group filled with experts don’t consistently produce great results. For example, many of the participants can be humble about their knowledge so they tend to think that a good chunk of the people that will be using their technology will be just as enlightened. Bad feature ideas can be argued for months and rationalized because smart people, lacking any sort of compelling real world data, are great at debating and rationalizing bad decisions.
Seems normal to me.
In my opinion, this write up explains why some XML centric, Semantic Web cheerleaders have labored to generate organic growth. Just a thought. Talking to fellow travelers is reassuring and comfortable. Those not on the cruise ship may have a different point of view.
Stephen E Arnold, July 30, 2015
October 10, 2014
I recall learning a couple of years ago that Amazon was a great place to store big files. Some of the XML data management systems embraced the low prices and pushed forward with cloud versions of their services.
When I read “Amazon’s DynamoDB Gets Hugely Expanded Free Tier And Native JSON Support,” I formed some preliminary thoughts. The trigger was this passage in the write up:
Is JSON better than XML? Is JSON easier to use than XML? Is JSON development faster than XML? Ask an XML rock star and the answer is probably, “You crazy.” I can hear the guitar riff from Joe Walsh now.
Ask a 20 year old in a university programming class, and the answer may be different. I asked the 20 something sitting in my office about XML and he snorted: “Old school, dude.” I hire only people with respect for their elders, of course.
Here are the thoughts that flashed through my 70 year old brain:
- Is Amazon getting ready to make a push for the customers of Oracle, MarkLogic, and other “real” database systems capable of handling XML?
- Will Amazon just slash prices, take the business, and make the 20 year old in my office a customer for life just because Amazon is “new school”?
- Will Amazon’s developer love provide the JSON fan with development tools, dashboards, features, and functions that push clunky methods like proprietary Xquery messages into a reliquary?
No answers… yet.
Stephen E Arnold, October 10, 2014
December 2, 2013
On Saturday, November 30, 2013, The New York Times published “Health Care Site Rushing to Make Fixes by Sunday.” As I now know, mission accomplished. But there was no aircraft carrier, brass band, or flag. (Here’s the link to the online story, but like so many “real” journalistic efforts, the link can go dead and you will have to hunt for a November 30, 2013 Times and look on pages A 1 with a jump to page A 12. Penguin, there is nothing I care to do about the link. Sorry.)
I wanted to document this passage from the Times’ story about MarkLogic. What’s interesting is that the company gets little attention from other “real” journalists. I suppose if I were curious, I would attempt to answer the question, “Why?”
I am not curious. Here’s what snagged my attention on the 30th:
Gary C. Boom, the chief executive officer of another vendor, MarkLogic, said his firm is also moving its software to differently configured servers.
The idea is from MarkLogic’s neighbor in Silicon Valley, Oracle. A few years ago, Oracle wrote a white paper banging on MarkLogic’s technology. You can find a copy of that analysis in “Mark Logic XML Server 4.1.” I wrote about the tempest in “A Coming Dust Up between Oracle and MarkLogic?”
The Times’ story continued:
MarkLogic provided the technology for the database that serves as the system’s internal filing cabinet and index.
The story does not make clear whether MarkLogic is an XML server that acts like a junction box among the moving parts of the HealthCare.gov site, a data management system interacting with Oracle’s technology, or a search engine for the Web site. MarkLogic positions its technology as doing each of these functions plus analytics, business intelligence, customer relationship management, publishing, and probably some other functions as well.
the Times quotes Mr. Bloom as having said:
I am picking up my house and moving it to a better foundation next door,” he [Mr. Bloom] said in an interview. He said MarkLogic is performing up to standard, but “the network and the storage systems are not properly sized and not properly run.”
It is not clear to me which vendor is providing the storage systems. Is it MarkLogic or is it another vendor such as Oracle, a company apparently unimpressed with some of MarkLogic’s technology if I understand the Oracle white paper.
The Times added:
“Another critical problem involved the specifications for a major computer switch that connects the computer services through a security firewall to the Internet. Mr. Bloom said it has been upgraded from four gigabytes a second to 60 [gigabytes a second]. He said the earlier speed was the equivalent of employing four security staffers to screen Heathrow Airport’s passengers. “The line to get through,” he said, “would go back to the city of London.”
I am not sure how these issues did not become known to the vendors pushing data through the system, but apparently, the 15X shortfall was not noticed. I wonder how many home builders move a completed house to a new foundation. Also, what if the security folks at Heathrow are more or maybe less efficient than those located where HealthCare.gov is?
I will keep my eye on this issue because MarkLogic has been emphasizing that it offers a search system. Where there is a search vendor, there seems to be some activity of interest. And where there are MarkLogic and Oracle, there may be some interesting discussion between the parties.
Stephen E Arnold, December 2, 2013
November 23, 2013
I read “Tension and Flaws Before Health Website Crash.” The good news is that the story focuses on what is now old news: Management challenges at the agency responsible for Healthcare.gov. The bad news—at least for champions of XML repositories, XML normalization, and XML as the “answer” to a wide range of information management woes—is that XML (extensible markup language) is not the slam dunk, whiz bang solution some true believers hope.
Here’s the passage that caught my attention:
Another sore point was the Medicare agency’s decision to use database software, from a company called MarkLogic, that managed the data differently from systems by companies like IBM, Microsoft and Oracle. CGI officials argued that it would slow work because it was too unfamiliar. Government officials disagreed, and its configuration remains a serious problem.
MarkLogic has not been identified as a vendor creating some headaches until now. MarkLogic has a system that can store information and data in an XML data management system. The trick is that content not in XML must be normalized; that is, converted to XML. MarkLogic has developed some proprietary methods to perform its data management operations. A person familiar with XML may not be conversant with the MarkLogic conventions. The upside of this approach is that MarkLogic has experts who are able to address most customer requests. The downside is that a person familiar with XML but not MarkLogic can introduce some problems into an otherwise spiffy system.
In the last few years, MarkLogic has had a number of senior management changes. I track the company via my Overflight system and have noted that the firm has gone from a company that does a good job of publicizing itself to an outfit that has trimmed back on its public presence. You can check out the MarkLogic Overflight on the ArnoldIT.com Web site. The minimal news flow, the absence of tweets, and the termination of public blog content can be verified by visiting the paste every few days.
One interesting aspect of MarkLogic is that the company has positioned itself as a publishing platform. Once content is in the repository, it is possible to slice and dice information and data. Publishers can use this feature to whip out books with little or no involvement of human editors. But the company has, like Verity, grafted on other features and services. These range from enterprise search to text mining to electronic mail management.
I heard that the company was to have been a $200 or $300 million dollar a year operation a few years ago. The firm may be the best kept secret in terms of its revenues and profits. If so, kudos. But if the company has not been able to demonstrate strong growth and healthy net profits, the firm may need to ramp up its publicity and marketing activities.
The New York Times’s comment may be hogwash. Even if a stretch, getting a paragraph that strikes me as less than favorable raises some questions; for example:
- Are proprietary extensions a good idea for an XML system that must be used by folks who are not into XML?
- Will the transformations between and among content from disparate systems remain bottlenecks during periods of high content flow and usage?
- Will Oracle seize on the MarkLogic system and revive its flow of information about the weaknesses of XML as compared with content stored in an Oracle data management system?
MarkLogic has rolled through three of four presidents in the last few years. Dave Kellogg departed, and I mostly lost track of who followed him. At the time of his departure MarkLogic was in the $60 million estimated revenues. Will the management turmoil kick in again? Will the company continue to expand its features and functions as Verity did prior to its initial public offering? Are there parallels between the trajectories of Convera, Delphes, Entopia, and Verity and MarkLogic. For some case analyses, check out www.xenky.com/vendor-profiles.
Stephen E Arnold, November 23, 2013
November 6, 2013
If you need a search system and love Java, you will want to read the most recent Xenky Vendor Profile. Dieselpoint is based in Chicago, Illinois. Compared to some search vendors, Dieselpoint keeps a low profile. The profile is available without charge at Xenky’s Vendor Profile page. Be sure to read the caveats for these free profiles. If you want to make a comment or explain a point I missed by a mile, use the comments section of Beyond Search. The profiles are drafts and will not be updated.
Stephen E Arnold, November 6, 2013
July 31, 2013
For all you XML lovers out there, particularly those with dual-core machines, RaptorXML is here. Market Wired hosts, “Altova Announces General Availability of RaptorXML.” The product is part of Altova’s suite of server products. The press release informs us:
“Altova RaptorXML is a high-performance XML and XBRL server optimized for today’s multi-CPU, multi-core computers and servers. Developers creating solutions using Altova MissionKit XML development and XBRL development tools will be able to power server applications with RaptorXML for hyper-performance, increased throughput, and efficient memory utilization to validate and process large amounts of XML or XBRL data cost-effectively. . . .
“RaptorXML conforms to the latest versions of all relevant XML and XBRL standards and has been submitted to rigorous regression and conformance testing. The server is available in three versions.”
These versions include Raptor XML Server, Raptor XML+XBRL Server, and RaptorXML Development Edition. The last of these facilitates applications testing by developers working in Altova’s XMLSpy, MapForce, and StyleVision. The products are available for use on Windows, 32-bit or 64-bit, and for the 64-bit MacOS. Pricing is on an annual licensing basis, determined by the number of CPU cores in a prospective customer’s server. A few features include a low memory footprint, cross-platform capabilities, and beefed-up error reporting. See the article above (and/or this one) for more details.
The developer-centered Altova focuses on data management, software development, and data integration. The company boasts that 91% of Fortune 500 companies use their products, but emphasizes that small and medium businesses are also valuable clients. Altova splits its headquarters between Beverly, Massachusetts and Vienna, Austria.
Cynthia Murrell, July 31, 2013
April 12, 2013
My in box overfloweth. Temis has rolled out a number of announcements in the last 10 days. The company is one of the many firms offering “semantic” technology. Due to the vagaries of language, Temis is in the “content enrichment” business. The idea is that technology indexes key words and concepts even though a concept may not be expressed in a text document. I call this indexing, but “enrichment” is certainly okay.
The first announcement which caught my attention was a news release I saw on the Marketwatch for fee distribution service. The title of the article was “TEMIS Completes Successful Wide Scale Semantic Content Enrichment Test in Windows Azure.” A news release about a test struck me as unusual. The key point for me was that Temis is positioning itself to go after the SharePoint add in market.
The second announcement was a news story distributed by Eureka Alert called “Wiley Selects Temis for Semantic Big Data Initiative The key point is that a traditional publishing company has licensed software to do what humans used to do in a venerable publishing company which, until recently, was sticking with traditional methods and products. Will Temis propel John Wiley to the top of the leader board of professional publishers? Hopefully some information will become available quickly.
The third announcement which I noted was “Temis and MarkLogic Strengthen Strategic Alliance.” The write up hits the concepts of semantics and big data. Here’s the passage which intrigued me:
MarkLogic® Server is the only enterprise NoSQL database designed for building reliable, scalable and secure search, analytics and information applications quickly and easily. The platform includes tools for fast application development, powerful analytics and visualization widgets for greater insight, and the ability to create user-defined functions for fast and flexible analysis of huge volumes of data.
I am uncomfortable with the notion of “only”. MarkLogic is an XML centric data management system. Software wrappers can use the XML back end for a range of applications. These include something as exotic as a Web site for the US Army to more sophisticated applications for publishing technical documents for an aircraft manufacturing firm. However, there are a number of ways to accomplish these tasks and some of the options make use of somewhat similar technology; for example, eXist-db. While not perfect, the fact that an alternative exists only increases my discomfort with an “only”.
So what’s up? My hunch is that both MarkLogic and Temis are in flat out marketing mode. Clusters of announcements are, in my experience, an indication that the pipeline needs to be filled. Equally surprising is that MarkLogic into a big data player and an enterprise search system, not a publishing system. Most vendors are morphing. The tie up with Temis suggests that Temis’ back end needs some beefing up. The MarkLogic positioning is that it is now a player in semantics and big data. I think that partnering is a quick way to fill gaps.
Will MarkLogic blast through the $100 million in revenue ceiling? Will Temis emerge as a giant slayer in semantic big data? The company recently raised $25 million to become a player in big data. (See “Big Data Boon: MarkLogic Pulls In $25 Million In VC Funding”.) Converting $25 million into high margin revenue could tax the likes of Jack Welch in his prime.
My hunch is that both firms’ management teams have this as a 2013 goal. With the patience of investors wearing thin for many search and content processing vendors, closed deals are a must. The economy may be improving for analysts on CNBC, but for search vendors, making Autonomy-scale or Endeca-scale revenues may be difficult, if not impossible.
In my opinion, the labels “big data” and semantics do not by themselves deliver revenue the way Google delivers Adwords. As more search firms chase additional funding, has the world of search switched from finding information for customers to getting money to stay in business?
No timidity visible as these two firms race down the semantic interstate.
Stephen E Arnold, April 12, 2013
April 8, 2013
The Altova Blog piece “Editing, Converting and Generating JSON” provides a helpful guide to using JSON. The use of JSON as a data transport protocol has been on the rise and so has the debate about the advantages of JSON vs. XML. The debate has been waging on but the author actually sums it up fairly well.
“But when you boil it down, there are simply some cases for which JSON is the best choice, and others where XML makes more sense. While you might need to choose between JSON and XML depending on the development task at hand, you don’t have to choose between code editors – XMLSpy supports both technologies and will even convert between the two.”
Altova has extended its intelligent XML editing features to JSON editor in order to make JSON editing as simple as possible. Users who begin editing JSON in text view will get lots of help along the way from XMLSpy thanks in the form of syntax coloring, bracket matching, source folding, entry helper windows, menus and other helpful tools. A one click option on the XMLSpy convert menu makes converting XML to or from JSON quick and easy. The ability to edit but also convert items directly within the XML editor program is extremely useful. JSON lovers will definitely have something to look forward to.
April Holmes, April 08, 2013
February 26, 2013
Most people never really think about how news organizations transmit data across continents when there is a big event. For the Summer Olympics in 2012 The Press Association relied on MarkLogic’s XML repository’s ability to store and query hundreds of thousands of pieces of metadata per second.
In “How PA Cleared The Big Data Hurdle At The London Olympics” the Press Associations director of technical architecture, John O’Donovan, gives consumers an in depth look at how the office was able to cope with more than 50,000 requests per second.
“The problem with that is having to sit down and design a relational database model that can represent everything that’s in the XML. That takes quite a lot of time, you have to build all of your input/output extenders and map XML objects into relational stores.”
At first look it seems like an impossible task, organizing all of the photos, biographical information, statistics, and competition results for thousands of athletes and beaming it to televisions, phones and computers everywhere, but, by removing the relational database the PA made it possible.XML store instead of storing it in the relational database and then retransferring the data back to XML.
It simplified the delivery system from 100 to 34 man hour days to get off the ground and was so successful that The Press Association will be utilizing the new system for all of its wire and output communications.
Big thumbs ups to MarkLogic’s ability to handle the process and to the PA for finding a new way to utilize an already reliable resource.
Leslie Radcliff, February 26, 2013