Reports Maestro

October 1, 2010

In the last few days, I have had several conversations about converting outputs of search systems into reports. There are high-end systems like Megaputer’s and then carrier-class systems from outfits like IBM SPSS and SAS. As we talked with people about tools, we learned about a Web report and chart builder that is quite interesting.

image

Display of chart types available.

The write up about the program in LearnXpress said:

With ReportsMaestro, no coding is required to convert those huge amounts of data into the interactive, professional-looking charts. The smart solution provides Web Charts Builder with 15+ chart types and styles, from popular pie and bar charts to sophisticated 3D diagrams. It also offers Query Builder to enhance the look of your database joining tables, grouping, sorting and filtering data.

You can work through an online demo and then, if it seems useful, buy it online. What’s appealing about the product is that it costs about $80. We use a tool from Swiff Chart in France. For those confused by Google Fusion, ReportsMaestro may be worth a look.

Stephen E Arnold, October 1, 2010

Freebie

Graphic of Online Communities Reveals Big Social Media Shifts

August 18, 2010

Since the introduction of the first online social media sites the industry has grown immensely. XKCD using a map illustrated the relative size of the various online communities in 2007 but just a mere three years later Flowtown released an updated version of the map as shown in “Map of Online Communities Reveals Staggering Social Media Shifts (Pictures).” According to the 2007 map MySpace was the dominant social media site with Facebook pictured as one of the smaller ones. Flowtown created a network map “that reflects current trends in 2010.” The results were substantially different and show Facebook, which boasts about 500 million members, as the dominant social site. The old giant MySpace experienced a decrease in membership as well as social media power. With so many options and members it can make searching through the sites an endless task. In just a few years the social media world has done a 360.

April Holmes, August 18, 2010

Business Intelligence: Optimism and Palantir

June 28, 2010

Business intelligence is in the news. Memex, the low profile UK outfit, sold to SAS. Kroll, another low profile operation, became part of Altegrity, anther organization with modest visibility among the vast sea of online experts. Now Palantir snags $90 million, which I learned in “Palantir: the Next Billion Dollar Company Raises $90 Million.” In the post financial meltdown world, there is a lot of money looking for a place that can grow more money. The information systems developed for serious intelligence analysis seem to be a better bet than funding another Web search company.

Palantir has some ardent fans in the US defense and intelligence communities. I like the system as well. What is fascinating to me is that smart money believes that there is gold in them there analytics and visualizations. I don’t doubt for a New York minute that some large commercial organizations can do a better job of figuring out the nuances in their petabytes of data with Palantir-type tools. But Palantir is not exactly Word or Excel.

The system requires an understanding of such nettlesome points as source data, analytic methods, and – yikes – programmatic thinking. The outputs from Palantir are almost good enough for General Stanley McChrystal to get another job. I have seen snippets of some really stunning presentations featuring Palantir outputs. You can see some examples at the Palantir Web site or take a gander (no pun intended by the addled goose) at the image below:

image

Palantir is an open platform; that is, a licensee with some hefty coinage in their knapsack can use Palantir to tackle the messy problem of data transformation and federation. The approach features dynamic ontologies, which means that humans don’t have to do as much heavy lifting as required by some of the other vendors’ systems. A licensee will want to have a tame rocket scientist around to deal with the internals of pXML, the XML variant used to make Palantir walk and talk.

You can poke around at these links which may go dark in a nonce, of course: https://devzone.palantirtech.com/ and https://www.palantirtech.com/.

Several observations:

  • The system is expensive and requires headcount to operate in a way that will deliver satisfactory results under real world conditions
  • Extensibility is excellent, but this work is not for a desk jockey no matter how confident that person in his undergraduate history degree and Harvard MBA
  • The approach is industrial strength which means that appropriate resources must be available to deal with data acquisition, system tuning, and programming the nifty little extras that are required to make next generation business intelligence systems smarter than a grizzled sergeant with a purple heart.

Can Palantir become a billion dollar outfit? Well, there is always the opportunity to pump in money, increase the marketing, and sell the company to a larger organization with Stone Age business intelligence systems. If Oracle wanted to get serious about XML, Palantir might be worth a look. I can name some other candidates for making the investors day, but I will leave those to your imagination. Will you run your business on a Palantir system in the next month or two? Probably not.

Stephen E Arnold, June 27, 2010

Freebie

Oracle Text Visualization

March 30, 2010

Short honk: A happy quack to the reader and obvious Oracle aficionado for this tip. You can download the Java library for Oracle Text visualization directly from Oracle and at no charge. The information on the Oracle Web site is sparse:

This Java library for Oracle Text visualization is incorporated in the Oracle Text sample code application to visualize clusters, categories, and themes.

Act now.

Stephen E Arnold, March 30, 2010

No one paid me to pass along this link.

Exclusive Interview: Digital Reasoning

February 2, 2010

Tim Estes, the youthful founder and chief technologist, for Digital Reasoning, a search and content processing company based in Tennessee, reveals the technology the is driving the company’s growth. Mr. Estes, a graduate of the University of Virginia, tackled the problem of information overload with a fresh approach. You can learn about Digital Reasoning’s approach that delivers a system that “deeply, conceptually searches within unstructured data, analyzes it and presents dynamic visual results with minimal human intervention. It reads everything, forgets nothing and gets smarter as you use it.”

Mr. Estes explained:

Digital Reasoning’s core product offering is called “Synthesys.” It is designed to take an enterprise from disparate data silos (both structured and unstructured), ingest and understand the data at an entity level (down to the “who, what, and wheres” that are mentioned inside of documents), make it searchable, linkable, and provide back key statistics (BI type functionality). It can work in an online/real-time type fashion given its performance capabilities. Synthesys is unique because it does a really good job at entity resolution directly from unstructured data. Having the name “Umar Farouk Abdul Mutallab” misspelled somewhere in the data is not a big deal for us – because we create concepts based on the patterns of usage in the data and that’s pretty hard to hide. It is necessarily true that a word grounds its meaning to the things in the data that are of the same pattern of usage. If it wasn’t the case no receiving agent could understand it. We’ve figured out how to reverse engineer that mental process of “grounding” a word. So you can have Abdulmutallab ten different ways and it doesn’t matter. If the evidence links in any statistically significant way – we pull it together.

You can read the full-text of this exclusive interview with Tim Estes on the ArnoldIT.com site in the Search Wizard Speak series. You can get more information about Digital Reasoning from the company’s Web site.

The Search Wizards Speak series provides the largest collection of free, detailed information about major enterprise search systems.Why pay the azure-chip consultants for sponsored listings, write ups prepared by consultants with little or no hands on experience, and services that “sell” advertorials. You hear in the developer’s, founders, and CEO’s own words what a system does and how it solves content-related problems.

Stephen E Arnold, February 2, 2010

No one paid me to write about my own Web site. I will report this charitable act to the head of the Red Cross.

Quintura Nets Interface Patent

January 21, 2010

Quintura Inc. received a patent in December 2009 for a “Search Engine Graphical Interface Using Maps of Search Terms and Images.” You can obtain a copy of US7,627,582, from the outstanding online service available for the USPTO. If that system is a little sluggish, a number of other patent document services are available. This invention by Alexander V. Ershov concerns:

A system, method and computer program product for visualization of search results includes a map displayed to a user on a screen. The map shows search query terms and optionally other terms related to the search query terms. The display of the terms corresponds to relationship between the terms. A graphical image is displayed next to at least one of the search query terms. The graphical image is associated with a URL that corresponds to a search result. The graphical image is a favorite icon that is derived from the HTML script associated with a webpage at the URL, or an animated image, or a video, or a cycling GIF. A plurality of graphical images can be displayed in proximity to the search query term. The graphical image can be a logo or a paid advertisement. A plurality of graphical images are offered for sale in association with the query search term, and a size and/or placement of each graphical image corresponds to a price paid by each purchaser, or multiple images can be displayed at the same location on the screen, and a duration of display of each graphical image corresponds to a price paid by each purchaser.

Quintura’s see and find technology replaces the laundry list approach to a user’s query. Here’s an example of a Quintura search result:

quintura

In addition to suggested queries, the interface provides the user with a tag cloud, which can be quite helpful for many users. I am no patent attorney, but there may be some legal eagle-type conversations about other firms’ use of the system and method set forth in US7,627,582. You can get more information about Quintura from the firm’s Web site at www.quintura.com. I wrote about this company in September 2009.

Stephen E Arnold, January 21, 2010

Oyez, oyez, a freebie. No one offered a single bent penny to write this short item. Alas! I shall report non payment to the Department of Commerce, an entity of repute.

Google and Its Desired Repositories

November 21, 2009

I find “desired repositories” quite enticing. I was going to call this write up “A Repository Named Desire” but I was fearful that some lawyer responsible for the Tennessee Williams’ play would object. Most of the Sergey-and-Larry-eat-pizza Google pundits follow the red herrings dragged by the Googlers toward the end of each week. Not me. I pretty much ignore the Google public statements because those have a surreal quality for me. The messages seem oddly disconnected from what Google’s deep thinkers are * actually doing *. When Google does a webinar, it is too late for the competitors to do much more than go to their health club and work off their frustrations.

desired repository

That looks simple. From US20090287664. Notice that the types of repositories are extensible.

If you want to see some of the fine tuning underway with the Google plumbing, take a peek at 20090287664, Determination of a Desired Repository. This is a continuation of a 2005(!) invention in case you thought the method looked familiar. You can find the write up at your favorite US government Web site, the USPTO. (Don’t you just love that search interface. Someone told me that the search engine was from OpenText, and I am trying to verify that statement.)

Here’s what caught my attention:

A system receives a search query from a user and searches a group of repositories, based on the search query, to identify, for each of the repositories, a set of search results. The system also identifies one of the repositories based on a likelihood that the user desires information from the identified repository and presents the set of search results associated with the identified repository.

Seems obvious, right? Now think of this at Google scale. Different problem? It is in my book. What has the Google accomplished? Just one claim. Desired repositories at Google scale.

Stephen Arnold, November 21, 2009

Again, I want to report to the USPTO that I was not paid to write yet another cryptic comment about a Google plumbing invention.

Google Probes the Underbelly of AutoCAD

October 15, 2009

Remember those college engineering wizards who wanted to build real things? Auto fenders, toasters, and buildings in Dubai. Changes are the weapon of choice was a software product from Autodesk. Over the years, Autodesk added features and functions to its core product and branched out into other graphic areas. In the end, Autodesk was held captive by the gravitational pull of AutoCAD.

In one of my Google monographs, I wrote about Google’s SketchUp program. I recall several people telling me that SketchUp was unknown to them. These folks, I must point out, were real, live Google experts. SketchUp was a blip on a handful of users’ radar screen. I took another angle of view, and I saw that the Google coveted the engineering wizards when they were in primary school and had a method for keeping these individuals in the Google camp until they designed their last, low-cost fastener for a green skyscraper in Shanghai.

No one really believed that this was possible.

My suggestion is that some effort may be prudently applied to rethinking what the Google is doing with engineering software that makes pictures and performs other interesting Googley tricks. The first step could be reading the Introducing Google Building Maker article on the “official” Google Web log. I would gently suggest that the readers of this Web log buy a copy of the Google trilogy, consisting of my three monographs about Google technology. Either path will give you some food for thought.

For me, the most interesting comment in the Google blog post was:

Some of us here at Google spend almost all of our time thinking about one thing: How do we create a three-dimensional model of every built structure on Earth? How do we make sure it’s accurate, that it stays current and that it’s useful to everyone who might want to use it? One of the best ways to get a big project done — and done well — is to open it up to the world. As such, today we’re announcing the launch of Google Building Maker, a fun and simple (and crazy addictive, it turns out) tool for creating buildings for Google Earth.

The operative phrase is “every built structure on early”. How is that for scale?

What about Autodesk? My view is that the company is going to find itself in the same position that Microsoft and Yahoo now occupy with regard to Google. Catch up is impossible. Leap frogging is the solution. I don’t think the company can make this type of leap. Just my opinion.

Stephen Arnold, October 15, 2009
Another freebie. Not even a lousy Google mouse pad for my efforts.

Visual View of Search History

September 21, 2009

A happy quack to the team of readers who sent me a link to the Firefox add in, History Tree 1.1. Now these are sharp readers who know that my honks about visualization make clear that gratuitous interface elements ruffle my feathers. I loaded the History Tree and found that it provided a quick and easy way to locate specific Web pages I had visited.

image

The Firefox add in is available from the Firefox splash page for the software. You can get more information and a one click install button from Normansolomon.org. Useful, not gratuitous, and evidence that there is a better way to deal with history files. I also like it when two bright people tag team what I cover in this Web log. I bet both are pretty good at finding information and keeping addled geese like me in formation.

Stephen Arnold, September 21, 2009

Visualization and Confusion

August 15, 2009

Visualization of search results or other data is a must-have for presentations in the Department of Defense. What’s a good presentation? One that has killer visualizations of complex data. The problem is that sizzle in one colonel’s graphics triggers a graphics escalation. This is a briefing room version of Mixed Martial Arts. The problem, based on my limited experience in this type of content, is that most of the graphics don’t make much sense. In fact, when I see a graphic I usually have zero idea about where the data originated, the mathematical methods used to generate the visual, or what Photoshop wizardry may have been employed to make that data point explode in my perceptual field. Your mileage may differ, but I find that visualization is useful in small doses.

To prove that what I prefer is out of date and that my views are road kill on the information superhighway, you will want to explore “15 Stunning Examples of Data Visualization”. Stunning is an appropriate word. After looking at these examples, I am not sure what is being communicated in some of these graphics. Example: Big fluctuations.

image

If you want to add zing to your briefings, you will definitely get some ideas from this article. If I am in the audience, expect questions from the addled goose. Know your data thoroughly because I am not sure some of these examples communicate on the addled goose wave length.

Stephen Arnold, August 14, 2009

« Previous PageNext Page »

  • Archives

  • Recent Posts

  • Meta