Exclusive Interview: Digital Reasoning
February 2, 2010
Tim Estes, the youthful founder and chief technologist, for Digital Reasoning, a search and content processing company based in Tennessee, reveals the technology the is driving the company’s growth. Mr. Estes, a graduate of the University of Virginia, tackled the problem of information overload with a fresh approach. You can learn about Digital Reasoning’s approach that delivers a system that “deeply, conceptually searches within unstructured data, analyzes it and presents dynamic visual results with minimal human intervention. It reads everything, forgets nothing and gets smarter as you use it.”
Mr. Estes explained:
Digital Reasoning’s core product offering is called “Synthesys.” It is designed to take an enterprise from disparate data silos (both structured and unstructured), ingest and understand the data at an entity level (down to the “who, what, and wheres” that are mentioned inside of documents), make it searchable, linkable, and provide back key statistics (BI type functionality). It can work in an online/real-time type fashion given its performance capabilities. Synthesys is unique because it does a really good job at entity resolution directly from unstructured data. Having the name “Umar Farouk Abdul Mutallab” misspelled somewhere in the data is not a big deal for us – because we create concepts based on the patterns of usage in the data and that’s pretty hard to hide. It is necessarily true that a word grounds its meaning to the things in the data that are of the same pattern of usage. If it wasn’t the case no receiving agent could understand it. We’ve figured out how to reverse engineer that mental process of “grounding” a word. So you can have Abdulmutallab ten different ways and it doesn’t matter. If the evidence links in any statistically significant way – we pull it together.
You can read the full-text of this exclusive interview with Tim Estes on the ArnoldIT.com site in the Search Wizard Speak series. You can get more information about Digital Reasoning from the company’s Web site.
The Search Wizards Speak series provides the largest collection of free, detailed information about major enterprise search systems.Why pay the azure-chip consultants for sponsored listings, write ups prepared by consultants with little or no hands on experience, and services that “sell” advertorials. You hear in the developer’s, founders, and CEO’s own words what a system does and how it solves content-related problems.
Stephen E Arnold, February 2, 2010
No one paid me to write about my own Web site. I will report this charitable act to the head of the Red Cross.
Quintura Nets Interface Patent
January 21, 2010
Quintura Inc. received a patent in December 2009 for a “Search Engine Graphical Interface Using Maps of Search Terms and Images.” You can obtain a copy of US7,627,582, from the outstanding online service available for the USPTO. If that system is a little sluggish, a number of other patent document services are available. This invention by Alexander V. Ershov concerns:
A system, method and computer program product for visualization of search results includes a map displayed to a user on a screen. The map shows search query terms and optionally other terms related to the search query terms. The display of the terms corresponds to relationship between the terms. A graphical image is displayed next to at least one of the search query terms. The graphical image is associated with a URL that corresponds to a search result. The graphical image is a favorite icon that is derived from the HTML script associated with a webpage at the URL, or an animated image, or a video, or a cycling GIF. A plurality of graphical images can be displayed in proximity to the search query term. The graphical image can be a logo or a paid advertisement. A plurality of graphical images are offered for sale in association with the query search term, and a size and/or placement of each graphical image corresponds to a price paid by each purchaser, or multiple images can be displayed at the same location on the screen, and a duration of display of each graphical image corresponds to a price paid by each purchaser.
Quintura’s see and find technology replaces the laundry list approach to a user’s query. Here’s an example of a Quintura search result:
In addition to suggested queries, the interface provides the user with a tag cloud, which can be quite helpful for many users. I am no patent attorney, but there may be some legal eagle-type conversations about other firms’ use of the system and method set forth in US7,627,582. You can get more information about Quintura from the firm’s Web site at www.quintura.com. I wrote about this company in September 2009.
Stephen E Arnold, January 21, 2010
Oyez, oyez, a freebie. No one offered a single bent penny to write this short item. Alas! I shall report non payment to the Department of Commerce, an entity of repute.
Google and Its Desired Repositories
November 21, 2009
I find “desired repositories” quite enticing. I was going to call this write up “A Repository Named Desire” but I was fearful that some lawyer responsible for the Tennessee Williams’ play would object. Most of the Sergey-and-Larry-eat-pizza Google pundits follow the red herrings dragged by the Googlers toward the end of each week. Not me. I pretty much ignore the Google public statements because those have a surreal quality for me. The messages seem oddly disconnected from what Google’s deep thinkers are * actually doing *. When Google does a webinar, it is too late for the competitors to do much more than go to their health club and work off their frustrations.
That looks simple. From US20090287664. Notice that the types of repositories are extensible.
If you want to see some of the fine tuning underway with the Google plumbing, take a peek at 20090287664, Determination of a Desired Repository. This is a continuation of a 2005(!) invention in case you thought the method looked familiar. You can find the write up at your favorite US government Web site, the USPTO. (Don’t you just love that search interface. Someone told me that the search engine was from OpenText, and I am trying to verify that statement.)
Here’s what caught my attention:
A system receives a search query from a user and searches a group of repositories, based on the search query, to identify, for each of the repositories, a set of search results. The system also identifies one of the repositories based on a likelihood that the user desires information from the identified repository and presents the set of search results associated with the identified repository.
Seems obvious, right? Now think of this at Google scale. Different problem? It is in my book. What has the Google accomplished? Just one claim. Desired repositories at Google scale.
Stephen Arnold, November 21, 2009
Again, I want to report to the USPTO that I was not paid to write yet another cryptic comment about a Google plumbing invention.
Google Probes the Underbelly of AutoCAD
October 15, 2009
Remember those college engineering wizards who wanted to build real things? Auto fenders, toasters, and buildings in Dubai. Changes are the weapon of choice was a software product from Autodesk. Over the years, Autodesk added features and functions to its core product and branched out into other graphic areas. In the end, Autodesk was held captive by the gravitational pull of AutoCAD.
In one of my Google monographs, I wrote about Google’s SketchUp program. I recall several people telling me that SketchUp was unknown to them. These folks, I must point out, were real, live Google experts. SketchUp was a blip on a handful of users’ radar screen. I took another angle of view, and I saw that the Google coveted the engineering wizards when they were in primary school and had a method for keeping these individuals in the Google camp until they designed their last, low-cost fastener for a green skyscraper in Shanghai.
No one really believed that this was possible.
My suggestion is that some effort may be prudently applied to rethinking what the Google is doing with engineering software that makes pictures and performs other interesting Googley tricks. The first step could be reading the Introducing Google Building Maker article on the “official” Google Web log. I would gently suggest that the readers of this Web log buy a copy of the Google trilogy, consisting of my three monographs about Google technology. Either path will give you some food for thought.
For me, the most interesting comment in the Google blog post was:
Some of us here at Google spend almost all of our time thinking about one thing: How do we create a three-dimensional model of every built structure on Earth? How do we make sure it’s accurate, that it stays current and that it’s useful to everyone who might want to use it? One of the best ways to get a big project done — and done well — is to open it up to the world. As such, today we’re announcing the launch of Google Building Maker, a fun and simple (and crazy addictive, it turns out) tool for creating buildings for Google Earth.
The operative phrase is “every built structure on early”. How is that for scale?
What about Autodesk? My view is that the company is going to find itself in the same position that Microsoft and Yahoo now occupy with regard to Google. Catch up is impossible. Leap frogging is the solution. I don’t think the company can make this type of leap. Just my opinion.
Stephen Arnold, October 15, 2009
Another freebie. Not even a lousy Google mouse pad for my efforts.
Visual View of Search History
September 21, 2009
A happy quack to the team of readers who sent me a link to the Firefox add in, History Tree 1.1. Now these are sharp readers who know that my honks about visualization make clear that gratuitous interface elements ruffle my feathers. I loaded the History Tree and found that it provided a quick and easy way to locate specific Web pages I had visited.
The Firefox add in is available from the Firefox splash page for the software. You can get more information and a one click install button from Normansolomon.org. Useful, not gratuitous, and evidence that there is a better way to deal with history files. I also like it when two bright people tag team what I cover in this Web log. I bet both are pretty good at finding information and keeping addled geese like me in formation.
Stephen Arnold, September 21, 2009
Visualization and Confusion
August 15, 2009
Visualization of search results or other data is a must-have for presentations in the Department of Defense. What’s a good presentation? One that has killer visualizations of complex data. The problem is that sizzle in one colonel’s graphics triggers a graphics escalation. This is a briefing room version of Mixed Martial Arts. The problem, based on my limited experience in this type of content, is that most of the graphics don’t make much sense. In fact, when I see a graphic I usually have zero idea about where the data originated, the mathematical methods used to generate the visual, or what Photoshop wizardry may have been employed to make that data point explode in my perceptual field. Your mileage may differ, but I find that visualization is useful in small doses.
To prove that what I prefer is out of date and that my views are road kill on the information superhighway, you will want to explore “15 Stunning Examples of Data Visualization”. Stunning is an appropriate word. After looking at these examples, I am not sure what is being communicated in some of these graphics. Example: Big fluctuations.
If you want to add zing to your briefings, you will definitely get some ideas from this article. If I am in the audience, expect questions from the addled goose. Know your data thoroughly because I am not sure some of these examples communicate on the addled goose wave length.
Stephen Arnold, August 14, 2009
Google and Real Time Maps
August 11, 2009
A happy quack to the reader who alerted me to GoogleMapMania’s “Real Time Google Maps”. The article contains a number of links to real time Google Maps created by developers. The one that I found most useful was the Chicago Transit Authority map. Google has a burgeoning transportation services business. Those operating bus, rail, and shuttle services may want to take note of this CTA-centric gizmo.
Stephen Arnold, August 11, 2009
Kartoo Adds New Interface Functions
July 9, 2009
Kartoo’s interface has added some features. If you have not visited the site for a while, you will want to navigate to the Kartoo main page. Set your preferences for this Flash based metasearch system. The interface has visual impact, but an addled goose like me wanted pop up explanation of the icons. The options page looks like this:
Now enter your query in the search box at the top of the page. Unlike the Kartoo interface of the past, you have a larger, cleaner presentation of the relevant hits. When you hover over an icon, Kartoo displays a relationship line. For the query “US financial crisis” the system displayed these results:
When you click on one of the thumbnail images, Kartoo sends you to the source site. If you hover, Kartoo displays a pop up with a text snippet.
On the left column of the interface are two buttons. You can select what supplementary content you want to see. I selected topics, allowing me quick access to only those hits about one of the identified categories. I also instructed the system to show me images. You can see the images, which are presented in low resolution, in the scrollable side bar below the topics.
Kartoo Technologies is based in Paris. The company has been one of the firms pushing the envelope in search interface designs and controls. Information about the company’s products and technologies may be found on the Kartoo corporate Web site. The company now has more than 200 customers who use the firm’s technologies for visualization and intelligence monitoring. The Kartoo teams are located in Clermont-Ferrand, France.
Stephen Arnold, July 9, 2009
Boye 09 Overflight Awards
May 19, 2009
The Overflight Award for Excellence, created by ArnoldIT.com and JBoye.com, was presented to Volker Grünauer, head of E-marketing at Wienerberger in Austria, at the JBoye Conference: Philadelphia 2009, http://jboye08.dk/]http://www.jboye.com/conferences/philadelphia09/, May 5-7, held at the Down Town Club in Philadelphis.
The award recognizes the best presentation at the conference on digital media, which featured more than 50 speakers from around the world.
Grünauer offered a relevant talk called “Developing a customer centric web strategy.” This presentation discussed smart web strategy for promoting real brick and mortar products, including how Wienerberger defines the four elements of web success and how customer behavior has become the trigger for every eMarketing decision. Slides of the presentation are available at http://jboye08.dk/downloads/download.php?file=1226063851.pdf. He was awarded an engraved Lucite trophy and 500 Euros.
Volker is responsible for the marketing strategy of all websites at Wienerberger, the world’s largest manufacturer of bricks, clay roof tiles and clay pavers. In this function he also developed a new brand and domain management strategy. Together with the IT department he managed the rollout of the CMS into new Wienerberger markets. See his profile athttp://www.jboye.com/conferences/philadelphia09/speakers/volker_grunauer.
An honorable mention went to Donna Spencer, a freelance information architect and interaction designer, a mentor, writer and trainer from Australia, who presented a discussion on the user experience track called “Getting Content Right.” She was awarded an engraved Lucite trophy. Her profile is at http://www.jboye.com/conferences/philadelphia09/speakers/donna_spencer.
Stephen E. Arnold and Janus Boye created the award to permit the community attending the conference to identify presentations that met the following criteria: information that would be useful to delegates upon returning to work; research supporting the presentatio; quality of the delivery and examples; and importance of the speakers’ topics at the time of the conference.
A panel of distinguished attendees and information practitioners had the task of assessing the presentations and determining the winners. The judges were Dana Hallman, Office of the Comptroller of the Currency; Karen Rosenzweig, Novartis;Peter Svensson, Lund University; and Troy Winfrey, University of Baltimore.
About ArnoldIT.com
Stephen E. Arnold monitors search, content processing, text mining and related topics from his office in Kentucky. He works with colleagues worldwide on a wide range of online and content-related projects. The company’s Web site is http://arnoldit.com, and the Beyond Search blog is at http://arnoldit.com/wordpress/.
About JBoye.com
J. Boye, a digital media enterprise, is frequently contracted to help with strategy and governance, project planning, requirement specifications, vendor and software selection, project management and ROI optimization. They also produce industry reports and organize educational conferences. Contact the company at info@jboye.co.uk or info@jboye.dk.
Jessica Bratcher, May 19, 2009
Cirilab: Entity Extraction
April 6, 2009
I took a quick look at Cirilab in order to update my files about entity extraction vendors.
Cirilab develops practical search, retrieval and categorization software designed to increase organizational productivity by effectively harnessing key knowledge resources. Cirilab offers a range of advanced analysis and organization applications and tools.
I learned about the company when another consultant sent me links to several online demonstrations of the Cirilab’s technology. I located an older but useful discussion of the Crilab technology here. You can explore a Wikipedia entry about Winston Churchill here and a document navigator of Sir Winston’s writings here. The engine generating these demos is called the KGE or Knowledge Generation. The idea is that KGE can process unstructured text and generate insights into that text.
Source: http://www.cirilab.com/TSMAP/Cirilab_Library/Literature/Winston_Churchill/WikiKMapPage/index.htm
The company’s enterprise solutions include vertical builds of the KGE:
- Publishing. The Web Ready Publishing service allows an organization to take unstructured data in WordPerfect, Word, Adobe PDF, HTML, and even Text files, and publish it in a Web Ready Publishing format so that it is instantly available to your customers in a thematically navigable format.
- Pharma. Cirilab can “read” the documents and therefore allow “mining” of existing data.
- Legal. KGE permits discovery of information.
- Security and intelligence. Cirilab products provide unique insights into this information not otherwise available.
The company offers a range of desktop products. These are excellent ways to learn about the features and functions of the Crilab’s KGE system.
More recently, Cirilab has succeeded in developing and bringing to market a core suite of technologies known as KOS (Knowledge Object Suite) based on its Multidimensional Semantic Spatial Indexing Technology.
You can register and receive a free, thematic map of your Web site. The company is located in Ottawa, Ontario. You can get more information here.
Stephen Arnold, April 6, 2009


