
Inforbix Cracks Next Generation Search for SolidWorks Users

February 13, 2012

Search means advertising to most Google users. In an enterprise—according to the LinkedIn discussions about enterprise search—the approach is anchored in the 1990s. The problem is that finding information requires a system which can handle content types that are of little interest to lawyers, accountants, and MBAs running a business today.

Without efficient access to such content as engineering drawings, specifications, quality control reports, and run-of-the-mill office information, costs go up. What’s worse is that more time is needed to locate a prior version of a component or to find the supplier who delivered work to the specification, on time and on budget. So expensive professionals end up performing what I call Easter egg hunt research. The approach involves looking for colleagues, paging through lists of file names, and the “open, browse, close” approach to information retrieval.

Not surprisingly, the so-called experts steer clear of pivotal information retrieval problems. Most search systems pick the ripe apples which are close to the ground. This means indexing Word documents, the versions of information in a content management system, or email.

I learned today that Inforbix, a company we have been tracking because it takes search to the next level, has rolled out two new products. These innovations are data apps which seamlessly aggregate product data from different file types, sources, and locations. The new Inforbix apps will help SolidWorks users get more out of their product data and become more productive while improving decision-making. Plus, Inforbix said that it would expand the data access to SolidWorks EPDM, making it possible for SolidWorks customers to get more from data managed by their PDM system.

The two products are Inforbix Charts and Inforbix Dashboard. Both complement Inforbix Tables, which was released in October 2011.

Oleg Shilovitsky, founder of Inforbix, told me:

Manufacturing companies are drowning in the growing amount of product data generated and found within different file types, sources, and company data-silos. They are increasingly using a mix of vendor packages and solutions, all which generate, contain, manage, or store product data, creating a hodgepodge of resources to be combed through. Product data generated in a typical manufacturing company can be both unstructured (valuable BOM and assembly information spread out across different CAD drawings) and structured (CAD drawings within a PDM or PLM system). Our apps are tools that address specific product data tasks such as finding, re-using, and sharing product data. Inforbix can access product data within PDM systems such as ENOVIA SmarTeam and Autodesk Vault and make it available in meaningful ways to CAD and non-CAD users.

When I reviewed the system, I noted that Inforbix’s apps utilize product data semantic technology that automatically infers relationships between disparate sources of data. For example, Inforbix can semantically connect or link a SolidWorks CAD assembly found within EPDM with a related Excel file containing a BOM table stored on a file server in another department.

Inforbix Charts visualizes and presents data saved from Inforbix Tables. The product data is presented in charts that include information to help engineers better manage and run processes by identifying trends and patterns and improving data control. For example, Inforbix Charts visually presents the approval statuses of CAD and ECO documents by author, date approved, last modified date, etc.

Inforbix Dashboard dynamically collects and presents important statistics about engineering and manufacturing data and processes, such as how many versions of a particular CAD drawing currently exist, how many design revisions it took to complete a CAD drawing, or the number of ECOs processed on time. Easy and intuitive to use, Inforbix Dashboard is an ideal tool for project managers.

SolidWorks users can access Inforbix apps and their product data online. Current Inforbix customers can immediately begin using the Inforbix iPad app, available for free on the Apple App Store at http://www.inforbix.com/inforbix-mobile-search-for-cad-and-product-data-on-the-ipad/. Account access taps existing Inforbix credentials. New users are encouraged to register with Inforbix to enable the iPad app to access product data within their company. The apps soon will be available on Android devices.

A video preview of the iPad app is posted at http://www.inforbix.com/inforbix-ipad-app-first-preview/. For more information on Inforbix apps, visit http://www.inforbix.com.

Inforbix is a company on the move.

Stephen E Arnold, February 13, 2012

Sponsored by Pandia.com

MapMaking Used to Prevent Public Health Threats

February 10, 2012

Science Blogs recently reported on a new tool that blows Google Maps out of the water in the article, “New Mapping Tools Bring Public Health Surveillance to the Masses.”

According to the article, HealthMap is a team of researchers, epidemiologists and software developers at Children’s Hospital Boston who use online sources to track disease outbreaks and deliver real-time surveillance on emerging public health threats. They also enlist local residents to help with research.

Blogger Kim Krisberg writes:

“HealthMap, which debuted in 2006, scours the Internet for relevant information, aggregating data from online news services, eyewitness reports, professional discussion rooms and official sources. The result? The possibility to map disease trends in places where no public health or health care infrastructures even exist, Brownstein told me. And because HealthMap works non-stop, continually monitoring, sorting and visualizing online information, the system can also serve as an early warning system for disease outbreaks.”

Mapmaking and public health are hardly strangers. Public health practitioners use maps to guide interventions. Despite the complexity of most disease outbreaks, maps can still help health professionals raise public awareness about prevention and target interventions in ways that make the most of limited resources.

Jasmine Ashton, February 10, 2012

Sponsored by Pandia.com

Pingar Sets Up Shop in Silicon Valley

February 1, 2012

Pingar, smaller than Google’s catering staff, sets up shop in Silicon Valley. The Bay of Plenty Times announces, “Tauranga Firm Sets Up Silicon Valley Base.” The New Zealand publication reports that co-founders Peter and Jacqui Wren-Hilton were impressed by the size of the big dogs’ campuses when they visited. Pingar follows three other New Zealand tech companies into Silicon Valley: Endace, Xero, and SLI Systems.

In addition to the Valley, Pingar has offices in two New Zealand locations and in London, Hong Kong, Bangalore, and, soon, Singapore. Its innovative search engine works by asking specific questions. The company also offers an API, with 18 components accessible to developers. It is looking to break into the scanner market, with a unique product that automatically applies metadata to scanned documents. Yes, that would be helpful!

The company was recognized by the Silicon Valley Association of Startup Entrepreneurs as one of 30 hot emerging tech companies from around the world. Pingar is growing into its success; the article notes:

Twelve months ago Pingar employed 12 people, now the number is 30 and Mr Wren-Hilton predicts the staff will double to 60 by the end of next year; involving 20 in research and development, and 40 in business development, marketing and support services.
“Twenty-five of them will be based in Auckland and Tauranga, and 35 will be overseas, including seven in Silicon Valley.”

Nicely played, Pingar.

Cynthia Murrell, February 1, 2012

Sponsored by Pandia.com

File Extension List

January 28, 2012

Need a handy list of all known file extensions and types? Look no further. Nosa Lee at Seek The Sun Slowly has kindly provided such a list in “The Known File Extensions/ Types References – A” through “Z.” In a translation from the original Chinese, the listing explains:

Now, I collected all the known file extensions/types for your reference, I grouped them according to the first character due to there are too many file extensions/types.

Yes, there’s a page for each letter, and even “Number” and “Symbol.” To download them all in one fell swoop, click here.

I knew there were a lot of file types, but seeing them all in one place really puts the matter into perspective.

Cynthia Murrell, January 28, 2012

Sponsored by Pandia.com

Talend Pitches Holistic Integration

December 21, 2011

Connectors get some new lingo; holistic integration is a term we learned from Talend’s press release, “Talend V5: Democratizing Holistic Integration.” The company defends its coinage of the term:

Frankly, IT often uses loosely some terms from the general corpus. But in this case, holistic does the trick. . . . The promise of Talend v5 is to enable IT organizations to converge traditionally disparate integration efforts and practices through a common set of products, tools and best practices. When an organization deploys Talend v5, it will deploy essentially one platform, regardless of the integration need: data integration, application integration, process integration.

That does fit the definition of the term, but it is a little grand, don’t you think? Hmm, maybe not in a field titled “Big Data.”

Talend positions this release as the result of the changes its products have undergone since it bought the German Sopera this time last year. The company is quick to point out that this comprehensive approach does not result in bloatware. Each product included in the platform works independently; customers need to deploy only the parts they need.

The write up emphasizes that Talend’s products are still based on the open source underpinnings on which they were founded. The company boasts of being a leader in the open source data management market.

Cynthia Murrell, December 21, 2011

Sponsored by Pandia.com

Predictions on Big Data Miss the Real Big Trend

December 18, 2011

Athena the goddess of wisdom does not spend much time in Harrod’s Creek, Kentucky. I don’t think she’s ever visited. However, I know that she is not hanging out at some of the “real journalists’” haunts. I zipped through “Big Data in 2012: Five Predictions”. These are lists which are often assembled over a lunch time chat or a meeting with quite a few editorial issues on the agenda. At year’s end, the prediction lunch was a popular activity when I worked in New York City, which is different in mental zip from rural Kentucky.

The write up churns through some ideas that are evident when one skims blog posts or looks at the conference programs for “big data.” For example—are you sitting down?—the write up asserts: “Increased understanding of and demand for visualization.” There you go. I don’t know about you, but when I sit in on “intelligence” briefings in the government or business environment, I have been enjoying the sticky tarts of visualization for years. Nah, decades. Now visualization is a trend? Helpful, right?

Let me identify one trend which is, in my opinion, an actual big deal. Navigate to “The Maximal Information Coefficient.” You will see a link and a good summary of a statistical method which allows a person to process “big data” in order to determine if there are gems within. More important, the potential gems pop out of a list of correlations. Why is this important? Without MIC methods, the only way to “know” what may be useful within big data was to run the process. If you remember guys like Kolmogorov, the “we have to do it because it is already as small as it can be” issue is an annoying time consumer. To access the original paper, you will need to go to the AAAS and pay money.

The abstract for “Detecting Novel Associations in Large Data Sets” by David N. Reshef, Yakir A. Reshef, Hilary K. Finucane, Sharon R. Grossman, Gilean McVean, Peter Turnbaugh, Eric S. Lander, Michael Mitzenmacher, Pardis C. Sabeti, Science, December 16, 2011, is:

Identifying interesting relationships between pairs of variables in large data sets is increasingly important. Here, we present a measure of dependence for two-variable relationships: the maximal information coefficient (MIC). MIC captures a wide range of associations both functional and not, and for functional relationships provides a score that roughly equals the coefficient of determination (R^2) of the data relative to the regression function. MIC belongs to a larger class of maximal information-based nonparametric exploration (MINE) statistics for identifying and classifying relationships. We apply MIC and MINE to data sets in global health, gene expression, major-league baseball, and the human gut microbiota and identify known and novel relationships.

Stating a very interesting although admittedly complex numerical recipe in a simple way is difficult, but I think this paragraph from “The Maximal Information Coefficient” does a very good job:

The authors [Reshef et al] go on showing that that the MIC (which is based on “gridding” the correlation space at different resolutions, finding the grid partitioning with the largest mutual information at each resolution, normalizing the mutual information values, and choosing the maximum value among all considered resolutions as the MIC) fulfills this requirement, and works well when applied to several real world datasets. There is a MINE Website with more information and code on this algorithm, and a blog entry by Michael Mitzenmacher which might also link to more information on the paper in the future.
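
The gridding recipe in that paragraph can be made concrete with a rough Python sketch. This is not the MINE code from Reshef et al. It is a simplified illustration that only tries equal-frequency grids, whereas the published method searches for the best partition at each resolution. The function names (`equal_frequency_bins`, `mutual_information`, `approx_mic`) are hypothetical, and the `alpha=0.6` exponent that caps the grid size is an assumption based on my recollection of the paper's default.

```python
import math
from collections import Counter


def equal_frequency_bins(values, n_bins):
    """Assign each value a bin index so that bins hold roughly equal counts.

    Tied values may land in different bins; this sketch ignores that detail.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * n_bins // len(values)
    return bins


def mutual_information(xb, yb):
    """Mutual information, in bits, between two sequences of bin labels."""
    n = len(xb)
    joint = Counter(zip(xb, yb))
    px = Counter(xb)
    py = Counter(yb)
    return sum(
        (c / n) * math.log2(c * n / (px[a] * py[b]))
        for (a, b), c in joint.items()
    )


def approx_mic(x, y, alpha=0.6):
    """Approximate MIC: the largest normalized mutual information over small grids.

    Simplification (assumption): each resolution uses an equal-frequency grid
    instead of the optimized partition described in the paper.
    """
    if len(x) != len(y) or len(x) < 4:
        raise ValueError("x and y must have the same length, at least 4")
    n = len(x)
    max_cells = max(4, int(n ** alpha))  # cap on columns * rows
    best = 0.0
    for nx in range(2, max_cells // 2 + 1):
        xb = equal_frequency_bins(x, nx)
        for ny in range(2, max_cells // nx + 1):
            yb = equal_frequency_bins(y, ny)
            # Normalize so that grids of different sizes are comparable.
            score = mutual_information(xb, yb) / math.log2(min(nx, ny))
            best = max(best, score)
    return best
```

Under these assumptions, a deterministic relationship such as a straight line scores close to 1, while two unrelated random columns score much lower. That gap is what lets the potential gems pop out of a long list of variable pairs.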

Another take on the MIC innovation appears in “Maximal Information Coefficient Teases Out Multiple Vast Data Sets”. Worth reading as well.

Forbes will definitely catch up with this trend in a few years. For now, methods such as MIC point the way to making “big data” a more practical part of decision making. Yep, a trend. Why? There’s a lot of talk about “big data” but most organizations lack the expertise and the computational know-how to perform meaningful analyses. Similar methods are available from Digital Reasoning and the Google love child Recorded Future. Palantir is more into the make-pictures world of analytics. For me, MIC and related methods are not just a trend; they are the harbinger of processes which make big data useful, not a public relations, marketing, or PowerPoint chunk of baloney. Honk.

Stephen E Arnold, December 18, 2011

Sponsored by Pandia.com, a company located where high school graduates actually can do math.

Social Media and an Enterprise Strategy

December 9, 2011

Social media is an essential ingredient in the success of any organization hoping to excel in today’s market. However, if not approached intentionally and with a plan, social media efforts can seem daunting and unorganized. Not as linear as traditional IT needs, social media has to be tackled with a different strategy. Jonathan Gourlay gives his insight in “Effective social media strategy: A must for success in a social world.”

“Enterprises large and small are embracing social media as a way for employees, partners and customers to collaborate and communicate, but many are doing so without being fully prepared. It’s an issue that many are finding costly in a number of ways.”

Social media must also be tied in to an organization’s overall enterprise strategy. While the two don’t initially seem to play well together, if combined with a smart enterprise solution, both efforts can be done with more ease.

Gourlay continues:

“The fourth level is described as the stage of enablement. ‘You must get here to scale,’ Owyang said. Enterprises continually measure their use of social media at this stage and are empowering workers rather than constraining them. The more advanced organizations at this level implement their social strategy across corporate functions, ignoring business unit silos.”

Take a solution like Fabasoft Mindbreeze and its Folio Cloud. ‘Fabasoft Folio Cloud enables quick, secure and mobile collaboration both internally and between international companies,’ explained Michael Hadrian. ‘Business processes with customers and partners cannot be realized any quicker or more cost effectively.’

Combining internal and external communications is key to social media, ensuring that dreaded social media crises don’t occur. Such a solution allows an overall view of social media communications, inviting participation from a variety of employees, but ensuring all efforts are moving in the same direction. When tackling an overall social media initiative, planning is key, and solutions such as those offered by Fabasoft Mindbreeze can help.

Emily Rae Aldridge, December 9, 2011

Sponsored by: Pandia.com

SlideShark Seamlessly Brings PowerPoint to the iPad

November 27, 2011

If you have struggled with moving content to the iPad, you may want to check out SlideShark. Although PowerPoint continues to lose ground to PDFs of presentations, PowerPoints are used by some search wizards. (If you want to search Google for PowerPoints, you may have noticed that there are fewer fresh files than in years past. To locate presentations, you may want to check out SlideShare or similar services.)

BrainShark, a creator of cloud based software for video presentations, has recently released a new iPad app called SlideShark.

A recent KillerStartups post “SlideShark.com – View Presentations on Your iPad” asserted:

SlideShark is an application that lets you watch PowerPoint presentations on your iPad. This app (which you can download for free) lets you do that by uploading the presentations you want to view to your BrainShark account, or to a SlideShark account. That is, you can log in using your already-existing Brainshark username and password, or sign up for a SlideShark account of its own. It’s all the same in the end, as you upload your presentations online for them to be converted into something your iPad can render more than smoothly.

Since many people use their iPad for both business and pleasure, and many businesses use PowerPoint, it seems only natural that a company would come up with an app to make the transition seamless.

The more mobile we get, the more often we have to give up some of the activities to which we have become accustomed. SlideShark is solving at least one problem in a way that is accessible and fun.

Jasmine Ashton, November 27, 2011

Backup Alternative for IDOL Content Engine

November 17, 2011

WorkSite Zen offers a useful Autonomy tip with “Schedule IDOL Backups with Task Scheduler and PowerShell.” Poster JB Trexler feels the default IDOL content engine backup method can be inconvenient:

The ‘out of the box’ backup method requires you to set a section in each config file with various parameters. This method works well but it does present some challenges around planned interruptions and reporting. What if you want to skip the backups this weekend because of a maintenance window? What if you want a log file of just the backups or an email notification upon completion / failure? In addition to the backups, you also need to copy .cfg and .db files.

Trexler goes on to describe his method for using Windows Task Scheduler and Windows PowerShell to schedule your backups, complete with scripts and screenshots. Doing so can provide more flexibility; you gain the ability to temporarily disable backups, for example, or to send a completion/failure email, or to log to the Windows Event Log. Need more meat? See the source document for more detail.
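
For readers who want a feel for the moving parts before opening the source, here is a minimal Python sketch of the same idea. It is not Trexler's PowerShell script and it does not call any IDOL command. It covers only the file copy step and a skip switch, under assumptions of my own: the function name `run_backup`, the flag file name `skip_backup.flag`, and the logger name `idol_backup` are all hypothetical.

```python
import logging
import shutil
from pathlib import Path


def run_backup(source_dir, backup_dir, skip_flag="skip_backup.flag"):
    """Copy .cfg and .db files to backup_dir unless a skip flag file exists.

    Returns the names of the copied files. An empty list means the run was
    skipped or no matching files were found.
    """
    source = Path(source_dir)
    target = Path(backup_dir)
    log = logging.getLogger("idol_backup")  # hypothetical logger name

    # A flag file lets an admin pause backups during a maintenance window.
    if (source / skip_flag).exists():
        log.info("Skip flag found in %s; backup not run", source)
        return []

    target.mkdir(parents=True, exist_ok=True)
    copied = []
    for path in sorted(source.iterdir()):
        if path.is_file() and path.suffix.lower() in {".cfg", ".db"}:
            shutil.copy2(path, target / path.name)  # copy2 keeps timestamps
            copied.append(path.name)
    log.info("Backed up %d file(s) to %s", len(copied), target)
    return copied
```

A scheduler such as Windows Task Scheduler could invoke a script like this on a timer. Creating the flag file pauses the job without touching the schedule, and the log records give a backup-only trail that could feed an email notification.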

Cynthia Murrell, November 17, 2011

Sponsored by Pandia.com

Softpedia Presents Another All-Encompassing Freeware Clipboard

November 13, 2011

Softpedia now features the Spartan Lite Multi-Clipboard as a free software download. Based on the website’s description, it appears to have some handy features. It reminds me of Evernote minus the graphics editors.

This software sells itself as being more than a clipboard application; it claims to be a complete information center for your computer. According to the feature list, it can help you remember a cornucopia of different things: addresses, phone numbers, to-do lists, graphics, recipes, etc. When it boils down to it, the program will allow you to see if you’ve typed the same thing before, browse photos and paste them into an email, and perform other typical clipboard app functions.

The site description also mentions this version’s shortcomings. It says:

“The Lite version has no time limit and no nags. The only difference between it and the full version is that it can only store 500 permanent clips whereas the full version can store 10,000.”

The purpose of this freeware sounds great in theory, but in reality it is another one of those jack-of-all-trades, master-of-none applications.

Megan Feil, November 13, 2011
