Informatica: An Old Dog Is Trying to Learn New Tricks?

August 20, 2020

Old dogs. Many people have to pause a moment when standing. Balancing is not the same when one is getting old. Others have to extend an arm, knee, or finger slowly. Joints? Don’t talk about those points of failure to a former athlete. Can bee pollen, a vegan diet, a training session with Glennon Doyle, or an acquisition do the trick?

Informatica Buys AI Startup for Entity and Schema Matching” explains a digital rejuvenation. The article reports:

Informatica’s latest acquisition extends machine learning capabilities into matching of data entities and schemas.

Entities and schemas are important when fiddling with data. I want to point out that Informatica was founded in 1993 and has been in the data entities and schema business for more than a quarter century. Obviously the future is arriving at the venerable software development company.

The technology employed by Green Bay Technologies is what the article calls “Random Forest” machine learning. The article explains that Green Bay’s method possesses:

the ability to handle more diverse data across different domains, including semi-structured and unstructured data, and a crowd-sourcing approach that improves performance.

The Green Bay method employs:

a machine learning approach where multiple decision trees are run, and then subjected to a crowd sourced consensus process to identify the best results. It is a supervised approach where models are auto generated after the user applies some declarative rules – that is, he or she labels a sample set of record pairs, and from there the system infers “blocking rules” to build the models.

Informatica will add Green Bay’s capabilities to its existing smart software engine called CLAIRE.

The write up does not dig into issues related to performance, over fitting, or dealing with rare outcomes or predictors.

Glennon Doyle does not dwell on her flaws either.

Stephen E Arnold, August 20, 2020

Comments

Comments are closed.

  • Archives

  • Recent Posts

  • Meta