July 21, 2014
The article titled Text Analytics Company Linguamatics Boosts Enterprise Search with Semantic Enrichment on MarketWatch discusses the launch of 12E Semantic Enrichment from Linguamatics. The new release allows for the mining of a variety of texts, from scientific literature to patents to social media. It promises faster, more relevant search for users. The article states,
“Enterprise search engines consume this enriched metadata to provide a faster, more effective search for users. I2E uses natural language processing (NLP) technology to find concepts in the right context, combined with a range of other strategies including application of ontologies, taxonomies, thesauri, rule-based pattern matching and disambiguation based on context. This allows enterprise search engines to gain a better understanding of documents in order to provide a richer search experience and increase findability, which enables users to spend less time on search.”
Whether they are spinning semantics for search, or if it is search spun for semantics, Linguamatics has made their technology available to tens of thousands of users of enterprise search. Representative John M. Brimacombe was straightforward in his comments about the disappointment surrounding enterprise search, but optimistic about 12E. It is currently being used by many top organizations, as well as the Food and Drug Administration.
Chelsea Kerwin, July 21, 2014
June 18, 2014
Another semantic system turns out the lights. SemanticWeb hosts a guest post from the founders of Sindice titled, “End of Support for the Sindice.com Search Engine: History, Lessons Learned, and Legacy.” The article delves into a wealth of technical details. It opens, however, with this modest introduction:
“Since 2007, Sindice.com has served as a specialized search engine that would do a crazy thing: throw away the text and just concentrate on the ‘markup’ of the web pages. Sindice would provide an advanced API to query RDF, RDFa, Microformats and Microdata found on web sites, together with a number of other services. Sindice turned useful, we guess, as approximately 1100 scientific works in the last few years refer to it in a way or another.”
The team decided to end support for the specialized search engine in order to focus on serving enterprise users. Besides, they say, their vision has been realized. They write:
“With the launch in 2012 of Schema.org, Google and others have effectively embraced the vision of the ‘Semantic Web.’ With the RDFa standard, and now even more with JSON-LD, richer markup is becoming more and more popular on websites. While there might not be public web data ‘search APIs,’ large collections of crawled data (pages and RDF) exist today which are made available on cloud computing platforms for easy analysis with your favorite big data paradigm.”
The account begins at the beginning, with the team’s first goal of developing a simpler API, and ends with their transition to the startup SindiceTech. In between are interesting details, like a description of their 60-machine “Webstar” operations cluster and details on how they leveraged Hadoop for their RDF analytics. We may be sad to see support for Sindice.com go, but at least the team has shared some of their wisdom on the way out.
Cynthia Murrell, June 18, 2014
May 8, 2014
RSuite content management users can now can tap into TEMIS, we learn from “RSuite CMS Leverages TEMIS’s Content Enrichment Capabilities to Deliver a Powerful Semantic Solution.” The partnership makes TEMIS’s semantic enrichment capabilities available to RSuite’s customers in the publishing, government, and corporate arenas. The deal was announced at this year’s MarkLogic World conference, held April seventh in San Francisco; both companies are MarkLogic partners.
The press release elaborates:
“RSuite CMS provides an intuitive user interface that minimizes actions required to execute complex searches across an entire set of content. The solution can globally apply metadata, dynamically organize massive amounts of documents into collections, package and distribute content to licensing partners, and enables customers to meet their multi-channel publishing goals.
“By leveraging TEMIS’s Luxid® Content Enrichment Platform, RSuite CMS can enable customers to automatically enrich their content with domain-specific metadata directly within their publishing workflows. This enables faster and more scalable content indexing, improved metadata consistency and governance, more efficient authoring, and more powerful search and discovery features within customer applications and portals.”
With its focus on publishing and media, RSuite strives to meet today’s ever-evolving publication challenges. The company serves such big names as HarperCollins, Audible, and Oxford University Press. RSuite was launched in 2000 and is located in Audubon, Pennsylvania.
With its collaborative platform, TEMIS adds domain-specific metadata to clients’ data, allowing publishers to supply more relevant information to their own audiences. TEMIS maintains several offices across Europe and North America.
Cynthia Murrell, May 08, 2014
May 1, 2014
Actonomy’s slogan is: “We simply search smarter!” Actonomy’s claim comes from its semantic technology to optimize human resources recruitment processes and findability. It is a big claim to make and if challenged would Actonomy be able to back it up? The company’s most recent press release, “Actonomy Now Part Of A Larger HR Group” proves that its semantic search technology was one of the leading HR products in the European market.
As a result, Actonomy has joined a Belgian HR Group owned by the Peumans family. The group includes other HR software and service companies, including Cognsis, Prato, and SAP. Actonomy has been a star product for over seven years and it is one of the groundbreaking developers in matching technology and ontology based search. Joining the Belgian HR Group gives them the ability to increase their client list and extend their service offerings:
“Thanks to Actonomy’s technology, Prato can extend its service offering of HRM related processes and include in its service offering Actonomy’s semantic searching and matching technology. Actonomy on the other hand will be able to bring its software to perfection thanks to Prato’s broad know how allowing us to launch a suite of new services packaged on top of our core semantic technology. A win win situation for both companies!”
While these companies will remain separate, they will exchange their technologies to benefit each other. It kind of sounds like open source, except they are remaining proprietary companies.
April 3, 2014
Armadillos are not native to France, but the Armadillo digital resources management company is. If you are curious to learn more about the French company peruse the “Company Overview” with a little assistance from Google Translate. Armadillo was founded in 1998 and has since acquired a very long and prestigious client list.
Armadillo’s products offer a range of services that include research and development of information technology, custom data solutions, and packages for various digital content. The products are, of course, advertised as a big data solution and can be customized for any data type, content, and organizational method.
The director describes his products as:
“Armadillo packages are integrated into the information systems of companies and other organizations to facilitate data exchange between former silos. This creates repositories harmonized content easily shared and guaranteed “up to date “. Our solutions have a broad functional coverage with excellent performance for near-zero operating costs. Our technology is based on the latest innovations proposed by the Semantic Web and Big Data.”
It looks like another big data player peddling the usual solutions, however, they have been around longer than other big data startups, so longevity and reliability is on their side.
March 23, 2014
What has happened to Lingway, purveyor of vertical semantic solutions for search and analysis? According to a press release on its Web site: “Lingway Chooses Toledo And The Castilla-La Mancha Region As Its Operating Base In Spain.” Lingway has moved to Spain to:
“ ‘The Spanish market is important for Lingway, but the fact that it will give us access to the markets of Latin America makes it even more valuable,” says Bernard Normier, Lingway’s CEO. “One of the main reasons we chose Castilla-La Mancha as our headquarters was that the local authorities were able to put us in touch with the other actors in the region (companies, consultants, universities and government organizations) and provide us with the assistance and support required for our project.”
While Lingway may be brushing up on their Spanish, it was also a foothold for another company. Lingway appears to be part of Eptica, evidenced by a the EpticaLingway blog post,”The Lingway Team Is Pleased To Join Eptica And Will Continue To Serve Its Customers.” Eptica acquired Lingway in 2012 as a way to expand into France, strengthen its research and development investments, and pursuer further international growth.
Eptica has integrated Lingway’s technology to bolster their own products. Eptica has a SaaS to manage online reputation and another software for LEA CV dedicated teams for recruitment companies. Moving and sold, technology companies change constantly.
March 8, 2014
Ontotext delivers very interesting services to their clients. All of their products are associated with semantic technology and utilizing big data to benefit its users. On their Web site, the company describes itself as:
“Ontotext develops a unique portfolio of core semantic technologies. Our RDF engine powers some of the biggest world-renowned media sites. Our text-mining solutions demonstrate unsurpassed accuracy across different domains – from sport news to macro-economic analysis, scientific articles and clinical trial reports. We enable the next generation web of data and we can efficiently extract information from today’s structured web – be it recipes, adverts or anything else.”
It offers services for job extraction, hybrid semantics, and semantic publishing for industries such as life sciences, government, recruitment, libraries, publishing, and media. Ontotext has a range of products to help people harness semantic technology. The most interesting to us is the Semantic Biomedical Tagger that is described as an extraction system that creates semantic annotations in biomedical texts. Ontotext also has the requisite search engine and semantic database. Its product line is fairly robust and we intend to keep an eye on its offerings.
March 7, 2014
Good news for Expert System! According to their press release archive, “Expert Systems Raise $27 Million In IPO.” Expert System specializes in semantic technology. The $27 million was raised in the company’s IPO on the Italian stock exchange AIM Italia, making it the largest in AIM history. With the new funding, Expert System plans to invest in its US firms with a new San Francisco office and additional sales and technical staff.
Expert System saw record growth in 2013. The company offered a range of new products, including its first semantic intelligence API and an end-to-end taxonomy management and categorization platform.
Expert System expects to increase its size in 2014:
“The growing reliance on information to solve business problems is placing increasing demands on software to enhance the effectiveness of search, capture weak signals in information streams and support customer interactions. Expert System is responding to these needs by increasing development of integrations and connectors that brings the power of its semantic technology to existing platforms and traditional applications.”
With that amount of funding and decent product line, Expert System will continue to grow, especially as semantic technology demand rises.
January 27, 2014
We learned about a new semantic search engine. A public demonstration is available at http://www.asknet.ru/EN/index.htm. Some chatter about the system appeared on LinkedIn. Like many of the next-generation search systems, there were some questions and comments from the “experts” who participate in the LinkedIn search discussions.
According to the Web site:
AskNet search technology is its main product and the focus of the commercial licensing and development. The search engine combines the speed of an index with the functionality of linguistic analysis. The AskNet search engine reverses the search result process. Traditionally, search engines provide links to large numbers of documents that contain reviews. They leave the users to hunt their answers in thousands of pages and millions of words. AskNet`s linguistic analysis makes it possible to provide meaningful answers to searches as quickly as traditional search engines. No linking required!
One LinkedIn expert pointed out:
AskNet Search ( online service asknet.ru) is the demo version. Not all algorithms are implemented for asknet.ru. All of them are implemented in the enterprise search engine. AskNet Search realized metasearch functions using snippets from Google. These text snippets are not whole sentences. Therefore, the quality of linguistic search AskNet Search could be better, when it used sentences for search, rather than a snippets from Google.
We suggest running some test queries and determining if the system delivers useful results. Keep in mind that a technology demonstration is usually set up to make it easy to get a “feel” for the basics of a system.
With regard to semantics and analytics, the supporters of today’s hottest technologies often are like supporters of the Liverpool football team. The coach is usually wrong and one or two players are terrible. The team concept, however, is one to support to the death. Rational? Nah. Part of today’s standard operating procedure? You bet.
My view is:
- Many vendors are recycling old algorithms with a Project Runway touch up. The basic design, however, is recognizable as a cute little red carpet number. Innovation is a bow or a tuck.
- Some so-called experts (folks I describe as poobahs, azure chip consultants, of people with a dog in the fight) see their clients’ products as truly wonderful innovations. The notion that a researcher in 1980 hit upon a method and created a product based on that method is of little or no consequence. Who cares what Julius Caesar said after the battle of Alesia. Ancient history.
- Prospects may not be looking for a better search solution. Prospects may be looking for a system that is less of a problem than the incumbent solution. Therefore, the procurement team is trying to keep their paycheck, not revolutionize information retrieval.
- Many systems work only if the user knows what he or she is looking for. Predictive search (go with the search history and the norm for a cluster) is good enough. Who has time to do deep dive research in today’s rush-rush-rush business climate.
The buzzword blizzard makes it difficult to figure out what system delivers what. I know I am easily confused, and my hunch is that others may face the same hurdle. Will Sochi feature a confusion jump involving leaps of faith over search vendor claims?
Stephen E Arnold, January 27, 2014
January 22, 2014
Did you know that there was an open source version of ClearForest called Calais? Neither did we, until we read about it in the article posted on OpenCalais called, “Calais: Connect. Everything.” Along with a short instructional video, is a text explanation about how the software works. OpenCalais Web Service automatically creates rich semantic metadata using natural language processing, machine learning, and other methods to analyze for submitted content. A list of tags are generated and returned to the user for review and then the user can paste them onto other documents.
The metadata can be used in a variety of ways for improvement:
“The metadata gives you the ability to build maps (or graphs or networks) linking documents to people to companies to places to products to events to geographies to… whatever. You can use those maps to improve site navigation, provide contextual syndication, tag and organize your content, create structured folksonomies, filter and de-duplicate news feeds, or analyze content to see if it contains what you care about.”
The OpenCalais Web Service relies on a dedicated community to keep making progress and pushing the application forward. Calais takes the same approach as other open source projects, except this one is powered by Thomson Reuters.