April 16, 2014
The dtSearch Desktop Demonstration Video on nlsblog.org shows how to setup and search with dtSearch for Windows. The 12 minute video begins with an introduction to dtSearch, which is able to “recognize text in over 200 common file types.” By indexing the locations of words in different files, dtSearch is able to build an almost limitless index of documents. The demo walks through the setup of dtSearch. After naming the index,
“It is important to keep in mind that when we add items here, dtSearch is not creating copies… but links to those files. A good practice is to put the files and folder that we want to run searches on into a single centralized location, before we create the index… all we need to do is add this discovery folder, and the subfolders and files will be automatically included…dtSearch reads the text in the linked files and creates a searchable words list.”
Then you are able to search which index to search through, and limit it to one case, or all cases. The word appears with a number, show how often it appears in the index. Then you can add the keyword to the search request to find the documents in which the word appears. You are able to preview a document, copy a file, and create a search report. The demo goes into great detail about all of the search options, and should certainly be viewed in full to learn the best methods, but it does not provide metrics for the time required to build the initial index or update it. These metrics are useful.
Chelsea Kerwin, April 16, 2014
April 11, 2014
A new data integration platform promises to simplify the process of deploying search-driven applications, save organizations time and money, and improve security. BA Insight posts, “BA Insight Announces Knowledge Integration Platform 2014 for Rapid Implementation of Search-Drive Applications.” No definition of “knowledge” is included, however.
The press release specifies:
“The BAI Knowledge Integration Platform turns enterprise search engines into knowledge engines by transforming the way information is found to get the right information to the right people at the right time. It has the flexibility to function as a comprehensive solution or be implemented in a phased approach to meet growing organizational needs. The platform consists of three robust engines:
*User Experience Engine – drives remarkable user experiences for finding and exploring knowledge or experts via an extensible engine and a library of powerful components
*Content Intelligence Engine – increases findability using automated classification, metadata generation, and text analytics
*Content Connectivity Engine – provides secure connectivity to a wide variety of content systems, enabling unified views of all knowledge assets”
The press release notes that several prominent global companies are using this platform, including the Apache Corporation. (No, that has nothing to do with open source software; it is a huge energy-exploration enterprise.) The write-up also emphasizes that the platform builds on an organization’s existing infrastructure to present users with an integrated view of their data.
BA Insight aims to make enterprise search more comprehensive and easier to use. Founded in 2004, the company is headquartered in Boston with offices in Chicago, Washington, DC, and Sacramento, California.
Cynthia Murrell, April 11, 2014
April 7, 2014
One of the major complaints about SharePoint is that users often have to leave the platform in order to accomplish basic tasks. SharePoint is getting closer to complete, and Microsoft is making some needed improvements. However, add-ons are also filling an important role in improving the user experience. Virtual Strategy covers one addition in their article, “SharePoint Now More Killer With PDF Document Converter; It’s No Fool’s Joke.”
The article begins:
“Today, PortalFront Tru Apps announces a new ‘Convert to PDF’ feature in SharePoint, bringing SharePoint a step closer to maturity . . . Converting documents from Word (doc, docx), Excel (xslx), PowerPoint and other formats to PDF directly in SharePoint libraries was not possible. The app also allows batch conversion and supports many other file types to PDF.”
Add-ons have been the key to SharePoint satisfaction according to many experts. Stephen E. Arnold is one of those experts, and he puts his thoughts down on the Web site, ArnoldIT.com. He covers a lot of SharePoint news and has found that user experience is highest when customization is at its best. But since many organizations cannot fully support internal customization, add-ons are key.
Emily Rae Aldridge, April 7, 2014
April 2, 2014
Documentation Toolkit for SharePoint 4.0 was released this week by Acceleratio Ltd. SharePoint 2007 is supported and new best practices and features were added. Read all the details in the PRWeb release, “Documentation Toolkit for SharePoint 4.0 – New SharePoint Best Practices, Enhanced Permissions Reports and Completely New Interface Design.”
The release says:
“Acceleratio Ltd., an innovative software development company, released a new version of Documentation Toolkit for SharePoint. Version 4.0 comes with improved Permissions Reports, a redesigned interface and an improved compare wizard. New Best Practices were added for more efficient analysis of the SharePoint farm configuration.“
Stephen E. Arnold of ArnoldIT.com has made a name for himself following and analyzing all things search, including SharePoint. But one thing is certain from the coverage; SharePoint gets more powerful and more complicated all at the same time. This opens a wide space for add-ons and value-added software that improve the user experience and customization of SharePoint without adding a lot of hassle.
Emily Rae Aldridge, April 2, 2014
March 24, 2014
SharePoint is improved by customization. Third-party add-ons are often the backbone of this customization, since SharePoint has become such a complex infrastructure. In the latest news, SharePoint is partnering with other vendors to increase efficiency. Read more in the Fierce Content Management article, “OpenText Brings Governance to SharePoint, SAP.”
The article begins:
“In one of the more odd product announcements in some time, three giants of enterprise software–OpenText, SharePoint and SAP–have come together around a governance, content management and an ERP solution. This three-headed monster is called SAP Content Management for Microsoft SharePoint by OpenText. You can view SAP data inside SharePoint or SharePoint content inside SAP, and OpenText takes care of the governance bits to make sure everything is done within the rules of the organization.”
Stephen E. Arnold has made a career out of reporting and analyzing all things search. His SharePoint coverage also points to the importance of customization, especially through add-ons. Read more on his Web site ArnoldIT.com.
Emily Rae Aldridge, March 24, 2014
March 19, 2014
Russian search leader Yandex describes its suite for Android developers in a company blog post, “Android Device Manufacturers Get Kitted Out with Yandex.Kit.” The post compares the Android operating system to a “car without the key” for mobile developers—the OS is free, but most of the mobile-device functionality we have come to expect is tangled up in a web of case-by-case agreements. Now, Yandex says their Kit represents that missing key. The write-up explains:
“Yandex.Kit is a customisable suite of mobile components available for most versions of Android OS. It has all the basics indispensable for the up-to-date mobile experience. Vendors selling their original Android devices in Russia can enjoy the full Yandex.Kit package, which currently includes an app store, launcher and dialer, browser, maps, a cloud app – 15 apps overall. OEMs targeting other markets can enjoy Yandex.Kit as a trio of Yandex products – Yandex.Shell UI, Yandex.Browser and Yandex.Store.
“And the best part is there are no fees. Yandex.Kit is distributed on a fee-free basis and performs well on virtually any hardware, including the not-so-powerful devices popular in Russia and the CIS. In addition, smartphones carrying Yandex.Kit can be easily branded under the manufacturer’s name.”
The post goes on to list a number of features developers should be excited about, complete with screenshots. For example, their smart dialer pulls data from Yandex’s Business Directory to identify commercial callers not already in a user’s contacts. They also extol the virtues of their mobile browser, calling it “smart, secure and easy-to-use.” Then there are the cloud service, the geolocation-compatible mapping API, and the Store populated with over 100,000 apps. See the article for details.
One of Europe’s largest internet companies, Yandex is the search engine Russians turn to the most. Folks in the Ukraine, Kazakhstan, Belarus, and Turkey also use the service. Yandex says it’s primary goal is to “make people happy” (a tad more specific than “don’t be evil,” but not by much). Launched in 2011, the company is headquartered in Moscow.
Cynthia Murrell, March 19, 2014
March 5, 2014
A team from Stanford is bringing machine learning to the masses, for free. Is this bad news for the for-a-fee text analytics vendors? Stanford Engineering announces, “Stanford Scientists Put Free Text-Analysis Tool on the Web.” Writers Andrew Meyers and Tom Abate explain:
“The etcML website is based on machine-learning techniques that were developed to analyze the meaning embodied in text, then gauge its overall positive or negative sentiment. To access this computational engine, users drag and drop text files into a dialog box. ‘We wanted to make standard machine learning techniques available to people and researchers who may not be able to program,’ said Richard Socher, a doctoral candidate in computer science at Stanford and lead developer of etcML. Socher said the new site gives researchers and citizen activists in fields ranging from political science to linguistics an easy way to analyze news articles, social media posts, closed-caption transcripts of television newscasts and other texts of possible interest.’All users have to do is copy and paste, or drop their text datasets into their browser and click,’ Socher said.”
Several Stanford-affiliated folks have already leveraged the beta version of etcML. Rebecca Weiss, who studies political polarization and media coverage in her doctoral work, uses the tool to classify words and phrases and to tease patterns from millions of articles and transcripts. Meanwhile, computational linguistics researcher Rob Voight has employed etcML to determine what factors make a Kickstarter pitch most successful. Computer science doctoral student Chinmay Kulkarni has also put the solution to good use; it helps him make short(er) work of test-grading for a free online course with about 2,000 students.
So, what will the general public make of this “free and powerful” drag-and-drop tool? I played around with it a bit, and the results are interesting. I think the team may still have some tweaking to do— I made a Twitter-sentiment-query on Elizabeth Warren (I know, my politics are showing), and it counted a tweet that read “Education is really important! More money for colleges! #Vote4Warren” as “negative.” Perhaps the for-profit machine learning vendors are safe for now. Check etcML out for yourself here and see what you think.
Cynthia Murrell, March 05, 2014
March 1, 2014
OpenText announces its new product: OpenText Tempo. The new file sharing collaboration platform is highlighted in the press release, “Social Collaboration Combined With Secure Files Sync and Share: Introducing OpenText Tempo.” OpenText describes Tempo as a project that required teamwork from all over the world.
OpenText Tempo will be able to:
“…provides an engaging user experience that combines the convenience of secure file sync and share with social collaboration and seamless integration to Content Server. It connects people with each other and with their content in a secure, compliant environment that enables open dialogues to take place, extending the value of content through the process of collaboration.”
It is the company’s first EIM application with integration for other products, including Tempo Note, Tempo Social, and Tempo Box. OpenText also says it improves Web site management, web experience management, and portal applications.
It is a commercially secure file sharing and social platform. Will Dropbox and other free services be able to something similar on at an appealing price point?
February 7, 2014
Whether Autonomy’s product success is true or false, as proprietary software it comes with a large price tag. The average small business or user cannot afford to purchase HP Autonomy’s IDOL Crawler. Open source is the best alternative, but for the longest time you could not get software comparable to IDOL Crawler. Norconex says that has changed in the article, “An Open Source Crawler For Autonomy IDOL.” Norconex released an HP Autonomy IDOL Committer for its open source Web crawler Norconex HTTP Collector.
The HTTP Collector is available for Github. The developer encourages people to download it and contribute to the project. Its features are mostly the same as those from HP Autonomy HTTP Connector.
The article states:
“Most key features of HP Autonomy HTTP Connector are available in Norconex HTTP Collector, including document changes detection on incremental crawls and purging documents from IDOL for deleted web pages. New ones are introduced, such as having different hit interval at different time of the day and the ability to overwrite pretty much every part of the web crawling flow with your own implementation logic. The IDOL Committer has been tested on diverse public and internal web sites with great performance.”
We can learn from the open source community that if there is not a piece of software you want, all you have to do is wait until a developer makes it or you can take the initiative to do it yourself.
Whitney Grace, February 07, 2014
February 4, 2014
SharePoint migration is a consistent problem among SharePoint users. Large updates occur on the Microsoft schedule of every three years and are disruptive at best, devastating at worst. Metalogix is hoping to help tackle this problem with their new product. Read more in the GCN article, “Metalogix Tool Reduces Risk and Cost of SharePoint Upgrades.”
The article begins:
“Preparing for and conducting a platform or software upgrade can be expensive, time-consuming and disruptive for government agencies. Administrators need a clear picture of the current potential risks before they begin a migration of important content. To help agencies better prepare for inevitable upgrades, Metalogix added a new tool to its suite of SharePoint applications that can help administrators reduce the risk and cost associated with migrations in SharePoint.”
Stephen E. Arnold is a longtime leader in search and often writes on ArnoldIT.com about the latest SharePoint trends and troubles. It is clear from the media coverage that SharePoint isn’t going away, and yet users are clear about their frustration on a number of levels. Metalogix hopes to seize upon both angles and help existing SharePoint customers migrate stress-free.
Emily Rae Aldridge, February 4, 2014