May 1, 2015
Enterprise search is limited to how well users tag their content and the preloaded taxonomies. According Tech Target’s Search Content Management blog, text analytics might be the key to turning around poor enterprise search performance: “How Analytics Engines Could Finally-Relieve Enterprise Pain.” Text analytics turns out to only be part of the solution. Someone had the brilliant idea to use text analytics to classification issues in enterprise search, making search reactive to user input to proactive to search queries.
In general, analytics search engines work like this:
“The first is that analytics engines don’t create two buckets of content, where the goal is to identify documents that are deemed responsive. Instead, analytics engines identify documents that fall into each category and apply the respective metadata tags to the documents. Second, people don’t use these engines to search for content. The engines apply metadata to documents to allow search engines to find the correct information when people search for it. Text analytics provides the correct metadata to finally make search work within the enterprise.”
Supposedly, they are fixing the tagging issue by removing the biggest cause for error: humans. Microsoft caught onto how much this could generate profit, so they purchased Equivio in 2014 and integrated the FAST Search platform into SharePoint. Since Microsoft is doing it, every other tech company will copy and paste their actions in time. Enterprise search is gull of faults, but it has improved greatly. Big data trends have improved search quality, but tagging continues to be an issue. Text analytics search engines will probably be the newest big data field for development. Hint for developers: work on an analytics search product, launch it, and then it might be bought out.
April 29, 2015
The Exclusive Interview with Jason Hines, Global Vice President at Recorded Future
In my analyses of Google technology, despite the search giant’s significant technical achievements, Google has a weakness. That “issue” is the company’s comparatively weak time capabilities. Identifying the specific time at which an event took place or is taking place is a very difficult computing problem. Time is essential to understanding the context of an event.
This point becomes clear in the answers to my questions in the Xenky Cyber Wizards Speak interview, conducted on April 25, 2015, with Jason Hines, one of the leaders in Recorded Future’s threat detection efforts. You can read the full interview with Hines on the Xenky.com Cyber Wizards Speak site at the Recorded Future Threat Intelligence Blog.
Recorded Future is a rapidly growing, highly influential start up spawned by a team of computer scientists responsible for the Spotfire content analytics system. The team set out in 2010 to use time as one of the lynch pins in a predictive analytics service. The idea was simple: Identify the time of actions, apply numerical analyses to events related by semantics or entities, and flag important developments likely to result from signals in the content stream. The idea was to use time as the foundation of a next generation analysis system, complete with visual representations of otherwise unfathomable data from the Web, including forums, content hosting sites like Pastebin, social media, and so on.
A Recorded Future data dashboard it easy for a law enforcement or intelligence professionals to identify important events and, with a mouse click, zoom to the specific data of importance to an investigation. (Used with the permission of Recorded Future, 2015.)
Five years ago, the tools for threat detection did not exist. Components like distributed content acquisition and visualization provided significant benefits to enterprise and consumer applications. Google, for example, built a multi-billion business using distributed processes for Web searching. Salesforce.com integrated visualization into its cloud services to allow its customers to “get insight faster.”
According to Jason Hines, one of the founders of Recorded Future and a former Google engineer, “When our team set out about five years ago, we took on the big challenge of indexing the Web in real time for analysis, and in doing so developed unique technology that allows users to unlock new analytic value from the Web.”
Recorded Future attracted attention almost immediately. In what was an industry first, Google and In-Q-Tel (the investment arm of the US government) invested in the Boston-based company. Threat intelligence is a field defined by Recorded Future. The ability to process massive real time content flows and then identify hot spots and items of interest to a matter allows an authorized user to identify threats and take appropriate action quickly. Fueled by commercial events like the security breach at Sony and cyber attacks on the White House, threat detection is now a core business concern.
The impact of Recorded Future’s innovations on threat detection was immediate. Traditional methods relied on human analysts. These methods worked but were and are slow and expensive. The use of Google-scale content processing combined with “smart mathematics” opened the door to a radically new approach to threat detection. Security, law enforcement, and intelligence professionals understood that sophisticated mathematical procedures combined with a real-time content processing capability would deliver a new and sophisticated approach to reducing risk, which is the central focus of threat detection.
In the exclusive interview with Xenky.com, the law enforcement and intelligence information service, Hines told me:
Recorded Future provides information security analysts with real-time threat intelligence to proactively defend their organization from cyber attacks. Our patented Web Intelligence Engine indexes and analyzes the open and Deep Web to provide you actionable insights and real-time alerts into emerging and direct threats. Four of the top five companies in the world rely on Recorded Future.
Despite the blue ribbon technology and support of organizations widely recognized as the most sophisticated in the technology sector, Recorded Future’s technology is a response to customer needs in the financial, defense, and security sectors. Hines said:
When it comes to security professionals we really enable them to become more proactive and intelligence-driven, improve threat response effectiveness, and help them inform the leadership and board on the organization’s threat environment. Recorded Future has beautiful interactive visualizations, and it’s something that we hear security administrators love to put in front of top management.
As the first mover in the threat intelligence sector, Recorded Future makes it possible for an authorized user to identify high risk situations. The company’s ability to help forecast and spotlight threats likely to signal a potential problem has obvious benefits. For security applications, Recorded Future identifies threats and provides data which allow adaptive perimeter systems like intelligent firewalls to proactively respond to threats from hackers and cyber criminals. For law enforcement, Recorded Future can flag trends so that investigators can better allocate their resources when dealing with a specific surveillance task.
Hines told me that financial and other consumer centric firms can tap Recorded Future’s threat intelligence solutions. He said:
We are increasingly looking outside our enterprise and attempt to better anticipate emerging threats. With tools like Recorded Future we can assess huge swaths of behavior at a high level across the network and surface things that are very pertinent to your interests or business activities across the globe. Cyber security is about proactively knowing potential threats, and much of that is previewed on IRC channels, social media postings, and so on.
In my new monograph CyberOSINT: Next Generation Information Access, Recorded Future emerged as the leader in threat intelligence among the 22 companies offering NGIA services. To learn more about Recorded Future, navigate to the firm’s Web site at www.recordedfuture.com.
Stephen E Arnold, April 29, 2015
April 29, 2015
Mobile Web sites, mobile apps, mobile search, mobile content, and the list goes on and on for Web-related material to be mobile-friendly. Online retailers are being pressured to make their digital storefronts applicable to the mobile users, because more people are using their smartphones and tablets over standard desktop and laptop computers. It might seem easy to design an app and then people can download it for all of their shopping needs, but according to Easy Ask things are not that simple: “Internet Retailer Reveals Mobile Commerce Conversion Troubles.”
The article reveals that research conducted by Spreadshirt CTO Guido Laures shows that while there is a high demand for mobile friendly commerce applications and Web sites, very few people are actually purchasing products through these conduits. Why? The problem relates to the lack of spontaneous browsing and one the iPhone 6’s main selling features: a big screen.
“While mobile-friendly responsive designs and easier mobile checkouts are cited as inhibitors to mobile commerce conversion, an overlooked and more dangerous problem is earlier in the shopping process. Before they can buy, customers first need to find the product they want. Small screen sizes, clumsy typing and awkward scrolling gestures render traditional search and navigation useless on a smartphone.”
Easy Ask says that these problems can be resolved by using a natural language search application over the standard keyword search tool. It says that:
“A keyword search engine leaves you prone to misunderstanding different words and returning a wide swath of products that will frustrate your shoppers and continue you down the path of poor mobile customer conversion.”
Usually natural language voice search tools misunderstand words and return funny phrases. The article is a marketing tool to highlight the key features of Easy Ask technology, but they do make some key observations about mobile shopping habits.
April 29, 2015
A new content analysis platform boasts the ability to find “non-obvious” relationships within unstructured data, we learn from a write-up hosted at PRWeb, “Content Analyst Announces Cerebrant, a Revolutionary SaaS Discovery Platform to Provide Rapid Insight into Big Content.” The press release explains what makes Cerebrant special:
“Users can identify and select disparate collections of public and premium unstructured content such as scientific research papers, industry reports, syndicated research, news, Wikipedia and other internal and external repositories.
“Unlike alternative solutions, Cerebrant is not dependent upon Boolean search strings, exhaustive taxonomies, or word libraries since it leverages the power of the company’s proprietary Latent Semantic Indexing (LSI)-based learning engine. Users simply take a selection of text ranging from a short phrase, sentence, paragraph, or entire document and Cerebrant identifies and ranks the most conceptually related documents, articles and terms across the selected content sets ranging from tens of thousands to millions of text items.”
We’re told that Cerebrant is based on the company’s prominent CAAT machine learning engine. The write-up also notes that the platform is cloud-based, making it easy to implement and use. Content Analyst launched in 2004, and is based in Reston, Virginia, near Washington, DC. They also happen to be hiring, in case anyone here is interested.
Cynthia Murrell, April 29, 2015
April 27, 2015
A new enterprise search system startup is leveraging the SAP HANA Cloud Platform, we learn from “EnterpriseJungle Tames Enterprise Search” at SAP’s News Center. The company states that their goal is to make collaboration easier and more effective with a feature they’re calling “deep people search.” Writer Susn Galer cites EnterpriseJungle Principal James Sinclair when she tells us:
“Using advanced algorithms to analyze data from internal and external sources, including SAP Jam, SuccessFactors, wikis, and LinkedIn, the applications help companies understand the make-up of its workforce and connect people quickly….
“Who Can Help Me is a pre-populated search tool allowing employees to find internal experts by skills, location, project requirements and other criteria which companies can also configure, if needed. The Enterprise Q&A tool lets employees enter any text into the search bar, and find experts internally or outside company walls. Most companies use the prepackaged EnterpriseJungle solutions as is for Human Resources (HR), recruitment, sales and other departments. However, Sinclair said companies can easily modify search queries to meet any organization’s unique needs.”
EnterpriseJungle users manage their company’s data through SAP’s Lumira dashboard. Galer shares Sinclair’s example of one company in Germany, which used EnterpriseJungle to match employees to appropriate new positions when it made a whopping 3,000 jobs obsolete. Though the software is now designed primarily for HR and data-management departments, Sinclair hopes the collaboration tool will permeate the entire enterprise.
Cynthia Murrell, April 27, 2015
April 24, 2015
There was a time before the Internet that if you wanted to research something you had to go to the library, dig through old archives, and check encyclopedias for quick facts. While it seems that all information is at your disposable with a few keystrokes, but search results are often polluted with paid ads and unless your information comes from a trusted source, you can’t count it as fact.
LifeHacker, like many of us, knows that if you want to get the truth behind a topic, you have to do some old school sleuthing. The article “How To Research Like A Journalist When The Internet Doesn’t Deliver” drills down tried and true research methods that will continue to withstand the sands of time or the wrecking ball (depending on how long libraries remain brick and mortar buildings).
The article pushes using librarians as resources and even going as far as petitioning government agencies and filing FOIA requests for information. When it makes the claim that some information is only available in person or strictly for other librarians, this is both true and false. Many libraries are trying to digitize their information, but due to budgets are limited in their resources. Also unless the librarian works in a top secret archive, most of the information is readily available to anyone with or without the MLS degree.
Old school interviews are always great, especially when you have to cite a source. You can always cite your own interview and verify it cam straight from the horse’s mouth. One useful way to team the Internet with interviews is tracking down the interviewees.
Lastly, this is the best piece of advice from the article:
“Finally, once you’ve done all of this digging, visited government agencies, libraries, and the offices of the people with the knowledge you need, don’t lose it. Archive everything. Digitize those notes and the recordings of your interviews. Make copies of any material you’ve gotten your hands on, then scan them and archive them safely.”
The Internet is full of false information. By placing a little more credence out there, will make the information more safe to use or claim as the truth.
These tips are useful, even if a little obvious, but they however still fail to mention the important step that all librarians know: doing the actual footwork and proper search methods to find things.
Whitney Grace, April 24, 2015
April 23, 2015
Despite SharePoint’s broad adoption, it suffers from a perceived lack of user commitment. So it becomes a paradox that it is one of the fastest growing software options ever, and shows no signs of slowing down. CMS Wire tells us more in their article, “Businesses Committed to SharePoint, Despite Stalled Deployments.”
The article begins:
“It is little surprise then, that in a recent AIIM survey of 422 organizations respondents described their SharePoint projects as stalled (26 percent) or just not meeting expectations (37 percent). Inadequate user training and a general lack of planning, investment and expertise were the main reason given for this malaise. And the recent talk about how Office365 and cloud fit in with SharePoint has further muddied the waters. And yet support for SharePoint remains strong.”
In recent news, Microsoft has pushed the general availability of SharePoint Server 2016 back. Combine these release delays with the local delays organizations face regarding customization, investment, and expertise, and most deployments face an uphill battle. For most users and managers, staying in touch with the latest news is essential. Stephen E. Arnold of ArnoldIT.com offers an efficient newsfeed regarding all things search. His SharePoint feed is an efficient way to keep an eye on news, tips, tricks, and workarounds that impact all aspects of SharePoint use.
Emily Rae Aldridge, April 23, 2015
April 17, 2015
The article on TweakTown titled Gartner: Smart Machines Must Include Ethical Programming Protocols briefly delves into the necessity of developing ethical programming in order to avoid some sort of Terminator/ I,Robot situation that culminates in the rise of the machines and the end of humanity. Gartner is one of the world’s leading technology research and advisory companies, but it hardly sounds like the company stance. The article quotes Frank Buytendijk, a Gartner research VP,
“Clearly, people must trust smart machines if they are to accept and use them…The ability to earn trust must be part of any plan to implement artificial intelligence (AI) or smart machines, and will be an important selling point when marketing this technology.”
If you’re thinking, sounds like another mid-tier consultant is divining the future, you aren’t wrong. Researching ethical programming for the hypothetical self-aware machines that haven’t been built yet might just be someone’s idea of a good time. The article concludes with the statement that “experts are split on the topic, arguing whether or not humans truly have something to worry about.” While the experts figure out how we humans will cause the end of the human reign over earth, some of us are just waiting for the end of another in a line of increasingly violent winters.
Chelsea Kerwin, April 17, 2014
Stephen E Arnold, Publisher of CyberOSINT at www.xenky.com
April 16, 2015
The word collaboration has become one of those corporate power words like “synergy” and “KISS method.” Many people groan inwardly at new ways to “collaborate,” because it usually means another tool they have to learn and will fall out of use in under a year. With the myriad of ways to collaborate digitally, getting any actual collaborating done is difficult. The SAP News blog says enterprise collaboration might be getting a little easier in the article, “EnterpriseJungle Tames Enterprise Search.”
EnterpriseJungle created an application with the SAP Hana Cloud Platform to help companies connect quickly find and connect with experts within or outside their company. The Principal at EnterpriseJungle states that a company’s people search is vital tool to locate and harness information.
“ ‘Large companies are desperate to get a handle on understanding and accessing the expertise available to them at any given moment,’ said Sinclair. ‘Our solutions help companies solve fundamental questions like how do we find the people who are fantastic at what they do, but only known to their closest core group of co-workers? And, how do we easily bring their knowledge and expertise to the front line with minimal extra work? If we can help get information to employees that need it, we’re fundamentally making their lives easier, and making the company’s life easier.’ “
After a description of how EnterpriseJungle’s works and its usefulness for companies, it makes a claim to offer Google-like search results. While it might be a people search tool, the application is capable of much more. It can help people locate experts, track down skill sets, and even improve IT relations.
EnterpriseJungle is hitting on a vital tool for companies. People search has a severe need for improvement and this might be the start of a new enterprise niche market.
Whitney Grace, April 16, 2015
Stephen E Arnold, Publisher of CyberOSINT at www.xenky.com
April 16, 2015
A recent study by harmon.ie has found that Mobile Office 365 is growing quickly among its users. Mobile is a huge consideration for all software companies, and now the data is proving that mobile is the go-to for even heavy-hitting work and enterprise applications. Read more in the AppsTechNews article, “The state of mobile Office 365 usage in the workplace – and what it means for SharePoint.”
The article begins with the research:
“24% of harmon.ie mobile users are now using mobile Office 365 in the cloud, compared to 18% six months ago. Not surprisingly, the most popular activity conducted by business users on mobile devices was online and offline document access, according to 81% of the vote. 7% most frequently use their mobile devices to add a SharePoint site, while 4% prefer to favourite documents for later offline access.”
Retrieval is still proven to be the most common mobile function, as devices are still not designed well for efficient input. To keep up with future developments regarding mobile use in the enterprise, stay tuned to ArnoldIT.com. Stephen E. Arnold has made a career out of following all things search, and his SharePoint feed is an accessible place to stay tuned in to the latest SharePoint developments.
Emily Rae Aldridge, April 16, 2015
Stephen E Arnold, Publisher of CyberOSINT at www.xenky.com