CyberOSINT banner

Maverick Search and Match Platform from Exorbyte

August 31, 2015

The article titled Input Management: Exorbyte Automates the Determination of Identities on Business On (a primarily German language website) promotes the Full Page Entity Detect from Exorbyte. Exorbyte is a world leader in search and match for large volumes of data. They boast clients in government, insurance, input management and ICT firms, really any business with identity resolution needs. The article stresses the importance of pulling information from masses of data in the modern office. They explain,

“With Full Page Entity Detect provides exorbyte a solution to the inbox of several million incoming documents.This identity data of the digitized correspondence (can be used for correspondence definition ) extract with little effort from full-text documents such as letters and emails and efficiently compare them with reference databases. The input management tool combines a high fault tolerance with accuracy, speed and flexibility.Gartner, the software company from Konstanz was recently included in the Magic Quadrant for Enterprise Search.”

The company promises that their Matchmaker technology is unrivaled in searching text without restrictions, even without language, allowing for more accurate search. Full Page Entity Detect is said to be particularly useful when it comes to missing information or overlooked errors, since the search is so thorough.

Chelsea Kerwin, August 31 , 2015

Sponsored by, publisher of the CyberOSINT monograph

Monopoly On Scientific Papers

August 31, 2015

If you work in the academic community this headline from Your News Wire shouldn’t come as a surprise: “Nearly All Scientific Papers Controlled By Same Six Corporations.”  A group of researchers studied scientific papers published between 1973-2013 and discovered that six major publishers ruled the industry: Wiley-Blackwell, Springer, Taylor & Francis, Sage, Reed Elsevier, and ACS.  During the specified time period, it was found that the larger ones absorbed smaller publishers.  Another, more startling, fact came to light as well: academic research groups must rely more and more on the main six publishers’ interests if they want to get their research published.

“Much of the independence that was once cherished within the scientific community, in other words, has gone by the wayside as these major publishers have taken control and now dictate what types of content get published. The result is a publishing oligopoly in which scientists are muzzled by and overarching trend toward politically correct, and industry-favoring, ‘science.’”

The six publishers publish subjects that benefit their profit margin and as a direct result they influence major scientific fields.  Fields concerning chemistry, social sciences, and psychology are the most influenced by the publishers.  This leads to corruption in the above disciplines and researchers are limited by studies that will deliver the most profits to the publishers.  The main six publishers can also publish the papers digitally for a 40% profit margin.

There is good news.  The study did find that publishing a paper via a smaller venue does not affect its reach.  It also has the added benefit of the smaller venue not pushing a special interest agenda.  The real question is are big publishers even needed in a digital age anymore?

Whitney Grace, August 31, 2015
Sponsored by, publisher of the CyberOSINT monograph

Yammer Improvements and Changes on the Horizon

August 27, 2015

A few years ago, Yammer was an integral part of SharePoint’s marketing campaign as they sought to persuade users that they were moving toward a focus on social. With the upcoming release of SharePoint 2016, social is still important, although it feels less forced and more natural this time around. There will be changes to Yammer and Redmond Magazine covers it in their article, “Microsoft Announces Yammer Improvements To Come While Deprecating Some Yammer SharePoint Apps.”

The article says:

“Microsoft announced this week that it is working on a more team-oriented Yammer, and it will be bringing along some mobile app improvements, too. Yammer is Microsoft’s enterprise-grade social networking application that’s part of some Office 365 subscription plans. Yammer can be used as a standalone service, but it’s also used with SharePoint Server products and SharePoint Online implementations.”

To stay current on what else may change with the release of SharePoint Server 2016, stay tuned to Stephen E. Arnold is an expert on search and the enterprise. His dedicated SharePoint feed is a great way to stay up to date on the latest new surrounding SharePoint.

Emily Rae Aldridge, August 27, 2015

Sponsored by, publisher of the CyberOSINT monograph

Insights into the Cut and Paste Coding Crowd

August 26, 2015

I read “How Developers Search for Code.” Interesting. The write up points out what I have observed. Programmers search for existing — wait for it — code.

Why write something when there are wonderful snippets to recycle. Here’s the paragraph I highlighted:

We also learn that a search session is generally just one to two minutes in length and involves just one to two queries and one to two file clicks.

Yep, very researchy. Very detailed. Very shallow. Little wonder that most software rolls out in endless waves of fixes. Good enough is the sort of sigma way.

Encouraging. Now why did that air traffic control crash happen? Where are the back ups to the data in Google’s Belgium server center? Why does that wonderful Windows 10 suck down data to mobile devices with little regard for data caps? Why does malware surface in Android apps?

Good enough: the new approach to software QA/QC.

Stephen E Arnold, August 26, 2015

How to Search the Ashley-Madison Data and Discover If You Had an Affair Too

August 26, 2015

If you haven’t heard about the affair-promoting website Ashley Madison’s data breach, you might want to crawl out from under that rock and learn about the millions of email addresses exposed by hackers to be linked to the infidelity site. In spite of claims by parent company Avid Life Media that users’ discretion was secure, and that the servers were “kind of untouchable,” as many as 37 million customers have been exposed. Perhaps unsurprisingly, a huge number of government and military personnel have been found on the list. The article on Reuters titled Hacker’s Ashley Madison Data Dump Threatens Marriages, Reputations also mentions that the dump has divorce lawyers clicking their heels with glee at their good luck. As for the motivation of the hackers? The article explains,

“The hackers’ move to identify members of the marital cheating website appeared aimed at maximum damage to the company, which also runs websites such as, causing public embarrassment to its members, rather than financial gain. “Find yourself in here?,” said the group, which calls itself the Impact Team, in a statement alongside the data dump. “It was [Avid Life Media] that failed you and lied to you. Prosecute them and claim damages. Then move on with your life. Learn your lesson and make amends. Embarrassing now, but you’ll get over it.”

If you would like to “find yourself” or at least check to see if any of your email addresses are part of the data dump, you are able to do so. The original data was put on the dark web, which is not easily accessible for most people. But the website Trustify lets people search for themselves and their partners to see if they were part of the scandal. The website states,

“Many people will face embarrassment, professional problems, and even divorce when their private details were exposed. Enter your email address (or the email address of your spouse) to see if your sexual preferences and other information was exposed on Ashley Madison or Adult Friend Finder. Please note that an email will be sent to this address.”

It’s also important to keep in mind that many of the email accounts registered to Ashley Madison seem to be stolen. However, the ability to search the data has already yielded some embarrassment for public officials and, of course, “family values” activist Josh Duggar. The article on the Daily Mail titled Names of 37 Million Cheating Spouses Are Leaked Online: Hackers Dump Huge Data File Revealing Clients of Adultery Website Ashley Madison- Including Bankers, UN and Vatican Staff goes into great detail about the company, the owners (married couple Noel and Amanda Biderman) and how hackers took it upon themselves to be the moral police of the internet. But the article also mentions,

“Ashley Madison’s sign-up process does not require verification of an email address to set up an account. This means addresses might have been used by others, and doesn’t prove that person used the site themselves.”

Some people are already claiming that they had never heard of Ashley Madison in spite of their emails being included in the data dump. Meanwhile, the Errata Security Blog entry titled Notes on the Ashley-Madison Dump defends the cybersecurity of Ashley Madison. The article says,

“They tokenized credit card transactions and didn’t store full credit card numbers. They hashed passwords correctly with bcrypt. They stored email addresses and passwords in separate tables, to make grabbing them (slightly) harder. Thus, this hasn’t become a massive breach of passwords and credit-card numbers that other large breaches have lead to. They deserve praise for this.”

Praise for this, if for nothing else. The impact of this data breach is still only beginning, with millions of marriages and reputations in the most immediate trouble, and the public perception of the cloud and cybersecurity close behind.


Chelsea Kerwin, August 26, 2015

Sponsored by, publisher of the CyberOSINT monograph

It Is a Recommended Title

August 24, 2015

Centripetal Networks offers a fully integrated security network specializing in threat-based intelligence.  Threat intelligence is being informed about potential attacks, who creates the attacks, and how to prevent them.  Think of it as the digital version of “stranger danger.”  Centripetal Networks offers combative software using threat intelligence to prevent hacking with real-time results and tailoring for individual systems.

While Centripetal Networks peddles its software, they also share information sources that expand on threat intelligence, how it pertains to specific industries, and new developments in digital security.  Not to brag or anything, but our very own CyberOSINT: Next Generation Information Access made the news page!  Take a gander at its description:

“The RuleGate technology continues to remain the leader in speed and performance as an appliance, and its visualization and analytics tools are easy-to-use. Because of federal use and interest, its threat intelligence resources will continue to rank at the top. Cyber defense, done in this manner, is the most useful for its real time capacity and sheer speed in computing.”

CyberOSINT was written for law enforcement officials to gain and understanding of threat intelligence as well as tools they can use to arm themselves against cyber theft and track potential attacks.  It profiles companies that specialize in threat intelligence and evaluates them.  Centripetal Networks is proudly featured in the book.

Whitney Grace, August 24, 2015

Sponsored by, publisher of the CyberOSINT monograph




Do Search and CMS Deliver a Revenue Winner?

August 21, 2015

I spotted a write up called “Look for Enterprise Search, Analytics and These ECM Leaders for Your Transactional Content.” I found the article darned amazing even for public relations about a mid tier consulting firm and one of its analyses.

The main point of the article is that analysts have analyzed enterprise software and identified vendors who provide “ECM Transactional Content Services.” Fabricating collections of objects and slapping a jargon laded label on the batch is okay with me.


Empty calories await you, gentle reader.

What struck me as interesting was this statement:

Forrester Vice President and Principal Analyst Craig Le Clair points to key advancements and opportunities by the leading ECM providers to help enterprises realize greater value in these systems:

  • Ramping analytics to drive insight and reduce administrative burden
  • Accelerating their move to cloud
  • Improved search and content sharing
  • Using stronger and more open application program interfaces (APIs) that spur innovation
  • Moving quickly to fill gaps in their mobile road maps.

Notice the “ECM”. The acronym refers to software which provides editing, access, and publishing functions to its users. The idea, it seems, is that an employee will write a memo and the ECM will keep track of the document. In practice, based on my experience, the ECM recipe usually fails to satisfy my hunger.

ECM and its close cousins in acronym land are similar to the approach articulated by my kindergarten teacher more than half century ago. She said, according to my mother, “Keep your mittens and lunch in your cubby.” The spirit of the kindergarten teacher lives on in enterprise content management systems.

Unfortunately those who have work to do often create content using tools suited for a specific task. For an engineer, that tool might be Solidworks. Bench chemists are often confused when an ECM is described as the tool for their work. One chemist said to me after an enthusiastic presentation by an information technology person, “I work with chemical structures. What’s this person talking about?” Lawyers in the midst of big risk litigation want to use their own and often flawed document systems.  Even the marketer who cheers for ECM for Web content parks some high value data in that wonderful Adobe creative cloud with some back up data on iCloud. I have spotted a renegade analyst with an off the books workstation equipped with an Australian text processing and search system. is notable for what is not available because executive brand entities roll their own content solutions.

I was able to review a copy of the consultant report upon which the article was based. Wowza. The write up assembled a grad bag of widely disparate companies, added three cups of buzzwords, and output mixed in one kilo of MBAisms.

To be fair, the report identified “challenges.” These items baffled me. For example, “Deep experience in key transactional applications.” This is a challenge, really?

But the vendors in the report are able to “address emerging opportunities.” Okay, so these are not opportunities. The opportunities are emerging. Hmmm. Here’s an example: “Ramping analytics to drive insight and reduce administrative burden.” Yikes. Ramping analytics. Driving analytics. Reducing administrative burden. Very active stuff this ECM. Gerund alert. Gerund alert.

What companies are into this suite of challenges and emerging opportunities? Here’s the list of the mid tier touted stallions from the ECM stable:

  1. EMC, a company which is considering having a subsidiary of itself purchase the parent company. Folks, when a company does this type of recursive stuff, the core business might be a little bit uncertain.
  2. HP. Yep, an outfit which has lost its way, suffered five consecutive quarters of declining revenue, and bought a company for $11 billion and then wrote off most of that expense because the sellers of the company fooled HP, its consultants, accountants, and lawyers. Okay. A winner for the legal eagles maybe.
  3. IBM. Heaven help me. IBM has suffered declining revenues for 13 consecutive quarters, annoyed me with a blizzard of Watson silliness, and spent lots of time getting rid of businesses. I have a difficult time believing that IBM can manage enterprise content. But, hey, that’s just my rural Kentucky ignorance, right?
  4. Laserfiche. The company offers a “flexible, proven enterprise content management system. I believe this statement. The company was founded in 1987 and sure seems to have its roots in well seasoned technology. The company has lots of customers and lots of award. The only hitch in the git along is that I never ran across this outfit in my work. Bad luck I guess.
  5. Lexmark. Folks, let us recall the rumor that Lexmark and its content businesses are not money makers. I heard that the content cluster achieved an astounding $70 to $80 million shortfall. Who knows if this rumor is accurate. I do know that Lexmark is cutting staff, and one does not take this drastic step unless one needs to reduce costs pronto.
  6. M Files. I never heard of this outfit. I did a quick check of my files and learned that the company “helps enterprises find, share, and secure documents and information. Even in highly regulated industries.” The company is also “passionate about productivity.” The outfit relies on dtSearch for information access. This is okay because dtSearch can process most of the content within a Microsoft-centric environment. But M Files strikes me as a different type of outfit from HP or IBM. As I flipped through the information I had collected, the company struck me as a collection of components. Assembly required.
  7. Newgen Software. Another newbie for me. The company was in my Overflight archive. The firm provides BPM (business process management), ECM (enterprise content management), DMS (I have no idea what this acronym means), CCM (I have no idea what this acronym means), and workflow (I thought this was the same as BPM). The company operated from New Delhi. My thought? Another collection of components with assembly in someone’s future.
  8. Hyland OnBase. This is the third outfit on the list about which I have a modest amount of information. The company says that it is a “leader in ECM.” I believe it. The firm’s url is the same as its flagship product. The company was founded in 1991 and created OnBase, which is a plus. After 25 years, the darned thing should work better than a Rube Goldberg solution assembled from a box of components.
  9. OpenText. Okay, OpenText is a company which has more search engines and content processing systems than most Canadian firms. The challenge at OpenText is having enough cash to invest in keeping the diverse assortment of systems current. Which of these systems is the one referenced in the mid tier firm’s report? SGML search, BASIS, BRS, Nstein, the Autonomy stub in RedDot, Nstein, Fulcrum, or some other approach? Details can be important.
  10. Unisys. Okay, finally a company that is essentially an integrator which still supports Burroughs mainframes. Unisys can implement systems because it is an integrator. For government work, Unisys matches the statement of work to available software. Although some might question this statement, Unisys can implement almost any kind of system eventually.

Several observations:

First, enterprise content management is a big and fuzzy concept. The evidence of this is the number of acronyms some of the companies use to explain what they do. I assume that it is my ignorance which prevents me from understanding exactly how scanning, indexing, retrieval, repurposing, workflow, and administrative functions work in a cost constrained, teleworker, mobile gizmo world.

Second, open source is knocking on the door of this sector. At some point, organizations will tire of the cost and complexity of collections of loosely federated and integrated software subsystems and look for an alternative. Toss in the word Big Data, and there will be a stampede of New Age consultants ready to step forward and reinvent these outfits. Disruption is probably less of a challenge than the challenge of keeping existing revenues from doing the HP, IBM, and Lexmark drift down.

Third, the search function seems to be a utility or an after thought. The only problem is that search does not work particularly well in an enterprise where the workers log in from Starbucks and try to interact with enterprise software from a Blackberry.

Fourth, what an odd collection of outfits? HP, IBM, and Lexmark along with 30 year old imaging firms plus some small outfits. Maybe the selection of firms makes sense to you, gentle reader. For me, the report make evident the struggles of some experts in ECM, BPM, and the acronyms I know zero about.

In short, this mid tier report strikes me as a russische punschtorte. On the surface, the darned thing looks good, maybe mouth watering. After a chomp or two, I want a paprikahenderl.

This ECM thing is a confection, not a meaty chicken. Mixing in search does nothing for the recipe.

Stephen E Arnold, August 22, 2015

Sensible Advice on Content Marketing

August 21, 2015

Here’s a post on structured-content marketing that is refreshingly free of semantic search baloney. Tatiana Tilearcio at Synthesio shares what she learned from a seminar in, “Four Insights from a Content Marketing Crash Course.” The symposium, scheduled to be repeated in October in Connecticut, was presented by content-strategy outfit Content Boost. Tilearcio’s first takeaway promotes a firm foundation; she writes:

“Get Organized And Understand Your Goals Before You Create Your Content Marketing Plan.

Before you sit down to put together your strategic plan, you have to know the answer to the question ‘what’s the purpose for your content marketing, and what will it do to your brand?’ To do this, you need to first create a dream wish-list of what you would like to see for your brand. Next, you need to address how you want to go about enhancing your brand’s content marketing efforts and what your budget is. When creating a content marketing plan, or any marketing plan, a budget is essential. Without a proper budget of what your plan will cost, your ideas will never come to fruition. If you have identified all of this, then you are already well on your way to understanding what your campaign strategy is.”

The article also discusses blending efforts in blogging, social media, and email; co-sourcing content; ensuring users find value in gated assets; repurposing content; and the importance of strong titles. See the post for more details on each of these points. Based in Norwalk, Connecticut, Content Boost is part of the Technology Marketing Corporation, aka TMCnet.

Cynthia Murrell, August 21, 2015

Sponsored by, publisher of the CyberOSINT monograph


Its Hacker Season

August 21, 2015

One of the quintessential cartoon feuds exists between Bugs Bunny and Daffy Duck as they argue whether or not it is duck or rabbit hunting season.  Whoever wins gets the lovely prize of having their face blown off, thankfully cartoon violence does not obey the rules of life and death.  The ensuing argument ends with hilarious consequences, but everyday another type of big game is always in season: your personal information.  Hackers are constantly searching for ways to break into vulnerable systems and steal valuable information.

One a personal level it is frightening to be hacked, but corporations stand risk millions of dollars, customer information, trade secrets, and their reputations if their systems get hacked.   There are many companies that specialize in software to prevent potential hackings, but Cybereason offers unique selling points in the article, “Introducing Cybereason: Real-Time Automated Cyber Hunting.”

“This is why Cybereason exists, to bring the fight against hackers off of the frontlines and into the depths of your environment, where they lurk after gaining unnoticed access. Security needs to be about having an ever-watchful eye over your endpoints, servers, and network, and the Cybereason platform will allow you to perform real-time, automated hunting across your entire environment.”

On their Web site they posted a product video that feeds on the US’s culture of fear and they present an Armageddon like situation complete with a female voice over artist with a British accent, a Guy Fawkes mask, and Matrix-like graphics.  My favorite bit is when Cybereason is made to resemble a secret intelligence agency of superheroes.

Despite the clichéd video, it does give a thorough visualization of what Cybereason’s software and services can do.  The fear factor might be a selling point for some clients, but I’d rather hear hard facts and direct solutions.  It takes out the dramatic elements and actually tells me what the product can do for me.  You have to love Cybereason’s ending phrase, “Let the hunt begin.” It makes me want to respond with, “May the odds ever be in your favor.”

Whitney Grace, August 21, 2015

Sponsored by, publisher of the CyberOSINT monograph

Compare Trump to Lincoln with Watson Personality Insights

August 19, 2015

IBM’s Watson is employing its capabilities in a new and interesting way: BoingBoing asks, “What Does Your Writing Say About You? IBM Watson Personality Insights Will Tell You.” The software derives cognitive and social characteristics about people from their writings, using linguistic analytics. I never thought I’d see a direct, graphically represented comparison between speeches of Donald Trump and Abe Lincoln, but there it is. There are actually some similarities; they’re both businessmen turned politicians, after all. Reporter Andrea James shares Watson’s take on Trump’s “We Need Brain” speech from the recent Republican primary debate:

“You are a bit dependent, somewhat verbose and boisterous. You are susceptible to stress: you are easily overwhelmed in stressful situations. You are emotionally aware: you are aware of your feelings and how to express them. And you are prone to worry: you tend to worry about things that might happen. Your choices are driven by a desire for efficiency. You consider both independence and helping others to guide a large part of what you do. You like to set your own goals to decide how to best achieve them. And you think it is important to take care of the people around you.”

For comparison, see the write-up for the analysis of Lincoln’s Gettysburg Address (rest assured, Lincoln does come out looking better than Trump). The article also supplies this link, where you can submit between 3500 and 6000 words for Watson’s psychoanalysis; as James notes, you can submit writing penned by yourself, a friend, or an enemy (or some random blogger, perhaps.) To investigate the software’s methodology, click here.

Cynthia Murrell, August 19, 2015

Sponsored by, publisher of the CyberOSINT monograph

Next Page »