<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Beyond Search &#187; Real time search</title>
	<atom:link href="http://arnoldit.com/wordpress/category/real-time-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://arnoldit.com/wordpress</link>
	<description>by Stephen E. Arnold</description>
	<lastBuildDate>Sun, 12 Feb 2012 05:07:41 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Al Jazeera and Its US Reach</title>
		<link>http://arnoldit.com/wordpress/2012/01/24/al-jazeera-and-its-us-reach/</link>
		<comments>http://arnoldit.com/wordpress/2012/01/24/al-jazeera-and-its-us-reach/#comments</comments>
		<pubDate>Tue, 24 Jan 2012 05:11:07 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Business strategy]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Real time search]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=22372</guid>
		<description><![CDATA[We were surprised, then resigned. Has the US slipped lower on yet another yardstick of achievement? Al Jazeera English, an international 24 hour English-Language news and current affairs TV channel headquartered in Doha, Qatar, has now reached 250 million homes — 5 million of those being in the U.S. The Los Angeles Times reported on [...]]]></description>
			<content:encoded><![CDATA[<p>We were surprised, then resigned. Has the US slipped lower on yet another yardstick of achievement?</p>
<p><a href="http://www.aljazeera.com/">Al Jazeera English</a>, an international 24 hour English-Language news and current affairs TV channel headquartered in Doha, Qatar, has now reached 250 million homes — 5 million of those being in the U.S.</p>
<p>The <a href="http://www.latimes.com/">Los Angeles Times</a> reported on this startling milestone in the article <a href="http://latimesblogs.latimes.com/showtracker/2011/12/al-jazeera-english-now-reaches-250-million-households.html">“Al Jazeera English Now Reaches 250 Million Households.”</a></p>
<p>We learned:</p>
<blockquote><p>Five years after its launch, there are 130 countries that carry Al Jazeera English, but in the U.S., the channel has limited availability; it can be found on cable systems in Washington, D.C.; New York; Burlington, Vt.; Toledo, Ohio; and, recently, Chicago and in Los Angeles on KCET. And while the U.S. makes up a fraction of the quarter-billion households, it is a major source of AJE&#8217;s Web traffic, totaling 40 percent, according to the network.</p></blockquote>
<p>The fact that Al Jazeera English has such a large web following in the United States despite its limited availability, leads me to think that a significant shift has taken place.</p>
<p>Jasmine Ashton, January 24, 2012</p>
<p>Sponsored by <a href="http://www.pandia.com/enterprise-search">Pandia.com</a></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2012/01/24/al-jazeera-and-its-us-reach/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Does Real Time Again</title>
		<link>http://arnoldit.com/wordpress/2011/10/28/google-does-real-time-again/</link>
		<comments>http://arnoldit.com/wordpress/2011/10/28/google-does-real-time-again/#comments</comments>
		<pubDate>Fri, 28 Oct 2011 05:10:29 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Business strategy]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Text processing]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=20728</guid>
		<description><![CDATA[Google+ Rolls Out Real-Time Search and Hashtag Support On October 12, Google Plus rolled out two new features; both allow users to create custom news streams based around topics being shared and build upon the search functionality of the network. The first feature, a real-time search, finds results from Google+ posts that are related to [...]]]></description>
			<content:encoded><![CDATA[<p>Google+ Rolls Out Real-Time Search and Hashtag Support</p>
<p>On October 12, <a href="https://plus.google.com/up/?continue=https://plus.google.com/&amp;type=st">Google Plus</a> rolled out two new features; both allow users to create custom news streams based around topics being shared and build upon the search functionality of the network. The first feature, a real-time search, finds results from Google+ posts that are related to the search term a user enters. As new posts are created centering around the search topic, the user is notified and a real-time stream of posts is begun. ZDNet’s article, “<a href="http://www.zdnet.com/blog/google/google-real-time-search-the-social-news-ticker/3397">Google+ Real-Time Search: The Social News “Ticker”</a> tells us more about the changes:</p>
<blockquote><p>… Google engineer Vic Gundotra &#8211; who <a href="https://plus.google.com/u/0/107117483540235115863/posts/dXovwc1hSyY">posted the news</a> from his Google Plus feed &#8211; notes that it’s a great way to keep up with real-time news events, such as a speech, a court trial or a sporting event. Basically, it’s a real-time news ticker for niche topics. The second feature &#8211; hashtag support &#8211; essentially turns any hashtag in a post into a searchable term that can be used as another way to create feeds and real-time streams.</p></blockquote>
<p>This is a catchy notion. I’m interested to see if Google+ will begin integrating all social networking posts into their search results. What they’re doing right now isn’t groundbreaking; <a href="http://www.twitter.com">Twitter</a> already offers the exact same feature. However, it would be groundbreaking to be able to follow trending topics on all the major social networking sites as they correlate to breaking news.</p>
<p>But Google did real time before. What’s “real time”? Whatever Google wants it to be I suppose from a marketing viewpoint.</p>
<p>Stephen E Arnold, October 28, 2011</p>
<p>Sponsored by <a href="http://Pandia.com">Pandia.com</a></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/10/28/google-does-real-time-again/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lucid Imagination: Open Source Search Reaches for Big Data</title>
		<link>http://arnoldit.com/wordpress/2011/09/30/lucid-imagination-open-source-search-reaches-for-big-data/</link>
		<comments>http://arnoldit.com/wordpress/2011/09/30/lucid-imagination-open-source-search-reaches-for-big-data/#comments</comments>
		<pubDate>Fri, 30 Sep 2011 05:04:09 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Business process]]></category>
		<category><![CDATA[Business strategy]]></category>
		<category><![CDATA[Data mining]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[Feature]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=20129</guid>
		<description><![CDATA[We are wrapping up a report about the challenges “big data” pose to organizations. Perhaps the most interesting outcome of our research is that there are very few search and content processing systems which can cope with the digital information required by some organizations. Three examples merit listing before I comment on open source search [...]]]></description>
			<content:encoded><![CDATA[<p>We are wrapping up a report about the challenges “big data” pose to organizations. Perhaps the most interesting outcome of our research is that there are very few search and content processing systems which can cope with the digital information required by some organizations. Three examples merit listing before I comment on open source search and “big data”.</p>
<p>The first example is the challenge of filtering information required by orgnaizatio0ns produced within the organization and by the organizations staff, contractors, and advisors. We learned in the course of our investigation that the promises of processing updates to Web pages, price lists, contracts, sales and marketing collateral, and other routine information are largely unmet. One of the problems is that the disparate content types have different update and change cycles. The most widely used content management system based on our research results is SharePoint, and SharePoint is not able to deliver a comprehensive listing of content without significant latency. Fixes are available but these are engineering tasks which consume resources. Cloud solutions do not fare much better, once again due to latency. The bottom line is that for information produced within an organization employees are mostly unable to locate information without a manual double check. Latency is the problem. We did identify one system which delivered documented latency across disparate content types of 10 to 15 minutes. The solution is available from <a href="http://www.exalead.com" target="_blank">Exalead</a>, but the other vendors’ systems were not able to match this problem of putting fresh, timely information produced within an organization in front of system users. Shocked? We were.</p>
<p><a href="http://arnoldit.com/wordpress/wp-content/uploads/2011/09/luciddecisioncopy.png"><img style="display: inline; border: 0px;" title="lucid decision copy" src="http://arnoldit.com/wordpress/wp-content/uploads/2011/09/luciddecisioncopy_thumb.png" alt="lucid decision copy" width="187" height="191" border="0" /></a></p>
<p><span style="color: #800000; font-size: x-small;">Reducing latency in search and content processing systems is a major challenge. Vendors often lack the resources required to solve a “hard problem” so “easy problems” are positioned as the key to improving information access. Is latency a popular topic? A few vendors do address the issue; for example, Digital Reasoning and Exalead.</span></p>
<p>Second, when organizations tap into content produced by third parties, the latency problem becomes more severe. There is the issue of the inefficiency and scaling of frequent index updates. But the larger problem is that once an organization “goes outside” for information, additional variables are introduced. In order to process the broad range of content available from publicly accessible Web sites or the specialized file types used by certain third party content producers, connectors become a factor. Most search vendors obtain connectors from third parties. These work pretty much as advertised for common file types such as Lotus Notes. However, when one of the targeted Web sites such as a commercial news services or a third-party research firm makes a change, the content acquisition system cannot acquire content until the connectors are “fixed”. No problem as long as the company needing the information is prepared to wait. In my experience, broken connectors mean another variable. Again, no problem unless critical information needed to close a deal is overlooked.</p>
<p><span id="more-20129"></span></p>
<p>Third, many organizations are seeking to make sense of flows of social content. In our research, we learned that a majority of those with whom we spoke expressed interest in Facebook, Twitter, and other social content. However, few firms were using social content, electing to tap into a subset of information via a third party provider like <a href="http://www.datasift.net" target="_blank">DataSift</a> and processing the content with a third party solution optimized for that content stream. This is a workable solution, but it underscores the problem of making needed information available to employees or analysts who need timely data for a business decision.</p>
<p>Net net.</p>
<p>It is easy to talk about low latency and real time content processing, but it remains expensive and mostly an unmet goal. Many vendors focus on jumping over less formidable hurdles.</p>
<p>We found the write up “<a href="http://www.fiercecontentmanagement.com/story/lucid-imagination-brings-open-source-search-big-data/2011-09-28" target="_blank">Lucid Imagination Brings Open Source Search to Big Data</a>” quite interesting. I have not associated open source search and big data in the manner set forth in the Fierce write up. Here’s a <a href="http://www.lucidimagination.com" target="_blank">Lucid</a> statement which caught my attention:</p>
<blockquote><p>One of the biggest issues for companies adopting open source software is that they often lack the polished installation packages of the more commercial software packages. LucidWorks 2.0 provides new setup and management tools via an administrative console that in its words, &#8220;streamlines configuration, deployment and operations&#8221; for IT when setting up, deploying and managing the application.</p></blockquote>
<p>This is indeed a point. Installation of some open source software impossible without appropriate technical expertise. However, installation is manageable by those with enough knowledge to decide to use open source software in the first place. Managers may not be able to do it, but managers can hire people who can. My experience is that those who are interested in downloading <a href="http://lucene.apache.org/java/docs/index.html" target="_blank">Lucene</a>/Solr from Apache.org presumably have some technical chops.</p>
<p>More important, in my opinion, is that open source search and content processing must be more capable with regard to the three examples I identified above. A query run on incomplete or stale information is likely to undermine a decision at some point. All to often in my research, I encounter search results which often—perhaps usually—do not reflect the <strong>most current</strong> information in digital form.</p>
<p><a href="http://arnoldit.com/wordpress/wp-content/uploads/2011/09/image2.png"><img style="display: inline; border: 0px;" title="image" src="http://arnoldit.com/wordpress/wp-content/uploads/2011/09/image_thumb2.png" alt="image" width="221" height="244" border="0" /></a></p>
<p><span style="color: #800000; font-size: x-small;">Search is going to climb Big Data Mountain. Aren’t specialized tools and systems needed? </span></p>
<p>I am okay with improved installation and deployment, but vendors will have to work overtime to beat the <a href="http://www.google.com/enterprise/search/gsa.html" target="_blank">Google Search Appliance</a> or <a href="http://www.blossom.com" target="_blank">Blossom Software’s hosted solution</a> for speedy, painless, search-today roll outs. In short, the problem with “open source search” in the enterprise is narrowed to installation. Several firms are robust, yet different approaches to open source search; for instance:</p>
<ul>
<li><a href="http://www.digitalreasoning.com" target="_blank">Digital Reasoning</a>. The company has proprietary software, but it has teamed with other firms so that it can deliver a comprehensive solution to massive flows of content in a near zero latency implementation. Yep, near zero latency. Installation is part of a search solution, but it is the starting point. The end point is making sure the outputs are usable and reflect the freshest possible data in a form appropriate to the user of the system.</li>
<li><a href="http://www.flax.co.uk" target="_blank">FLAX</a>. This open source solution delivers a range of features. Installation is reliable and dozens of organizations have found the FLAX approach flexible, scalable, and cost effective. Its open source nature eliminates the “license handcuffs” that limit degrees of freedom imposed by some commercial search system vendors.</li>
<li><a href="http://www.searchblox.com/" target="_blank">SearchBlox</a>. This firm’s approach seems quite similar to the one described in the Fierce article. Amazon highlights the company’s use of Amazon Web services in a <a href="http://aws.amazon.com/solutions/case-studies/searchblox/" target="_blank">case study</a>. We are not sure how much revenue SearchBlox is generating, but the company says, it is “a leading provider of enterprise search solutions based on Apache Lucene. Over 300 customers in 30 countries use SearchBlox to power their website, intranet and custom search. SearchBlox Software, Inc. was founded in 2003 with the aim to develop commercial search products based on <a href="http://lucene.apache.org/java/docs/index.html" target="_blank">Apache Lucene</a>.”</li>
</ul>
<p>One negative in the open source search world was the fact that <a href="http://www.tesuji.eu" target="_blank">Tesuji</a>, an open source search vendor in Hungary, is repositioning its operations. Despite the strong interest in open source search, the Tesuji approach did not gain sufficient traction.</p>
<p>Our view is that open source search is a viable option for many organizations with the appetite for a solution that sidesteps license fees and permits by-the-drink technical support. However, most of today’s search solutions have far more significant challenges to address. One may argue with the price tags paid for <a href="http://www.microsoft.com/presspass/press/2008/jan08/01-08FastSearchPR.mspx" target="_blank">Fast Search &amp; Transfer</a>, <a href="http://www.exalead.com/software/news/press-releases/2010/06-09.php" target="_blank">Exalead</a>, and <a href="http://www.bloomberg.com/news/2011-08-18/hp-said-to-be-near-10-billion-autonomy-takeover-spinoff-of-pc-business.html" target="_blank">Autonomy</a>, but there is a reason. <a href="http://www-01.ibm.com/software/data/enterprise-search/omnifind-enterprise/" target="_blank">IBM</a> has embraced open source search and wrapped it with its particular array of add ons and special services.  The problem with open source search as well as other search solutions is that once deployed the systems exhibit the latency problem, the connector glitch problem, and the single point of access problem.</p>
<p>My view is that a single attribute like easy installation echoes Oracle’s “secure enterprise search” marketing approach. A single factor is presented as a way to capture market interest. We think that the issues associated with content acquisition, system latency, index updates, interface, and a single point of access to needed information are ultimately more important than installation hassles or even the open source idea itself. I will explore open source search in my next Online Magazine column. The connector issue is of particular interest to me, almost as important as the latency challenge. After all, who wants to run a query and get information which has not been refreshed to make the most current information findable?</p>
<p><a href="http://www.arnoldit.com/sitemap.html">Stephen E Arnold</a>, September 30, 2011</p>
<p>Sponsored by <a href="http://www.pandia.com/enterprise-search">Pandia.com</a></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/09/30/lucid-imagination-open-source-search-reaches-for-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Endeca Clicks into Real Time Search with DataSift</title>
		<link>http://arnoldit.com/wordpress/2011/09/26/endeca-clicks-into-real-time-search-with-datasift/</link>
		<comments>http://arnoldit.com/wordpress/2011/09/26/endeca-clicks-into-real-time-search-with-datasift/#comments</comments>
		<pubDate>Mon, 26 Sep 2011 05:07:47 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Business strategy]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Social]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=20055</guid>
		<description><![CDATA[Endeca, known for its e-commerce software, is pairing with DataSift, a provider of aggregated social data feeds at Web scale. Their partnership will produce visualizations and advanced analytics on semi-structured content in real time. Benzinga covers the latest in, “Endeca and DataSift Team to Analyze the Real Time Web.” The write up asserts: Pairing Endeca [...]]]></description>
			<content:encoded><![CDATA[<p>Endeca, known for its e-commerce software, is pairing with DataSift, a provider of aggregated social data feeds at Web scale. Their partnership will produce visualizations and advanced analytics on semi-structured content in real time. Benzinga covers the latest in, “<a href="http://www.benzinga.com/pressreleases/11/09/p1937193/endeca-and-datasift-team-to-analyze-the-real-time-web">Endeca and DataSift Team to Analyze the Real Time Web</a>.” The write up asserts:</p>
<blockquote><p>Pairing Endeca Latitude®, an Agile BI platform, with the breadth of social data like Facebook, Twitter, and WordPress as well as other popular social solutions, enables organizations to react to the “big data fire hose” alongside internal data, for marketing analytics, customer intelligence, CRM and competitive intelligence. Endeca and DataSift will demonstrate their joint offering at O&#8217;Reilly&#8217;s Strata Conference on September 22-23 in New York.</p></blockquote>
<p>DataSift’s granular and modular sifting abilities combine with Endeca Latitude’s intuitive interface to produce a product that is both powerful and cost-effective. The yet unnamed offering will help companies mine the business value out of the gushing well of new social data.</p>
<p>Our view is that “latency” exists across the six major types of “real time” solutions. What does “real time” mean? Well, it means different things depending upon the application. Some solutions are mind bogglingly expensive. Think Thomson Reuters’ feeds of financial data on certain investments. Others are pretty leisurely; for example, what is trending in the world of Lady Gaga. Interesting tie up. No solid definition of latency yet. We are watching and waiting. You know. Latency.</p>
<p>Emily Rae Aldridge, September 23, 2011</p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/09/26/endeca-clicks-into-real-time-search-with-datasift/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Alerts When Search Is Hit and Miss</title>
		<link>http://arnoldit.com/wordpress/2011/08/21/alerts-when-search-is-hit-and-miss/</link>
		<comments>http://arnoldit.com/wordpress/2011/08/21/alerts-when-search-is-hit-and-miss/#comments</comments>
		<pubDate>Sun, 21 Aug 2011 05:01:32 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Business process]]></category>
		<category><![CDATA[Business strategy]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Text processing]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=19229</guid>
		<description><![CDATA[Search seems like the answer to Every Man’s information needs. It is not. Not by a long shot. If organizations cannot search by individual as to who needs information, they will invariably push content onto a whole group of people. AFV-News reported “U.S. Army Deploys AtHoc IWSAlerts Emergency Mass Notification System.” Businesses, schools, universities, and [...]]]></description>
			<content:encoded><![CDATA[<p>Search seems like the answer to Every Man’s information needs. It is not. Not by a long shot.</p>
<p>If organizations cannot search by individual as to who needs information, they will invariably push content onto a whole group of people. AFV-News reported “<a href="http://www.afv-news.com/2011/07/u-s-army-deploys-athoc-iwsalerts-emergency-mass-notification-system/">U.S. Army Deploys AtHoc IWSAlerts Emergency Mass Notification System</a>.”</p>
<p>Businesses, schools, universities, and military groups all employ the usage of emergency alerts, providing mass notifications to everyone in their system. <a href="http://www.army.mil/info/organization/jackson">Fort Jackson</a> brags that their <a href="http://www.athoc.com/">AtHoc</a> alerts span 25,000 personnel and dependents.</p>
<p>AtHoc IWS Alerts offer control from a unified Web-based console, which allows Fort Jackson to send alerts to cell phones, landlines, smart phones, SMS text and email. It’s not just Fort Jackson—AtHoc services more than 1.5 million <a href="http://www.defense.gov/">Department of Defense</a>, more than any other provider.</p>
<p>We learned about AtHoc’s capabilities and infrastructure from the AFV-News article:</p>
<blockquote><p>[The] system integrates with the post’s existing Internet Protocol network services, which means reduced infrastructure and maintenance costs. Personnel accountability is accomplished through the bi-directional capability, allowing responses to notifications in real-time. Network alert delivery and response can be tracked, ensuring that targeted recipients have received and responded to alerts.</p></blockquote>
<p>While alerts for dangerous situations and testing can save lives and are obviously a necessity, mass alert systems also unfortunately end up in too many unnecessary inboxes.</p>
<p>Megan Feil, August 21, 2011</p>
<p>Sponsored by <a href="http://www.pandia.com/enterprise-search" target="_blank">Pandia.com</a></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/08/21/alerts-when-search-is-hit-and-miss/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM May Need a More Robust Classification Solution</title>
		<link>http://arnoldit.com/wordpress/2011/08/18/ibm-may-need-a-more-robust-classification-solution/</link>
		<comments>http://arnoldit.com/wordpress/2011/08/18/ibm-may-need-a-more-robust-classification-solution/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 05:11:30 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Business strategy]]></category>
		<category><![CDATA[Editorial opinion]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Taxonomy]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Text processing]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=19174</guid>
		<description><![CDATA[According to talk around the water cooler, some IBM content and search units are poking around for a classification &#8220;solution&#8221;. We think the rumor is mostly big company confusion since IBM already has software available to assess and address an organization’s content classification needs through the use of several components. According to the IBM website: [...]]]></description>
			<content:encoded><![CDATA[<p>According to talk around the water cooler, some <a href="http://www.ibm.com/us/en/">IBM</a> content and search units are poking around for a classification &#8220;solution&#8221;. We think the rumor is mostly big company confusion since IBM already has software available to <a href="http://www-01.ibm.com/software/data/content-management/assessment.html">assess</a> and address an organization’s content classification needs through the use of several components. According to the IBM website:</p>
<blockquote><p>Most unstructured content is either trapped in silos across the organization or entirely unmanaged “content in the wild.” A majority of that unstructured content can be deemed unnecessary – over-retained, irrelevant, or duplicate &#8211; and should be either decommissioned or deleted.</p></blockquote>
<p>As we understand it, one licenses the  <a href="http://www-01.ibm.com/software/data/content-management/classification/">Classification Module</a> and/or <a href="http://www-01.ibm.com/software/data/content-management/analytics/">Content Analytics</a> software to prevent the previously stated problem and to provide content classification.</p>
<p>Sounds great like the ads for IBM mainframes and the promotional information about</p>
<p>But a disturbing question to the ArnoldIT goslings who wear blue IBM logos: What if this stuff costs too much and does not deliver on the fly classification for real time processing of tweets and Google Plus public content?</p>
<p>Maybe an IBM box of parts with an expensive IBM engineering team is not exactly what some outfits require? Perhaps IBM should look around and maybe snap up one of the hot players in the space. IBM has been announcing partnerships with a <a href="http://www-01.ibm.com/software/data/bigdata/business-partners-profiles.html" target="_blank">number of interesting companies</a>. We track  <a href="http://www.digitalreasoning.com/">Digital Reasoning</a> and and think its technology looks very promising? IBM is in a good position to have an impact in the data analysis space, but it needs tools that go beyond its in house code and <a href="http://www-01.ibm.com/software/data/cognos/" target="_blank">Cognos</a> and <a href="http://www-01.ibm.com/software/analytics/spss/" target="_blank">SPSS</a> methods in our opinion.</p>
<p>Jasmine Ashton, August 19, 2011</p>
<p>Sponsored by <a href="http://www.pandia.com">Pandia.com</a>, publishers of <a href="http://www.arnoldit.com/wordpress/landscape">The New Landscape of Enterprise Search</a></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/08/18/ibm-may-need-a-more-robust-classification-solution/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Social Content Feed Tool from Know about It</title>
		<link>http://arnoldit.com/wordpress/2011/08/02/social-content-feed-tool-from-know-about-it/</link>
		<comments>http://arnoldit.com/wordpress/2011/08/02/social-content-feed-tool-from-know-about-it/#comments</comments>
		<pubDate>Tue, 02 Aug 2011 05:05:58 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Social]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=18888</guid>
		<description><![CDATA[When all your Facebook, Twitter, and other social streams become so convoluted, you might miss out on that link, photo, or music video you would’ve loved. You’ll never know &#8211; until now…maybe. Marshall Kirkpatrick looks at the new start-up, Know About It, in “New Service Sniffs out Secret Gems from across Your News Feeds.” The [...]]]></description>
			<content:encoded><![CDATA[<p>When all your Facebook, Twitter, and other social streams become so convoluted, you might miss out on that link, photo, or music video you would’ve loved. You’ll never know &#8211; until now…maybe. Marshall Kirkpatrick looks at the new start-up, <a href="http://knowabout.it/">Know About It</a>, in <a href="http://www.readwriteweb.com/archives/new_service_sniffs_out_secret_gems_from_across_you.php">“New Service Sniffs out Secret Gems from across Your News Feeds.”</a></p>
<blockquote><p>The service brings in all your subscribed content from major social networks, then offers a number of different ways to sort what it finds. My favorite is the filter called “Potentially Missed &#8211; links from people who don’t share a lot of links.</p></blockquote>
<p><a href="http://knowabout.it/">Know About It</a> explains on its Web site they collect all the links passing through your social streams and perform a “bunch of analysis on each one to determine which are most likely to be of interest to you.”</p>
<p>Sounds helpful. The idea of sorting all your inbound information in a variety of ways is appealing. You can also look at the service’s recommendations based on your expressed interest or get a personalized email digest.</p>
<p>Mr. Kirkpatrick has not yet tested the service but likes the idea. What isn’t mentioned? Privacy. So what is the ‘bunch of analysis’ and where do all those links end up? Advertisers? If the start-up is successful, time will tell. But with the social web moving at a never-ending pace and growing, social media users wanting to sort their feeds likely won’t mind too much. We think these types of tools are likely to grow in importance as free real time search becomes a difficult service to monetize.</p>
<p>Philip West, August 2, 2011</p>
<p><span style="font-size: x-small; color: #800000;">Sponsored by </span><a href="http://www.pandia.com" target="_blank"><span style="font-size: x-small; color: #800000;">Pandia.com</span></a><span style="font-size: x-small; color: #800000;">, publishers of </span><a href="http://www.arnoldit.com/wordpress/landscape" target="_blank"><span style="font-size: x-small; color: #800000;">The New Landscape of Enterprise Search</span></a></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/08/02/social-content-feed-tool-from-know-about-it/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Synthesio Releases New Social Media Monitoring Tool</title>
		<link>http://arnoldit.com/wordpress/2011/05/22/20-may-synthesio-releases-new-social-media-monitoring-tool/</link>
		<comments>http://arnoldit.com/wordpress/2011/05/22/20-may-synthesio-releases-new-social-media-monitoring-tool/#comments</comments>
		<pubDate>Sun, 22 May 2011 05:13:49 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Business strategy]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Text processing]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=17690</guid>
		<description><![CDATA[More social media monitoring. “This New Dashboard Lets You Monitor Social Media Conversations About Your Brand Everywhere describes a dashboard called Unity. The solution is from Synthesio, and it could quickly become an essential marketing tool. Unlike TweetDeck, Unity is not free. However, the cost may be worth it. The article points to two components [...]]]></description>
			<content:encoded><![CDATA[<p>More social media monitoring. “<a href="http://www.businessinsider.com/synthesio-unity-2011-5">This New Dashboard Lets You Monitor Social Media Conversations About Your Brand Everywhere</a> describes a dashboard called Unity. The solution is from <a href="http://synthesio.com/corporate/">Synthesio</a>, and it could quickly become an essential marketing tool.</p>
<p>Unlike <a href="http://www.tweetdeck.com/">TweetDeck</a>, Unity is not free. However, the cost may be worth it. The article points to two components that put this app far ahead:</p>
<ul>
<li>“It monitors much more than Facebook or Twitter, in particular it crawls user forums, which is trickier and in practice is often much more important for many brands;</li>
<li>It works in over 30 languages. Synthesio has teams of translators around the world and around the clock that monitor conversations in many languages and make it all accessible to marketers in one dashboard.”</li>
</ul>
<p>For your money, you get information about how to customize your dashboard. Regular analytic reports are available for an added cost. Such monitoring of the real time environment may soon be essential for companies to stay competitive, “or well”, bring the future home today..</p>
<p>Cynthia Murrell, May 22, 2011</p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/05/22/20-may-synthesio-releases-new-social-media-monitoring-tool/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Digital Reasoning Continues to Expand</title>
		<link>http://arnoldit.com/wordpress/2011/05/16/digital-reasoning-continues-to-expand/</link>
		<comments>http://arnoldit.com/wordpress/2011/05/16/digital-reasoning-continues-to-expand/#comments</comments>
		<pubDate>Mon, 16 May 2011 05:05:24 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[Business intelligence]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Semantic]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Text analytics]]></category>
		<category><![CDATA[Text processing]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=17665</guid>
		<description><![CDATA[Move over Palantir and i2 Ltd. Digital Reasoning is expanding due to its rapid growth. As reported in MSN’s “Digital Reasoning Introduces Federal Advisory Board,” the data analytics leader has created a board to guide its push into the federal market. We learned: With the federal government&#8217;s increased focus on cloud computing, (Digital Reasoning’s) flagship [...]]]></description>
			<content:encoded><![CDATA[<p>Move over Palantir and i2 Ltd. Digital Reasoning is expanding due to its rapid growth. As reported in MSN’s <a href="http://www.msnbc.msn.com/id/42971621">“Digital Reasoning Introduces Federal Advisory Board,”</a> the data analytics leader has created a board to guide its push into the federal market. We learned:</p>
<blockquote><p>With the federal government&#8217;s increased focus on cloud computing, (<a href="http://www.digitalreasoning.com/">Digital Reasoning’s</a>) flagship product <a href="http://www.digitalreasoning.com/products/">Synthesys®</a> provides a unique Entity Oriented Analytics solution that enables government agencies to tap into the power of big data. The Advisory Board represents a team with unique insight into the requirements of Big Data, text analytics and intelligence solutions for government agencies.</p></blockquote>
<p>The board members are: Gen. William T. Hobbins, who retired as Commander, U.S. Air Forces in Europe; Bob Flores, founder and president of <a href="http://www.applicology.com/">Applicology Inc.</a>, who spent 31 years in the US intelligence community; Anita K. Jones, who managed the Department of Defense’s science and technology program; Capt. Nick Buck, who spent 15 years in National Security Space, including 10 years in the National Reconnaissance Office; and Mike Miller, currently president of <a href="m4associates.com">M4 Associates</a> and previously VP of Juniper Networks&#8217; Public Sector Division where he was responsible for all business with Juniper&#8217;s Public Sector customers in the US. This kind of talent should be valuable guiding Digital Reasoning’s federal sector strategy.</p>
<p>We have tracked this Franklin, Tennessee, company since its inception. To get some insight into the firm’s approach, you may want to read these two interviews ArnoldIT.com, the owner of this news service, conducted with Tim Estes, the founder of Digital Reasoning. The <a href="http://www.arnoldit.com/search-wizards-speak/digital-reasoning.html" target="_blank">February 2010 interview</a> explores the core technology of the firm and how it differs from other vendors’ methods. The <a href="http://www.arnoldit.com/search-wizards-speak/digital-reasoning-2.html" target="_blank">December 2010 interview</a> probes the new version of the firm’s flagship technology.</p>
<p>Stephen E Arnold, May 16, 2011</p>
<p><em>Freebie</em></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/05/16/digital-reasoning-continues-to-expand/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Twitter Firehose News</title>
		<link>http://arnoldit.com/wordpress/2011/04/15/twitter-firehose-news/</link>
		<comments>http://arnoldit.com/wordpress/2011/04/15/twitter-firehose-news/#comments</comments>
		<pubDate>Fri, 15 Apr 2011 05:55:09 +0000</pubDate>
		<dc:creator>Stephen E. Arnold</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Real time search]]></category>
		<category><![CDATA[Social]]></category>
		<category><![CDATA[Text analytics]]></category>
		<category><![CDATA[Text processing]]></category>

		<guid isPermaLink="false">http://arnoldit.com/wordpress/?p=17060</guid>
		<description><![CDATA[There is a tweak to the Witter and Mediasift partnership. You can read about it in the DataSift write up “Twitter Partnership”. Mediasift and Twitter have agreed to a partnership that has the potential to change how marketers and companies understand conversations about their products as well as how they choose to market them to [...]]]></description>
			<content:encoded><![CDATA[<p>There is a tweak to the Witter and Mediasift partnership. You can read about it in the DataSift write up “<a href="http://blog.datasift.com/2011/04/04/twitter-partnership/" target="_blank">Twitter Partnership</a>”.</p>
<p><a href="http://mediasift.com/">Mediasift</a> and <a href="http://www.twitter.com/">Twitter</a> have agreed to a partnership that has the potential to change how marketers and companies understand conversations about their products as well as how they choose to market them to target audiences. By utilizing the advanced <a href="http://datasift.net/">DataSift</a> software they are able to break down &#8220;tweets&#8221; into a language that is easily understandable and searchable and is still quite cost effective with it&#8217;s &#8220;pay per use&#8221; subscription. The article said:</p>
<blockquote><p>As a company we have been very fortunate to have access to the Twitter Firehose for quite some time. This has enabled us over the past two years to refine our thinking, leading to the incarnation of DataSift.</p></blockquote>
<p>DataSift compiles multiple social media feeds and additional data sets to create a common abstract layer which provides meaningful insight into much of the chaotic and unstructured data from the outlets. It took nearly 18 months to complete the DataSift platform but it has already seen a huge outpouring of company and marketing support with more than a billion requests per month.</p>
<p>Important stuff for the real time crowd.</p>
<p>Leslie Radcliff, April 15, 2011</p>
<p><em>Freebie</em></p>
]]></content:encoded>
			<wfw:commentRss>http://arnoldit.com/wordpress/2011/04/15/twitter-firehose-news/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

