CyberOSINT banner

MovieGraph from Senzari Offers a Better Way to Find Movies

March 5, 2015

The article on Dataversity titled Creating Detailed Semantic Graphs Around Video Content with MovieGraph suggests a possible breakthrough in video sense making. MovieGraph is the platform of entertainment data company Senzari. Chief Operating Officer Demian M. Bellumio spoke to the methods utilized by MovieGraph, which include machine learning and an API for recommendations. The article continues to refer to Bellumio’s statements,

“Senzari focused on metadata while building MovieGraph. He also said that Senzari trained machine learning algorithms to break down the narratives of movies, extracting the data with precision across each element. The company designed their own matrix for cataloging movies; MovieGraph uses machine learning techniques to semantically tag and organize every movie and TV show across hundreds of dimensions. Senzari also added proprietary narrative features to MovieGraph such as setting, conflict, symbols or tones present in a film.”

The possibilities for recommendations seem much more targeted than the Netflix model, which often makes suggestion based on categories that are too wide and abstracted to be accurate. The article mentions that since Netflix only recently closed its public API, MovieGraph may be in a position to fill that gap. MusicGraph is also built to work with MusicGraph, another Senzari platform. Content creators in particular might find the crossover to be useful in terms of finding appropriate content for their projects.

Chelsea Kerwin, March 05, 2015

Sponsored by ArnoldIT.com, developer of Augmentext

Google: Just the Facts, Folks

March 2, 2015

Short honk: I read “Google Wants to Rank Websites Based on Facts Not Links.” The article could be a jumping off point for some dictionary excitement. The article reports:

Google research team is adapting that model to measure the trustworthiness of a page, rather than its reputation across the web. Instead of counting incoming links, the system – which is not yet live – counts the number of incorrect facts within a page. “A source that has few false facts is considered to be trustworthy,” says the team (arxiv.org/abs/1502.03519v1). The score they compute for each page is its Knowledge-Based Trust score.

A couple of questions come to my tiny mind:

  1. What is a fact?
  2. When two documented facts conflict, which fact is more correct? Example: competing theories in physics about dark matter.
  3. What is knowledge?
  4. Will Google be able to manage knowledge is a manner that satisfies “experts”?

The PageRank thing drives so much ad cash because statistical funkiness seems to make intuitive sense for popularity rankings. Will facts generate equivalent financial excitement?

I suppose Google could license Watson and just ask IBM’s system? Here’s what the questions might look like:

Watson, what’s a fact? What’s knowledge? What’s accurate information?

And Watson’s answer, “Tamarind.”

Stephen E Arnold, March 2, 2015

Enterprise Search: A Haunting Phrase Evokes the Ghosts of Failed Search Past

March 1, 2015

I read a practical explication about setting up SharePoint search to facilitate people search. I am okay with the approach in “More SharePoint 2013 Search Tips for Power Users.” The publisher is one focused on generating received wisdom or hoped for truths. That’s okay.

I did note one important and telling phrase. Here is is:

So how can this knowledge be used to create a real business solution?

When I read this sentence, several thoughts flitted through my mind. Here they are:

  1. Is this opposed to an unreal business solution.
  2. Are search solutions chimera?
  3. 3. Are search solutions false, fake, ersatz?

Am I unduly sensitive to a single statement? No. The phrase strikes at the core of search challenges, not just sticky wickets of the SharePoint variety.

Search allows individuals to “find” something in theory. The reality is that what search outputs for a user crafty enough to use the right term, phrase, or hot link is often wildly off the mark.

The fix is to layer additional controls on top of a child’s wagon, not a vehicle designed to carry the weight of today’s information access requirements.

Result? Search is an endless disappointment to users. How does one find a person in an organization who can and will answer a question? More than search is required in my experience.

Stephen E Arnold, March 1, 2015

A Dark Search Engine

February 27, 2015

If anyone mentions the dark Web or the invisible Web, most people would make a Star Wars reference and insert a Darth Vader quote. While getting in touch with your “dark side” can help even out your personality, searching the dark Web reveals a whole new world of information. The only problem is that there isn’t a strict search engine for it. Wired explains that “Darpa Is Developing A Search Engine For the Dark Web.” Darpa is creating a dark Web search engine to help law enforcement discover patterns and relationships in online data about illegal activities.

“The project, dubbed Memex, has been in the works for a year and is being developed by 17 different contractor teams who are working with the military’s Defense Advanced Research Projects Agency. Google and Bing, with search results influenced by popularity and ranking, are only able to capture approximately five percent of the internet. The goal of Memex is to build a better map of more internet content.”

The search engine’s main goal is to have a one-size-fits all approach to search results. The data will not only be pulled from the same places commercial search engines crawl, but also the dark Web hidden sites that include TOR network’s Hidden Services. The Memex team also want to automate methods to analyze the data to save law enforcement research time.

Memex is only a tool for uncovering the dark Web, how it is used depends on the organization. It is estimated Memex will cost between $10-20 million to fund.

Whitney Grace, February 27, 2015
Sponsored by ArnoldIT.com, developer of Augmentext

Interface Changes for Google Mobile Search

February 26, 2015

Google is the top search US search engine for many reasons and it can maintain this title because the company is constantly searching (pun unintended) to improve its products and services. Google wants to deliver high quality search just as much as it wants to stay ahead of its competition. Mobile search is one of the most competitive digital markets and Google has developed ways to augment its already popular mobile application. BGR highlights the new changes to its mobile search as described in “Google’s Latest Mobile Search Change Brings Some Key Interface Changes.”

One feature that changes is the “Google box” that displays results that are supposed to be the best matches for a query. The Google box will also have a news carousel that lists the latest information on the query.

“ ‘When you search for a topic, just scroll down to see a ‘carousel’ of recent articles, videos or more on that subject,’ Google Search product manager Ardan Arac wrote in a blog post. Tap any link to read or watch exactly what you’re interested in. For example, if you search for NPR, you’ll see links to all their latest articles and videos.”

Google is doing its best to improve mobile search, a task that has usually evaded mobile devices. Mobile technology needs to have more features that are readily available on laptops and computers to make them more reliable and useful.

Whitney Grace, February 26, 2015
Sponsored by ArnoldIT.com, developer of Augmentext

Google Search: Live Chat, But Relevance?

February 25, 2015

i read “Google Tests Live Chat With Businesses From Search Results.” According to the write up:

Google is testing out a service that incorporates live chat with businesses right into search results, via a new link that shows whether a business is currently available, and immediately launches a chat via Google Hangouts (on either desktop or mobile) if they are.

I have been doing the online research thing for years. Ellen Shedlarz, formerly Booz, Allen New York’s head information guru, exposed me to commercial online systems in 1973 or so. She was kind enough to let me fumble away with a dorky dumb terminal with bunny rabbit ears.

In the last 40 years, I have to make a confession about my stupidity. I never wanted to enter into a live chat with a person who wrote an article, offered a product, or pretended to be an expert like a mid tier search expert with a degree in English.

I wanted to perform what I naively thought was research. I would obtain information, either print out information or copy it on the 5×8 inch note cards my debate coach in high school mandated I use for research, thus forming a life long habit. I would then read the information I gathered, make notes, and prepare more note cards with identifiers that allowed silly old me to find the connections among the note cards.

After I knew what the heck I was learning, formulating my questions, and then thinking about whom I could approach for more information—then I wanted to talk to a human with alleged expertise.

No wonder I am a loser. When I enter a query for “terminal”, I want to enter a category code so I get the exact meaning of terminal I have in mind and information directly related to documents with that notion of terminal. When I want terminal for a train, I want train stuff.

I suppose now I can run a query for terminal and see these “relevant” results:

image

Perhaps I can ring the director of the motion picture and ask the fellow where the train station is. Seems very useful, just not to me. Google, how about a return to relevance?

Stephen E Arnold, February 25, 2015

WCC Smart Search and Match Partners with Diona Mobility

February 23, 2015

We learn of a recent deal from PR Newswire’s post, “Government Employment Services Brought into the 21st Century Through WCC and Diona Partnership.” The deal will bring WCC’s search-and-match technology to Diona Mobility’s human-services solutions. The write-up tells us:

“The partnership will integrate WCC solutions with Diona Mobility solutions to provide clients in the employment market with unique options when using mobile devices to find sustainable and appropriate jobs. WCC and Diona will deliver leading-edge solutions to help clients and their caseworkers efficiently find jobs that match the clients’ skills. Clients and caseworkers can locate timely employment positions through their smartphones and tablets while on the go.

“The mobile solution pioneered by WCC and Diona will provide jobseekers with:

*Real-time notifications of available matching jobs;

*The ability to manage their profile and skills, and search for jobs on the fly;

*Benchmarking, analytics and insight into career opportunities; and

*Access to enrollment and referral services for courses and training programs.”

WCC’s CEO Peter Went emphasizes his company’s experience with some of the biggest staffing firms and public employment services around the world, and praises Diona’s focus on quality and customer access. Launched in 1996, WCC is headquartered in Utrecht, The Netherlands, and has locations in the U.S., Serbia, and Saudi Arabia. Founded in 2012 and based in Ireland, Diona maintains several offices around the world. They aim to make social services globally accessible through mobile platforms by 2020.

Cynthia Murrell, February 23, 2015

Sponsored by ArnoldIT.com, developer of Augmentext

Kelsen Enters Legal Search Field

February 23, 2015

A new natural-language search platform out of Berlin, Kelsen, delivers software-as-a-service to law firms. Basic Thinking discusses “The Wolfram Alpha of the Legal Industry.” Writer Jürgen Kroder interviewed Kelsen co-founder Veronica Pratzka. She explains what makes her company’s search service different (quote auto-translated from the original German):

“Kelsen is generated based on pre-existing legal cases not a search engine, but a self-learning algorithm that automatically answers. 70-80 percent of the global online data are very unstructured. Search engines look for keywords and only. Google has many answers, but you have to look for them yourself thousands of search results together and hope that you just entered the correct keywords. Kelsen, however, is rather a free online lawyer who understands natural language practitioner trained in all areas of law, works 24/7 and is always up-to-date….

“First Kelsen understands natural language compared to Google! That is, even with the entry of long sentences and questions, not just keywords, Kelsen is suitable answers. Moreover, Kelsen searches ‘only’ relevant legal data sources and provides the user with a choice of right answers ready, he can also evaluate.’
“One could easily Kelsen effusive as ‘the Wolfram Alpha the legal industry,’ respectively. We focus on Kelsen with legal data structure and analyze them in order to eventually make available. From this structuring and visualization of legal data not only seeking advice and lawyers can benefit, but also legislators, courts and research institutions.”

Pratzka notes that her company received boosts from both the Microsoft Accelerator and the IBM Entrepreneur startup support programs. Kelsen expects to turn a profit on the business-to-consumer side through premium memberships. In business-to-business, though, the company plans to excel by simply outperforming the competition. Pratzka seems very confident. Will the service garner the attention she and her team expect?

Cynthia Murrell, February 23, 2015

Sponsored by ArnoldIT.com, developer of Augmentext

Partition the Web to Manage It

February 22, 2015

I noted that the mid February 2015 Forbes article did not get much coverage. “US Defense Giant Raytheon: We Need To Divide The Web To Secure It” contains a suggestion that could, if implemented, force changes upon Bing, Google, and other Web indexing outfits.

Here’s the passage I highlighted in lovely ice blue:

But some, including Michael Daly, chief technology officer for cyber security at US defense giant Raytheon, believe that the web needs to be divided into communities. As more critical devices, from insulin pumps to cars, connect to the internet, the more likely a genuinely destructive digital attack will occur. To stop this from happening, some people just shouldn’t be allowed into certain corners of the web, according to Daly.

There are some interesting implications in this notion.

Stephen E Arnold, February 22, 2015

Antidot Semi-Pivot to eCommerce Search

February 21, 2015

I wanted to capture Antidot’s semi pivot from enterprise search to eCommerce search. The French company provides a useful description of its afs@store product. If you bang this product name into the GOOG, you find that the American Foundry Society, Associated Food Stores, and the American Fisheries Society push Antidot’s product down the results list. In general, names of search and content processing systems often disappear into search results. Perhaps Antidot has a way to make the use of the “@” sign somewhat less problematic.

The system, according to Antidot, system delivers features that sidestep the unsticky nature of most eCommerce customer visits. Antidot asserts:

  • Rich, tolerant and customizable auto complete featuring products, brands, categories…
  • Fully typo-tolerant search
  • Semantic search that understands your customer’s words
  • Dynamic filtering facets to rapidly select desired products
  • Web interface to simply monitor and manage your searchandising

the company offers a plug in for Magento, the open source eCommerce system, that enjoyed love from eBay. It is difficult to know if that love is growing stronger with time, however.

I did notice that the “See and read more” panel had zero information and no links. Hopefully this void will be addressed.

Stephen E Arnold, February 21, 2015

Next Page »