The Elusive Video Recognition

April 22, 2015

Pictures and video still remain a challenge for companies like Google, Facebook, Apple, and more.  These companies want to be able to have an algorithm pick up on the video or picture’s content without relying on tags or a description.  The reasons are that tags are sometimes vague or downright incorrect about the content.  VentureBeat reports that Google has invested a lot of funds and energy in a deep learning AI.  The article is called “Watch Google’s Latest Deep Learning System Recognize Sports In YouTube Clips.”

The AI is park of a neural network that is constantly fed data and programmed to make predictions off the received content.  Google’s researchers fed their AI consists of a convolutional neural network and it was tasked with watching sports videos to learn how to recognize objects and motions.

The researchers learned something and wrote a paper about it:

“ ‘We conclude by observing that although very different in concept, the max-pooling and the recurrent neural network methods perform similarly when using both images and optical flow,’ Google software engineers George Toderici and Sudheendra Vijayanarasimhan wrote in a blog post today on their work, which will be presented at the Computer Vision and Pattern Recognition conference in Boston in June.”

In short, Google is on its way to making video and images recognizable with neural networks.  Can it tell the differences between colors, animals, people, gender, and activities yet?

Whitney Grace, April 22, 2015

Stephen E Arnold, Publisher of CyberOSINT at www.xenky.com

Comments

Comments are closed.

  • Archives

  • Recent Posts

  • Meta