Monday, September 22, 2008

google guadi

According to its FAQ, GAudi currently works on YouTube’s political channels and uses its own speech recognition software. Eventually, the transcription and indexing will undoubtedly reach other YouTube videos: “the US election is just a first step. We see it as an experiment platform where we can learn what features make the best user experience for people looking for spoken content on the Web.”

Sounds like this technology will be integrated into universal SERPs in the future as well.

audio search on video

Google has elevated the profile of its attempt to make videos searchable through speech recognition technology, a move that portends a potentially more financially successful YouTube division.

The speech recognition technology was used in an online application that let people search political speeches launched in July, and now the Gaudi (Google Audio Indexing) project has an official interface at Google Labs.

Google Audio Indexing (Gaudi) lets people use a text search of some YouTube videos.

Google Audio Indexing (Gaudi) lets people use a text search of some YouTube videos. (Click to enlarge.)

(Credit: Google)

The site's search box has instructions: "Search what the politicians are saying." The search results are presented next to a YouTube video player, and clicking each result sets the player to show the part of the video where the words were spoken. It doesn't just show speeches--a search for "bridge to nowhere" also returned the "Real Mavericks" ad from the John McCain-Sarah Palin campaign.

Extracting words from videos could make it easier for Google to determine what content is in the video and therefore what ads are most appropriate to show next to them. Making money from YouTube is a top priority this year.

Speech-to-text conversion also could help Google blend relevant videos into search results. Currently, the best way to understand what's in a video is by examining the accompanying metadata, such as titles and captions, but that's often much narrower than what's spoken.

And with Google's translation work, it's possible that the company could transcribe videos' text into other languages.

Clearly, Google has big ambitions for the audio recognition technology. "The aim of Google Audio Indexing on Google Labs is broader (than that of the and the Google Elections Video Search gadget), and the U.S. election is just a first step. We see it as an experiment platform where we can learn what features make the best user experience for people looking for spoken content on the Web," the company said in a frequently-asked-questions page about the Google Audio Indexing project.

Google is beginning with political information because it's trying to become a prominent part of the democratic process and because political speeches receive a lot of attention, the company said. Also, presumably because politicians generally don't mumble as much as the rest of us, the speech recognition technology performs better, Google said.

google audio search on video

Google Audio Indexing (search for word from video)

Google's efforts to improve video search by using speech recognition technology started to become visible in July, when Google launched a gadget for searching inside the political speeches from YouTube. The gadget has been expanded into a new service called GAudi (Google Audio Indexing), which is now available at Google Labs.

"Google Audio Indexing is a new technology from Google that allows users to better search and watch videos from various YouTube channels. It uses speech technology to find spoken words inside videos and lets the user jump to the right portion of the video where these words are spoken. (...) Political videos and election materials are a special case of broadcast news content, a domain that has received a lot of academic and industry attention and is known to perform well," explains the FAQ.

The interface is attractive because you can find all the mentions of your keywords and go directly to the appropriate sequence. And if you find an interesting video, there's another search box that lets you search inside that video.

Google's technology is far from being perfect, so you'll find many mistakes. In the video "Barack Obama on the 40th Anniversary of the Prague Spring", "Czechoslovakia" is incorrectly detected as "tech also but there", "free" is replaced by "forty" and there are many other mistakes.

Click on this link to see live searching on u-tube uploade video => http://labs.google.com/gaudi?q=europe&start=0&num=10&longId=1757701392240045293