Enabling Speech-to-Text for YouTube Videos Enhances Business Potential


On Wednesday, Catalog, a Tokyo-based tech start-up that provides media rep services and develops speech recognition technology, released the closed alpha version of a new service that recognizes what is spoken in videos on video sharing services such as YouTube and Nico Nico Douga[J].   In this release, the first 1,000 applicants get access to the service to evaluate its functionality and usability.

The service is named "Moji-Moji TV[J]".   It captures the audio in a video and outputs it as text, which can be used for subtitles as well as for describing and tagging the video itself.   Moji-Moji TV also has a notable feature: it collects people's names, newly coined words and slang expressions from the Internet and adds them to the dictionary of the service's speech-to-text engine.
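
To make the subtitle and tagging idea concrete, here is a minimal sketch of how timestamped speech-to-text output could be turned into SRT subtitle cues and a few naive tags. It is not Moji-Moji TV's actual implementation or API: the Segment structure, the function names and the tagging rule are all assumptions made for illustration. The tagging step splits on whitespace, so it would need a proper tokenizer for Japanese text.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Segment:
    """One recognized chunk of speech (hypothetical structure, not the service's format)."""
    start: float  # seconds from the start of the video
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used in SRT files."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n{seg.text}\n"
        )
    return "\n".join(cues)


def top_tags(segments: list[Segment], limit: int = 5, min_length: int = 4) -> list[str]:
    """Pick the most frequent words of at least min_length characters as naive tags."""
    words = Counter()
    for seg in segments:
        for word in seg.text.lower().split():
            word = word.strip(".,!?\"'")
            if len(word) >= min_length:
                words[word] += 1
    return [word for word, _ in words.most_common(limit)]


if __name__ == "__main__":
    # Made-up sample data standing in for a recognizer's output.
    sample = [
        Segment(0.0, 2.5, "Welcome to the demo"),
        Segment(2.5, 5.0, "This demo shows subtitles"),
    ]
    print(to_srt(sample))
    print(top_tags(sample, limit=3))
```

Keeping the recognized text as timestamped segments is what lets the same output serve both uses described above: the timestamps drive the subtitles, while the text alone is enough for tags and descriptions.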

Based on feedback from the alpha testers, the company plans to improve and enhance the service ahead of its official release, which is scheduled for several months from now.

Incidentally, Google released a gadget called Election Video Search last summer, which lets you search a collection of YouTube videos for what was spoken, using Google's speech-to-text technology (American English only).


Author Information  Masaru IKEDA has co-founded several system integration companies and consulting firms in Tokyo. He contributes serial columns to nationwide newspapers and IT periodicals, and currently serves as a tech consultant for several web companies. His biography is here. His private blog is here.


  • http://jp.techcrunch.com/archives/20091112infinity-ventures-summit-in-miyazaki-japan-12-demos-from-japanese-startups/ 12 Japanese companies demo at the Infinity Ventures Summit in Miyazaki, with a great many AR apps

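    [...] The winner, Moji Moji TV, appears to be a very powerful speech recognition and transcription service for video, and it launched in closed alpha last month. It currently supports only Japanese, but English and Chinese versions are in development. Moji Moji extracts the audio from a video (your own movies, YouTube clips and so on), automatically converts it to text and displays it. The text can be used as tags or subtitles for the movie, and it can also be searched. Shabetter, an iPhone application, automatically transcribes words spoken into the iPhone's microphone and posts them to Twitter. An English description of Moji Moji TV is here. [...]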