Searching User-generated Video Content based on Link Relationships between Videos and Blogs


Keywords: User-generated Video, Blogs, Video Search

Due to the spread of digital cameras and video-sharing services, large amounts of user-generated video content are now being posted on the Internet. This makes it harder for users to efficiently discover content that they are interested in watching. As a means of discovering worthwhile content in this large quantity of video material, we proposed a video search method that focuses on the networks formed by user-generated videos and the blogs that link to them, and we implemented this search method as an application.

Service & Solution Development Department
Kozo Noaki, Takeshi Kato, Yasushi Kondo

1. Introduction

In recent years, multimedia content has broken free from physical media such as CDs and DVDs and can now be obtained online. Some content delivery service providers are also starting to offer video and movie delivery via applications. Meanwhile, users are not only interested in watching professional content from big businesses but are also diversifying into user-generated content. For example, there seems to be an upsurge of commercial interest in niche markets such as fanzines and independent music artists.

YouTube(TM)*1 is a typical example of a service that supports the online circulation of user-generated content. In January 2009, it was reported that the number of people using YouTube in the United States every month had passed 100 million (comScore U.S. survey report). Video content is now used very heavily.

But unlike Web space, which is built from HTML documents where hyperlinks provide a structure for cross-referencing, the data structure of video content does not include links to other content. This makes it hard for users to browse content by following links as they do when surfing Web pages. It could thus be said that video content is a medium in which it is difficult for people to discover content they are interested in, and which still has a high threshold for everyone except heavy users. To break through this issue, we have proposed a video search method that focuses on the networks formed by user-generated video content and the blogs that link to it as a means of discovering content of value in user-generated video. In this article, we describe this proposed method and an application that uses it.

*1 YouTube: A trademark or registered trademark of Google, Inc.

2. The Issue of Video Findability

The findability of information is determined by various factors ranging from the data structure of the content to the design of the platform on which it is provided. Since the data structure of video content does not include text data or hyperlinks, search techniques for video content generally work by matching and extracting keywords from the plain text or tagged text appearing in the vicinity of the content. However, the surrounding text and tags do not provide a consistently objective representation of video or photographic content, and with a simple keyword search it is difficult for a third party to accurately find the desired content among the diverse variety of content available.

To improve the findability of content, some businesses provide Web browser add-ons that display a thumbnail list of video content in order to increase the visibility of the video and still image content introduced and provided by a site. These thumbnail lists, and the ability to view content by selecting it from a thumbnail, not only make it faster to see a summary of video content but also make content browsing more convenient. However, they do not provide a means of guiding users from the content they are viewing towards other related content.

YouTube addresses this issue by displaying information such as thumbnails and titles of related video content when a user has finished watching a video. By selecting one of these thumbnails, the user can play back the corresponding video. In this way, YouTube builds a link structure between similar videos and provides users with a means of following related content. Although YouTube suggests related videos by presenting a list of videos that appear to be highly related to the current video based on a predetermined relevance calculation method, users cannot find out the basis on which these videos were selected. In other words, the reasons for displaying a particular selection are not made clear, and there is insufficient information on why these videos are being recommended to the user. This makes it hard for users to judge whether or not they should follow these recommendations. We therefore think that content navigation needs to be provided in a new way that is more convenient than current methods.

3. Hypothesis Testing

First, to improve the findability of video content, we focused on the fact that Internet users use video embedding tools and the like to insert recommended videos in their own blogs. We studied a search method based on this fact by testing our hypothesis about a network centered on user-generated video content.

3.1 Hypothesis Relating to Video Content on the Internet

According to a report by the Ministry of Internal Affairs and Communications, there were 16.9 million bloggers on the Internet in Japan in January 2008, and between them they had written approximately 1.35 billion articles [1]. Blogs have rapidly become a popular way for people to effortlessly express their opinions on everyday events, and some 3 million blogs (almost 20% of the total) are updated at least once a month.

At the same time, many video-sharing sites that handle user-generated video content provide functions that make it easy for users who publish information on the Web to embed videos in their own blogs and Web sites. Such functions not only promote the publishing of information that uses videos and other multimedia content as well as text, but are also thought to facilitate the construction of new networks on the Internet due to the strong expressive capabilities of video content and the ease of user cooperation.

In this study, we put forward and test the hypothesis that user-generated video with many views is linked to from many Web pages, and that these linking pages link to other videos, thereby constructing an independent network with video content as nodes.
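
To make the hypothesized network concrete, the following minimal Python sketch uses a small, made-up mapping from Web pages to the videos they link to. All page and video names are hypothetical and are not data from the study. Reverse links lead from a video to the pages that link to it, and forward links lead from those pages to other videos, which then become neighbouring nodes in a network of videos.

```python
from collections import defaultdict

# Hypothetical toy data: each Web page (e.g. a blog) maps to the videos it links to.
PAGE_LINKS = {
    "blog_a": ["video_1", "video_2"],
    "blog_b": ["video_1", "video_3"],
    "blog_c": ["video_3"],
}


def reverse_links(video, page_links):
    """Return the pages that link to the given video (reverse links)."""
    return sorted(page for page, videos in page_links.items() if video in videos)


def video_network(page_links):
    """Connect two videos whenever at least one page links to both of them."""
    neighbours = defaultdict(set)
    for videos in page_links.values():
        for video in videos:
            # Forward links: the other videos that the same page links to.
            neighbours[video].update(v for v in videos if v != video)
    return {video: sorted(others) for video, others in neighbours.items()}


pages_for_video_1 = reverse_links("video_1", PAGE_LINKS)
network = video_network(PAGE_LINKS)
print(pages_for_video_1)   # pages that link to video_1
print(network["video_1"])  # videos reachable from video_1 via a shared page
```

In this toy example, video_1 becomes connected to video_2 and video_3 only because blog pages link to them together, which is the kind of structure the hypothesis expects to find on the real Web.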

3.2 Summary of Test Results

In experiments performed to test the hypothesis [2], the following facts became clear:

- When we acquired Web pages from the Internet that contain links to the user-generated video content selected as samples (reverse links), we were able to obtain at least 100 Web pages in each case.
- When we analyzed these Web pages to collect the videos that they link to (forward links), we were able to acquire up to 2,253 items of content.
- When we classified the Web pages that link to user-generated video content, we found that blog pages accounted for approximately 40% of them.

3.3 Testing Method

Figure 1 shows the data retention format of user-generated video information and Web page information, and Figure 2 shows the procedure for extracting the link structure from this information. To acquire the Web pages linking to video content, we used the reverse link analysis function provided by an external search service. As the starting points for the data collection sequence, we selected user-generated videos that had been published at least one year earlier and had received at least 10,000 views, from five categories that appeared to be browsed by viewers with different attributes. The specific categories were news videos, self-made clay animation videos, self-made robot videos, animal videos and self-made advertising videos. Between five and eight items of content were selected for each category.

Figure 1 Data retention format (video table, Web page table)

Video table
Video ID   Video title                   Video URL
1          P-38 Lightning, Arizona Air   http://www.mytube.com/watch/xx
2          Searching for condors in      http://www.mytube.com/vxxx/yy

Web page table
Web page ID   Title     Video ID   Web page URL
1             My Blog   1          http://www.myblog..
2             Diary     1          http://www.mydiary..

Figure 2 Link structure extraction flowchart
1. Set a video to be used as a starting point.
2. Register the video's URL in the video table.
3. Search for Web pages that link to the video URLs registered in the video table.
4. Acquire the HTML source of each Web page in the search results.
5. Search for corresponding video URLs in the HTML source.
6. If a corresponding video URL was found, register the Web page URL in the Web page table.
7. Check whether the page contains links to videos other than the video URL registered in the video table.

3.4 Data Collection Results

In the experiments, we imposed the following restrictions:

- Acquire up to ten linking Web pages for each user-generated video.
- Finish when the total number of Web pages acquired has reached approximately 100.

The top three results in terms of the number of items of video content were extracted and summarized for each category (Table 1). In each case, the number of acquired Web pages was never less than 100, and we were also able to collect an adequate number of videos. This shows that there is a clear link structure between videos and Web pages, and that they form a fairly large network.

Table 1 Web page/video content acquisition results

Category                       ID   Number of views   Upload date (YYYY/MM/DD)   Number of video content acquired
News videos                    N1   10,414,660        2007/01/10                 1,737
News videos                    N2   609,437           2006/12/10                 1,361
News videos                    N3   1,664,550         2007/01/27                 1,333
Self-made clay animation       C1   1,587,420         2007/12/08                 973
Self-made clay animation       C2   193,092           2007/09/26                 954
Self-made clay animation       C3   144,040           2007/12/08                 786
Self-made robot videos         R1   1,226,961         2007/08/25                 1,764
Self-made robot videos         R2   2,022,642         2007/03/27                 1,268
Self-made robot videos         R3   7,757,335         2008/03/17                 908
Animal videos                  A1   17,387,750        2007/01/12                 2,253
Animal videos                  A2   4,523,041         2006/10/25                 1,526
Animal videos                  A3   12,168,739        2007/03/19                 1,413
Self-made advertising videos   K1   92,055            2007/03/16                 1,724
Self-made advertising videos   K2   144,889           2007/03/07                 1,009
Self-made advertising videos   K3   139,966           2007/02/01                 973

Number of Web pages acquired: 102, 104, 103, 102, 105, 103, 104, 105, 103, 104 (only ten of the fifteen values survive in this transcription, so they are not assigned to individual rows).

To classify the collected Web pages based on their content, we extracted 100 pages at random and classified them visually (Figure 3). As a result, we were able to classify them into four types: news sites, blogs (including the blogs of Japanese Web services and other sites with commenting and trackback functions and the like), bulletin boards and other pages. Regardless of the starting video category, we found that blog pages accounted for between 30 and 50% of the results, while news pages and bulletin boards only accounted for a few percent.

[Figure 3: Classification structure of extracted Web pages. Share of blogs, news sites, bulletin boards and others for each video category, on a scale of 0% to 100%.]

3.5 Content Importance Calculation Method

We calculated the importance of each item of video content based on the reasoning that most of the content linked from good quality Web sites is good quality content. For this calculation, a good quality Web page was taken to be a Web page that most users consider when selecting video content. The variables necessary for this calculation are defined as follows:

- Importance of the linking Web page: A number associated with a Web page that contains a link to video content; it is added to when a user views that video content by visiting the Web page.
- Number of links in the linking Web page: The number of items of video content that a Web page links to.

An example of the calculation method is shown in Figure 4. The importance values attributed to linking Web pages W1, W2 and W3 (10, 12 and 3 points) are divided by the number of content items that these pages link to (2, 1 and 2). The resulting values (5, 12 and 1.5 points) are used as weighting values for the linked video content C. The importance information retained for video content C corresponds to the sum total of these values, which is 18.5 points.

Figure 4 Importance calculation example
Linking Web page W1: importance 10 points, number of references 2, contributes +5 (= 10/2)
Linking Web page W2: importance 12 points, number of references 1, contributes +12 (= 12/1)
Linking Web page W3: importance 3 points, number of references 2, contributes +1.5 (= 3/2)
Content information C: importance 18.5 points

By using these proposed methods to retain the importance values calculated from Web pages and video content, these values can be used as an independent ranking measure. The advantage of the proposed method is that video content linked from highly important Web pages (i.e., Web pages that are frequently referred to by users when selecting videos) is calculated as being of higher importance, so that it can be ranked more highly regardless of how many or how few times it has been viewed. For example, it is expected that even a high-quality video that has not had many views because it is not made for a general audience will float up higher in the rankings if it is linked from the Web site of an owner who is trusted as an introducer.

4. User-generated Video Navigation System

We proposed a link extraction algorithm that forms the basis of this system, and we tested its effectiveness by using it to collect real data. We also devised methods for using link information to calculate the importance of content, and we constructed a prototype system incorporating these methods.

4.1 System Overview

The system configuration is shown in Figure 5.
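
The importance calculation of Section 3.5, which the prototype incorporates, can be written as a short sketch. The function and variable names below are illustrative assumptions rather than names from the actual system, and how each page's importance value is obtained is outside the sketch; the input values are those of the Figure 4 example.

```python
def content_importance(linking_pages):
    """Sum, over all linking pages, of page importance / number of videos linked.

    linking_pages is a list of (importance, link_count) pairs, one per
    Web page that links to the video whose importance is being calculated.
    """
    total = 0.0
    for importance, link_count in linking_pages:
        if link_count <= 0:
            raise ValueError("a linking page must link to at least one video")
        # Each page shares its importance equally among the videos it links to.
        total += importance / link_count
    return total


# Figure 4 values: (importance of linking page, number of links in that page)
FIGURE_4_PAGES = [(10, 2), (12, 1), (3, 2)]  # W1, W2, W3
score = content_importance(FIGURE_4_PAGES)
print(score)  # 18.5
```

Because each page's importance is divided by its number of links, a page that links to only a few videos passes more weight to each of them than a page that links to many.
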
The system comprises three elements: a video search application that runs on an HT-03A terminal, a video information analysis server that performs processes such as extracting link relationships, and an external server group on the Internet.

[Figure 5: System overview. The video search application on the HT-03A (Android(TM) terminal) sends video search requests (keywords, etc.) and blog digest requests (video URLs, etc.) to the video information analysis server, and receives video search results (video metadata, etc.) and blog digest responses (titles/digest articles, etc.). The server, which provides crawling functions, request processing functions and history/statistical information management functions, sends video ranking requests and video reverse link search requests (video URL) to the external server group (video-sharing service, search server), and receives video metadata (original video of search) and search results (blog URLs linking to the video). It searches for reverse links and forward links to videos on the Internet based on the metadata of high-ranking videos in the external servers.]

Android: A trademark or registered trademark of Google, Inc.

4.2 Video Information Analysis Server

The video information analysis server consists of the following three functions:

1) Crawling Function
This collects data based on the link relationship extraction algorithm. It combines the functions of searching for forward links based on a search for reverse links from the reference video, creating blog digests, and acquiring video attribute information such as video thumbnail URLs, and thereby continuously collects Web pages and video content information. The collected information is converted into a database structure and stored. The blog pages collected here were obtained from Web services offered by Japanese providers.

2) Request Processing Function
This receives requests from the video search application by HTTP, processes these requests, and sends back the results of processing, for example for video searches and blog digest requests.

3) History/Statistical Information Management Function
This function receives information on the view counts of video content, the viewing histories of individual users, the hit counts of Web pages and the browsing histories of users from the video search application, and collects and stores this information on the server. The importance calculations are performed based on the numbers stored by this function.

4.3 Correspondence of Video Search Application and System Processing

Figure 6 shows the correspondence between operations from the video search application and the database that is built up by the processing of the video information analysis server. The video information window, blog digest list window and link destination list window correspond to the videos themselves, the blog pages that link to these videos, and the other videos that these pages link to. The blog and video lists are sorted based on the results of importance calculations.

[Figure 6: Operations from the video search application. The video information window, blog digest list window and linked video list window of the application correspond, through reverse link analysis and forward link analysis, to video, blog page and video records in the database.]

Since this system displays a list of the blog pages that link to video content as a means of conveying information relevant to this video content, it can be expected to enable users to obtain information relating to the popularity of videos from the number and content of the blog pages that link to them. Also, whenever a user watches the video content that a blog page links to, the ranking of this blog page increases. Blogs with higher rankings can thus be regarded as video content connoisseurs, and these blog pages can be relied on when searching for other video content.

5. Conclusion

In this article, we have given an overview of a system that implements a video search method using the link relationships between video content and blogs, with the aim of popularizing the use of video content on mobile terminals. The basic concepts of the search method and importance calculations presented here are applicable not only to video content but also to multimedia content such as photos and music, and to downloadable content such as application software. With the introduction of the High Speed Uplink Packet Access (HSUPA) service, it is expected that much more video content will be uploaded to the Internet from mobile terminals. Internet culture tolerates all manner of creative values, and has come to occupy an important position as a source of activity among creators and users.
Now that Internet use is shifting away from PCs and expanding into mobile terminals, services that provide a point of entry for mobile terminals can be said to bear a large responsibility for the further development of this culture. Based on this system's test results, we plan to continue developing technologies and services that can reliably and proactively bring users into contact with information by working to improve the findability of information.

References

[1] Institute for Information and Communications Policy: "A Survey Study of Blogging," Mar. 2009 (in Japanese).
[2] K. Noaki, M. Terada and H. Sekino: "Reference Structure Analysis for Video Searching Via Web Pages," DICOMO 2009, pp. 36-42, Jul. 2009 (in Japanese).