Personalized Search for TV Programs Based on Software Man

Similar documents
An Improved Frequent Pattern-growth Algorithm Based on Decomposition of the Transaction Database

Grid Resources Search Engine based on Ontology

Application of Individualized Service System for Scientific and Technical Literature In Colleges and Universities

Research and Application of E-Commerce Recommendation System Based on Association Rules Algorithm

Automatic Generation of Wrapper for Data Extraction from the Web

An Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data

Design and Implementation of Agricultural Information Resources Vertical Search Engine Based on Nutch

Analysis of Computer Network and Communication System

Research on An Electronic Map Retrieval Algorithm Based on Big Data

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2

Design of Tourism Information System with B/S and C/S Architecture based on Android and Web Platform

Time Series Clustering Ensemble Algorithm Based on Locality Preserving Projection

Preliminary Research on Distributed Cluster Monitoring of G/S Model

Statistical Methods to Evaluate Important Degrees of Document Features

Key Technology of Monitoring and Control for Coal Shearer Supporting Network Convergence

The Analysis and Research of IPTV Set-top Box System. Fangyan Bai 1, Qi Sun 2

High Capacity Reversible Watermarking Scheme for 2D Vector Maps

The Application Research of Semantic Web Technology and Clickstream Data Mart in Tourism Electronic Commerce Website Bo Liu

The principle of a fulltext searching instrument and its application research Wen Ju Gao 1, a, Yue Ou Ren 2, b and Qiu Yan Li 3,c

Research on Socket Communication System Based on Zigbee Wireless Sensor Network

Design and Implementation of Remote Push System of Resources Based on Internet

Wireless Smart Home Security System Based on Android

Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha

Research on Full-text Retrieval based on Lucene in Enterprise Content Management System Lixin Xu 1, a, XiaoLin Fu 2, b, Chunhua Zhang 1, c

Top-k Keyword Search Over Graphs Based On Backward Search

A Network-Based Management Information System for Animal Husbandry in Farms

Design and Implementation of Full Text Search Engine Based on Lucene Na-na ZHANG 1,a *, Yi-song WANG 1 and Kun ZHU 1

An Improved KNN Classification Algorithm based on Sampling

Query Optimization Ways of DB2 to Improve Database Performance

Jumpstarting the Semantic Web

The Development of Mobile Shopping System Based on Android Platform

Correlation based File Prefetching Approach for Hadoop

A Two-phase Distributed Training Algorithm for Linear SVM in WSN

Keywords: Motion Capture System, Virtual Interaction, Wireless Sensor Network, Motion Builder

New research on Key Technologies of unstructured data cloud storage

Design of Coal Mine Power Supply Monitoring System

Design of Greenhouse Temperature and Humidity Monitoring System Based on ZIGBEE Technique Ming Xin 1,a, Wei Zhongshan 1,b,*

Information Push Service of University Library in Network and Information Age

The Design and Implementation of Electronic Program Guide In Digital TV Set-Top-Box

A Data Classification Algorithm of Internet of Things Based on Neural Network

Indexing in Search Engines based on Pipelining Architecture using Single Link HAC

SmartQR: a new interactive system based on QR code for smart TV 1

Improving Suffix Tree Clustering Algorithm for Web Documents

Department of Computer Science and Engineering B.E/B.Tech/M.E/M.Tech : B.E. Regulation: 2013 PG Specialisation : _

Research on Mining Cloud Data Based on Correlation Dimension Feature

Intelligent Computer Room Management Platform Based on RF Card

Construction of SSI Framework Based on MVC Software Design Model Yongchang Rena, Yongzhe Mab

The SNM Algorithm Based on Multiple Edit Distances and Variable Window

STRUCTURE-BASED QUERY EXPANSION FOR XML SEARCH ENGINE

The Research and Design of the Android-Based Facilities Environment Multifunction Remote Monitoring System*

Research Article A Two-Level Cache for Distributed Information Retrieval in Search Engines

Realization of Time Synchronization Server Based on C/S

The Design and Application of GIS Mathematical Model Database System with Meta-algorithm Li-Zhijiang

PoS(CENet2017)064. 3D Visualization of Point Clouds Using HTML5 and WebGL. Speaker

A Novel Intrusion Detection Method for WSN Sijia Wang a, Qi Li and Yanhui Guo

Research on Clearance of Aerial Remote Sensing Images Based on Image Fusion

Application of Nonlinear Later TV Edition in Gigabit Ethernet. Hong Ma

Study on Personalized Recommendation Model of Internet Advertisement

The Principle and Improvement of the Algorithm of Matrix Factorization Model based on ALS

Implementation of Parallel CASINO Algorithm Based on MapReduce. Li Zhang a, Yijie Shi b

Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data

Research on Technologies in Smart Substation

Multi-Source Spatial Data Distribution Model and System Implementation

The research and design of user interface in parallel computer system

Fingerprint Verification Terminal of Examination Information Based on Qt Mengtao Huang 1, Mengyao Han 2

Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article

A Tentative Study on Ward Monitoring System based on Zigbee Technology Jifeng Liang

Development of remote control software for multiformat test signal generator

Improvements and Implementation of Hierarchical Clustering based on Hadoop Jun Zhang1, a, Chunxiao Fan1, Yuexin Wu2,b, Ao Xiao1

Open Access Research on the Data Pre-Processing in the Network Abnormal Intrusion Detection

Chapter 2. Architecture of a Search Engine

Information Retrieval Spring Web retrieval

APPLICATION LAYER APPLICATION LAYER : DNS, HTTP, , SMTP, Telnet, FTP, Security-PGP-SSH.

Information Retrieval System Based on Context-aware in Internet of Things. Ma Junhong 1, a *

Design and Realization of Agricultural Information Intelligent Processing and Application Platform

Research and Design of Key Technology of Vertical Search Engine for Educational Resources

Embedded Smart Home System Based on ZigBee Song Chi

Research and design on boxing-teaching robot of embedded system and pressure sensor

Chapter 6: Information Retrieval and Web Search. An introduction

Design and Implementation of Dual-Mode Wireless Video Monitoring System

Research on online inspection system of emulsion explosive packaging defect based on machine vision

Mining Web Data. Lijun Zhang

Part I: Data Mining Foundations

Based on correlation coefficient in image matching

A Batched GPU Algorithm for Set Intersection

Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical application of VOD

Open Access The Three-dimensional Coding Based on the Cone for XML Under Weaving Multi-documents

Background. Problem Statement. Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. Deep (hidden) Web

An adaptive container code character segmentation algorithm Yajie Zhu1, a, Chenglong Liang2, b

Beijing University of Posts and Telecommunications, Beijing, , China

Research of the Optimization of a Data Mining Algorithm Based on an Embedded Data Mining System

A QR code identification technology in package auto-sorting system

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.

Design of Smart Home System Based on ZigBee Technology and R&D for Application

Profile Based Information Retrieval

The Design of Model for Tibetan Language Search System

Study on Image Position Algorithm of the PCB Detection

Deep Web Crawling and Mining for Building Advanced Search Application

The Design of Distributed File System Based on HDFS Yannan Wang 1, a, Shudong Zhang 2, b, Hui Liu 3, c

A Cache Utility Monitor for Multi-core Processor

Transcription:

Personalized Search for TV Programs Based on Software Man 12 Department of Computer Science, Zhengzhou College of Science &Technology Zhengzhou, China 450064 E-mail: 492590002@qq.com Bao-long Zhang 3 Department of Computer Science, Zhengzhou College of Science &Technology Zhengzhou, China 450064 E-mail: 115441690@qq.com Watching TV programs is one of the most popular ways of spending our free time. With the development of TV programs, it s a problem of how to search their favorite TV programs efficiently. In this paper, it proposes the personalized search for TV programs based on Software Man. According to user s interests, the personalized search can quickly sort the search results, and then meet the demand of the user's personalized search; besides, this software man can adapt to the environment and make autonomous decision. Finally, this software man can track users and collect the user s browse of TV programs regularly in order to build the users interests model. CENet2015 12-13 September 2015 Shanghai, China 1 Speaker 2 Corresponding Author 3 2013-2015, key project of science and technology research in Henan, Department of Education,(14B520005). Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0). http://pos.sissa.it/

1. Introduction There are too many duplicate messages and it costs lots of time to get useful information, which becomes the biggest problem of searching TV programs. No matter the user is teenager, adult or the elderly, the search process is the same. The current information retrieval technology meet the needs of people to some extent, but as a result of its general properties, it still cannot meet the query request from the different background of the different purposes in different periods. Personalized search is aimed at this problem, which can provide different services for different users to meet different needs. Personalized search will learn the user's interest and behavior by collecting and analyzing the customer s information in order to realize the active recommendation. Personalized search technology can change the search results of "one-size-fitsall" completely and improve the relevance of search results, so as to improve the user s usage experience. 2. Personalized Search Framework The personalized search system of TV shows is divided into seven subsystems, as shown in Fig.1. Natural Language processing subsystem is the foundation of the whole system and also the most basic subsystem. Because the processing object is Chinese TV description text, it doesn t have obvious space like English word. As to Chinese word, there is no space between the word and the word, thus the word segmentation is required for further processing. Obviously, the whole performance of the system is directly influenced by the segmentation accuracy. Network web spider s subsystem: which is used to grab the TV text description HTML files from TV web, convert TXT file, and be deposited in the TV text corpus. Storage subsystem: the function of the subsystem television is to store files and compress index. In general, as the index will take up a lot of storage space, high efficiency of index compression can save a lot of disk space. Indexing subsystem: which is responsible for the TV program content of the document analysis and establishment of the inverted index structure for merging and updating. Universal search subsystem, which can realize the function to explain the query expression and present the retrieval result through the calculate similarity between inverted index database query and document. Personalized search subsystem: which has realized the user's personalized search function. This function can realize process mainly through the query expansion, the user interest model and user feedback, etc. while the query expansion needs to use the interfaces provided by the semantic repository. User query processing subsystem: which has realized the user's query preprocessing, including the word segmentation and parsing while displaying the final query results. Figure 1: Personalized Search System of TV Show 2

3. Software Man "Software man" is an artificial life from the point of view of generalized artificial life so as to extend and expand people's life. It survives in the software environment and activities in the internet world, just like people who have thinking, awareness, behavior characteristics and information acquisition, processing and utilization ability[1]. "Software man" features the functions of personification attributes, personification, personification, behavior and structure of personification. 3.1 Software Man Model The software model is composed of six layers: core layer, behavior layer, functional layer, interface layer and data layer interaction [2], as shown in Fig.2. Figure 2: Software Hierarchical Model The core layer is a set of basic properties of software man [2]. In this layer, it contains the initialization module, function of the registry, behavior and characteristics of the registry, the self-organizing assembly module and evolutionary clone module. The bhavior layer is a set of the software man natural behavior, spontaneous behavior, self-consciousness and software group collection of environmental administrative tasks. This layer contains the environmental adaptive module, the self extinguishing and clearance module, the function loading module, the migration module, the object recognition module and the basic interaction primitive module. The function layer is the set of all functions of software designed by the designer. This layer contains the spam cleanup module, the information push and pull module, the information security module and other planning and definition of the service module. The interactive layer is the software man and other software man, environment and human interaction function set. The interactive behavior includes the transmission of information, looking for similar, work together, communication and exchange information. The data interface layer is a set of software man and other software man, software man and the environment, software man and human interaction with data exchange and information storage stack [3]. 4. Implementation of Software Man in Personalized Search of TV Program 4.1 Grabbing TV Program Information These six properties can best represent a TV program including the program name, the film types, the actor, the release date, projection television and the audience rating. Seeing each attribute of a film as one dimensional vector, we can get a six - dimensional vector m{xl,x2,x3,x4,x5,x6}. Given the weight of Wi (i=1,2,3 6) of the six vector(xl,x2,x3,x4,x5,x6), we can generate the interest samples on the basis of parsing these six properties. The weight must meet the following two formulas: W1 W6>W2 W3>W4>W5 (4.1) W1+W2+W3+W4+W5+W6=1 (4.2) Then, we cluster all TV programs by using undirected graph according to the weight. The algorithm steps are shown as follows: (1) Select k films from n films at random as the k cluster centers; 3

(2) Calculate each distance between the remaining n-k film and the k cluster centers, choose the shortest distance and add this film to the clustering center; (3) Recalculate the new clustering center of the cluster; (4) If there is any change of the previous cluster center, jump to tstep 2 and reassign films; otherwise, exit from the iterative process and the algorithm ends [4]. 4.2 Semantic Extension of TV Program With the mechanical word segmentation method, it divides the name of TV programs into words. This rule-based method will match the Chinese character string to the sufficiently large mechanized dictionary according to certain strategy. If the match is successful, it returns the match results as a participle; besides, it can be divided into the minimum and maximum matching algorithm according to different scanning directions, and can be divided into the simple word segmentation method and the method in combination of segmentation and labeling according to the principle whether the word labeling is combined. 4.3 Tree Structure of Searching TV Program A tree structure can be used to store a lot of strings. With the help of public prefix of strings, it can save a lot of storage space. Then, it can store the information of the mapping table between search strings and pinyin initials into this tree structure (as shown in Fig.3). Figure 3: Searching Tree The principle of this searching tree is shown as follows: (1) The first search starts from the root node; (2) Find the first letter of string, choose corresponding sub-tree according to the letter and jump to this sub-tree to continue this search; (3) In this sub-tree, find the second letter and select corresponding sub-tree to search; (4) By that analogy, find the rest letters of string and select the corresponding sub-tree to search; (5) At a node, if all of the letters have been taken out, read the information of this node and finish this search; finally, it will return the name of this TV program. 4.4 Detailed Design of Program Search The search system of television program mainly includes the function modules as follows. Collection of Program data. The module is mainly to collect data for structural analysis through the metadata of the program source, establish the corresponding program fetching templates, and then snatch in accordance with the fetching templates to carry out data acquirement, then finally analyze the acquirement of data and input metadata database for network modeling module using[5]. The network modeling module: which is mainly to get programs from Collect of Program data in order to the data mining, build method of social network based on the film properties and get social network program. 4

The program searching: which is mainly to search optimization program and improve the efficiency of search accuracy. The process is as shown in Fig. 4~Fig. 6. Figure 4: Data Collection of Program (1) Obtain program data sets and grab the alias and program evaluation attributes, establish data sets and finally integrate data sets. (2) Repeat (1) in the regular time and update the data Figure 5: Network Database of Program (1) Get attributes of the threshold value through the off-line training; (2) Use the utility function to calculate the program correlation between each other. (3) Gain of correlation from (2) and cluster program from repository program by k-means algorithm [6]. (4) When the network information database is established, send TV program information to the database. Figure 6: Program Searching 5

(1) Interview the network module of information database and establish the programs - initials mapping table (2) Wait for the user input (3) When a user inputs a button, form a search string in front of the input as inquired. (4) Search the query through programs - initials mapping table. If the result of sequence is achieved successfully, return null and jump to (2). (5) Make the intelligent sort by result sequence and return the final query results (6) If user s choice is correct, find the finish; otherwise, jump to (2) (7) If the user selects a film, represent a successful query, record the user behavior, jump to the video module; otherwise, find the failure and jump to (2) 4.5 Search Implementation It uses C/S structure design between client and server. As to the client, the program, which can be run on Windows and Linux platform, is developed by QT. As to the server side, the program, which can be run on the Linux platform and used sqlite and xml to store data, is developed by C++.The socket connection is established on the basis of TCP between client and server, and the message, consisting of message header and message body, is defined by TLV. Then, the length is used to define the length of message content and the value is used to store the variable message content. The client sends commands (such as init, search, pre, next, recommend) to the server through the corresponding port and the server responds to these commands. The searching page is shown in Fig.7. After each user has input a letter, the system will automatically ash button which doesn t combine with later letter. It may help the user to save the number of keystrokes and improve the search efficiency. When the user presses the confirm button, the system from the search results page jumps to the target page. If you do not use smart sort of search results, the user presses R. Sequences generates the search results. Figure 7: Operation Interface 5. Conclusion According to the serious of problems that users encounter when search for TV information such as information repeating,time consuming, and the same searching process and so on, combining the information properties of TV programs, this paper combines personalized search and software model by the program for clustering weights iteration, tree structure being adopted to improve the intelligent search, and through the history of the machine user data mining draws preference information, then realizing personalized search programs on television so as to meet the personalized needs of users. 6

References [1] Min Zuo. A system based on parallel evolution about Robot and Software Man and its application [M]. Beijing: National Defence Industry Press, 2~4(2012).( In Chinese) [2] Guangping Zeng, Xuyan Tu. Software man s concept model and the tectonic characteristics [J]. Journal of computer science, 2005(5): 135~ 136. ( In Chinese) [3] Yi Song. The research of personalized search for TV programs [D]. Shanghai: Shanghai Jiaotong University, 2008. ( In Chinese) [4] Guohui Liu. The research and implementation of interactive television, film and television program search system [D]. Shanghai: Fudan University, 2011. ( In Chinese) [5] Heng Zhuang, Longbing Qi,The experience design of digital TV man-machine interface, Modern TV technology [J], 2010 (4):144~147. ( In Chinese) [6] Qingzhen Wang, Lili Qiu, the design of children's TV show search and monitoring Softrobot soft robot, Intelligent computer and application [J], 2014(2):40-44. ( In Chinese) 7