Project Report on. Google s schema.org Mark-up for Education. Inaugral Regular Internship Program 2014

Size: px
Start display at page:

Download "Project Report on. Google s schema.org Mark-up for Education. Inaugral Regular Internship Program 2014"

Transcription

1 Project Report on Google s schema.org Mark-up for Education Inaugral Regular Internship Program 2014 Submitted in partial fullfillment of internship project By Sanket Gaurav (Sikkim Manipal Institute of Technology) Sudhakar Yadav (National Institute of Technology, Raipur) Jithendranath Gupta Y. (Amrita University, Coimbatore) Kasturi BhanuChander (VITAE, Hyderabad) Mentor: Aruna Adil Program Director: Avinash Awate Under the Guidance of Prof. Deepak B. Phatak Department of Computer Science and Engineering, Indian Institute of Technology, Bombay 1

2 Acknowledgement Our heart pulsates with the thrill for tendering gratitude to those persons who helped me in completion of the project. Unfortunately, the list of expressions of thank no matter how extensive is always incomplete and inadequate. Indeed this page of acknowledgment shall never be able to touch the horizon of generosity of those who tendered their help to me. First of all we would like to thank our guide Dr. Deepak B. Phatak, Professor, Department Of Computer Science and Engineering, Indian Institute of Technology, Bombay for hosting the 6 month Regular Internship under which we could complete our 6 month internship and for his kind attitude, invaluable guidance, keen interest, immense help, inspiration and encouragement which helped us in carrying out our application development. Then we are thankful to Google for giving this project to us and also sponsoring our project which takes care of all the expenses for the whole duration of the project. We are extremely grateful to Mr. Avinash Awate, Program Director and Aruna Adil, Project Mentor, Deptt. Of CSE for providing all kinds of possible help throughout the phase of development of application. We are also thankful to Mr. Ashwani Sharma, Program Manager, Google, India and Ewa Gasperowicz, Developer Programs Engineer, Google, London for helping and solving our doubts related to schema.org etc. We are also very thankful to the Department of Computer Science and Engineering, I.I.T, Bombay for allowing us to carry out this internship and providing the resources. Lastly, we thank all those who are involved directly or indirectly in the successful completion of the project work. Sanket Gaurav Sudhakar Yadav Jitendranath Gupta Yenduri Bhanuchander Kasturi 2

3 List of Content Chapter Title Page No. Abstract 8 Introduction 1.1 General Overview of the Problem 1.2 Objective Literature Survey Schema.org LRMI Kritikos Schools Interoperability Framework(SIF) 21 3 Current Scenario Our Approach SRS Software Platform Hardware Platform Project Scheduling Analysis of websites CDEEP Spoken Tutorial ekshiksha Software Development Design Strategy for the solution Context Diagram Test Plan Structured Data Testing Tool Google Webmaster Tool Implementation Details Results and Discussion Result at Structured data testing tool

4 Result at Webmaster Tool Proposed Tags Summary and Conclusion Summary of Achievements Difficulties Encountered Limitations of the Project Future Scope of the Work Any Special Observation 73 User Manual Appendix 1 77 Appendix Appendix References 86 4

5 List of Figures Figure No. 1 2 Figure name The actual flow of web search for information Display of Amitabh Bachchan description on right side panel Page No Display of IIT Bombay description on right side pane Flow at LRMI The search results after adding LRMI and 'schema.org' markup The search results without adding 'schema.org' markup 16 7 Purpose of internet use by school children at KV, Mysore 23 8 Problems faced by children while using Internet 23 9 The various ways search is performed over web The flow of Educational search when we add 'schema.org' Gantt Chart Overall System Design or architecture of the HTML Processing Software Structure of HTML Processing Software Approach used in the HTML Processing Software Level 0 diagram of HTML Processing Software Test at Structured data tool before adding Schema properties Test at Structured data tool after adding Schema properties Test at Structured data tool after adding Schema properties to all the departments listed in CDEEP Test at Structured data tool after adding Schema properties to all the tutorials list in Spoken-tutorial. Test at Structured data tool after adding

6 Schema properties to all the workshop contents listed in Spoken-tutorial Test at Structured data tool after adding Schema properties. Test at Structured data tool after adding Schema properties to all the contents listed in ekshiksha. Test at Structured data tool after adding Schema properties to all the Interactive contents listed in ekshiksha. Data Highlighter of CDEEP Data Highlighter of courses at CDEEP Structured data found in CDEEP Structured data found in Spoken Tutorial site Structured data before and after data 'schema.org' properties in ekshiksha Search queries graph for CDEEP site search queries made for Spoken-tutorial site search queries made for ekshiksha site Change in average position of queries posted of CDEEP site on search engine 'Spoken-tutorial' website improvements after adding structured data. 'ekshiksha' website improvements after adding structured data

7 List of Tables Table No Table Name Page No. 1 2 LRMI Additions to schema.org/creativework Other important Characteristics of LRMI Alignment Object 20 4 Educational Audience 20 5 Mapping with CDEEP attributes with schema.org Mapping with Spoken Tutorials attributes with schema.org 7 Tags to be added (suggestion) 33 8 Tags to be added for workshop event (suggestion) 34 9 Mapping with ekshiksha attributes with schema.org Tags to be added (suggestion) Data Definition The tags related to education that are suggested to schema.org 13 The tags related to education that are suggested to schema.org 14 Advantages and Disadvantages of different parser in 77 Python 7

8 Abstract Today search engines are playing an important role in obtaining the information from the web. Every individual irrespective of age limit (Schoolchildren, College Students, Working professionals, teachers etc.) depends on the search engine to retrieve the information from the web. However, data present in most of the webpages does not have meaning attached to it, which causes the search engine to fail in providing relevant results about the data, what user are searching for, and this consumes a lot of time in getting the relevant results. Our focus is mainly on the retrieval of educational related information effectively when searched and index-able by search engines. A unique path to search for educational related information is not present, which is making the existing search inadequate for discovering all the educational resources. This is very valuable for school children to learn and teachers to use for educational purposes. Schema.org [6] which was initiated by Google, Bing, and Yahoo provides a collection of schemas that help the webmasters to mark-up the pages with different schemas for enhanced search. We will be using these mark-ups for educational content over the web. This will help school children, teachers, etc. to get better relevant results about what they are searching for, in less time. 8

9 1. Introduction 1.1 General Overview of the Problem Currently, Internet is widely used by majority of people in India to access information. Students use internet mainly for watching videos, social networking, playing games, and to complete their homework and projects (educational resources). Most of the students directly depend on search engines to find the right information over large data present on the web. As a result, a lot of time is wasted searching for the exact and appropriate information. Now-a-days search engines are playing a vital role in accessing the online content over the web. In India around 82% [3] of school children use internet and around 53% [3] of them use it to search for educational purposes- such as to do their assignments, gain in depth knowledge about the subjects, take online quizzes, and also to complete their projects. They depend on search engines, especially Google (around 53 % [3] approx.) by using keywords (around 46% [2] approx.) to search these resources. Though there are enough educational resources present on the web, 58% [2] of these school children face problems in accessing the appropriate information in time, and as a result, spend a lot of time on searching. The flow of this search is shown in the Figure 1. All these problems arise as the data present on the web is in unstructured form, i.e., it has no meaning attached to it. Also, search engines cannot understand the meaning of the content that is made available as a search result. These problems clearly specify a need for the retrieval of educational information effectively, when search is performed using search engines. When a search query is made for any personality like Albert Einstein, any organization like IIT Bombay, or a recipe like Potato Salad, we get a summarized information about it on the right hand side of search result page, which is called as Knowledge Panel'. This happens because of structured data markup; when applied as markups to webpages through 'schema.org'. Schema.org categorizes the data, and provides meaning to this data. It is classified into various types like 'Creativework', 'Event', 'Organization', 'Recipe', etc. However, there is no such unique way which makes existing search adequate to discover the required educational resources that are essential for the children to learn, and teachers to teach. 9

10 1.2 Objective: Our work mainly aims to propose some schema properties for education that will help search engines to understand educational resources on web, and will make these resources 'index-able'. As a result, the search time will be reduced which will not only assist teachers to find the appropriate information to teach students, but also students to learn these resources. In addition, it will help publishers to publish their information effectively. Figure 1: The actual flow of web search for information, statistics is based on survey "Use of Internet by the Researchers of Indian Veterinary Research Institute [13]" and "Internet Usage by the Students of Kendriya Vidyalaya[3]" 10

11 2. Literature Survey Here, we discuss the technologies used by various educational websites, and also the properties proposed by schema.org [6] and LRMI [11] to markup the educational websites. 2.1 Schema.org 'Schema.org' provides a set of properties i.e. markups, which can be utilized by the webmasters to markup their webpages, and to make these markups visible in major search engines like Google, Yandex, Yahoo! and Bing. These schema markups help search engine understand the meaning of the website, and filter search results. It helps in finding the right webpage in less time. Search engines have a very limited understanding of what is being discussed on web pages merely by reading, as opposed to the humans. By adding new tags i.e. 'itemscope', 'itemtype', and 'itemprop' in webpages, we can associate the web pages with specific property such as movie, place, person, video, etc. to make search engines understand those webpages clearly. Details of the properties are given below. itemscope: By adding itemscope, we are specifying that the HTML contained in the <div>...</div> block is about a particular item. <div> <h1>avatar</h1> <span>director: James Cameron (born August 16, 1954)</span> <span>science fiction</span> <a href="../movies/avatar-theatrical-trailer.html">trailer</a> </div> After adding Itemscpoe property, the code will be <div itemscope> <h1>avatar</h1> <span>director: James Cameron (born August 16, 1954) </span> <span>science fiction</span> <a href="../movies/avatar-theatrical-trailer.html">trailer</a> </div> 11

12 Itemtype: itemtype attribute is defined immediately after the itemscope. This specifies that the item contained in the div is in fact a Movie/person, recipe..., as defined in the schema.org type hierarchy. Item types are provided as URLs. <div itemscope itemtype=" <h1>avatar</h1> <span>director: James Cameron (born August 16, 1954)</span> <span>science fiction</span> <a href="../movies/avatar-theatrical-trailer.html">trailer</a> </div> Itemprop: To label properties of an item, itemprop attribute is used. <div itemscope itemtype =" <h1 itemprop="name">avatar</h1> <span >Director: <span itemprop="director">james Cameron</span> (born August 16, 1954)</span> <span itemprop="genre">science fiction</span> <a href="../movies/avatar-theatrical-trailer.html" itemprop="trailer">trailer</a> </div> Examples: After adding some schema properties ( like person, organisation etc. ) in the websites, when searched in google, the description about that comes in right hand side of the screen which is called knowledge panel. If we search for a person like Amitabh bachchan in google, the search result will be displayed as shown in Figure 2. 12

13 Figure 2: Display of Amitabh Bachchan description on right side panel If we search for an organisation like IIT Bombay, the search result will be displayed as shown in Figure 3 13

14 Figure 3: Display of IIT Bombay description on right side panel 2.2 Learning Resource Metadata Initiative (LRMI) The Learning Resource Metadata Initiative[11] is an organization that has been working on filtering search results, particularly when a search about educational resources is performed on the web. The purpose is to publish, discover, and deliver quality content easily on the web. The metadata schema developed by the LRMI was adopted by 'schema.org' in LRMI tries to reduce the gap between users and available resources by changing the way of accessing the information. The Figure 4 shows how meta-tagging is deployed in order to facilitate the search and retrieval of instructional resources. Here from right to left the publishers tag the learning resources and 14

15 save them in the learning registry and from left to right the educators and students search for the educational resources and find what they are looking for exactly in the learning registry. Figure 4: The flow at LRMI LRMI works with metadata, i.e., data about the data. It describes information like publication date, author, type of content, and number of pages etc. Each tag proposed by LRMI when added to the webpage, helps in the discovery of the educational re-sources in a better way. When we search for structure of atom after adding LRMI tags, we get categorized results which are shown in Figure 5. On the other hand, Figure 6 shows the results of Google search when no tags were added in the website. In Figure 5, there is a tag option named typicalagerange to filter results according to the age group. So, the user can find right information at right time with less effort. 15

16 Figure 5: The search results after adding LRMI and 'schema.org' markup. Figure 6: The search results without adding 'schema.org' markup. LRMI has proposed 14 properties which mainly markup educational resources. Now these properties are added into the library of 'Schema.org'/Creative work [6]. LRMI allows extension 16

17 since it wants to build the gap between the available resources and users. The survey results conducted by LRMI about the improvement of search after adding a markup shows an increment in retrieval of successful search results by 10% (35.5% %=10%) [11] and decrement of often unsuccessful results by 10% (13.1%-11.2%=10%) [11] when compared with previous year The schema properties that are added from LRMI in schema.org are shown in Table 1,2,3,4. 17

18 18

19 19

20 20

21 2.3 Kritikos This is a customized search engine developed by University of Liverpool, USA [15]. They used a Learning registry approach to capture, connect, and share data about learning resources available online. It aims to reduce the time required to find useful resources for educators and students. The Learning Registry makes the process easier by acting as an aggregator of metadata data about the learning resources available online. They use this technique for higher education and display the results visually as thumbnails. 2.4 SCHOOLS INTEROPERABILITY FRAMEWORK (SIF) ASSOCIATION The SIF [3] Association is comprised of K-12 organizations that have created a set of principles and definitions that enable software programs from different companies to share information and makes it possible for programs within a school or district to share data without additional programming. 21

22 3. Current Scenario In this section, we discuss about the online search requirements of school children, types of problems school children are facing while searching on the web, and if these problems are addressed by search engines or not. We also discuss the new trends in latest educational websites. This work has been done for our research purpose. Problem faced by children while using Internet: Today, most of the school children depend on search engines to do their homework, assignments, and search reference books. Many teachers also suggest their students to search on the web to get more information for better understanding of the topic. When a student types Structure of atom for class 9 in any search engines like Google, Yandex, Bing, or Yahoo!, students get hundreds of results. However, most of these results are irrelevant from students standard and level. Students cannot identify the correct link which gives appropriate information according to their level. The reason behind getting this huge list of results is the use of keyword matching technology used by the search engines to retrieve the results. School children, however, need a friendly search results for educational related information, which can filter the redundant results according to the level and categorize the results according to the content. This will trim down the amount of time spent on identifying the correct content by the children, teachers, and according their requirement. The below data shows the use of internet by children for education and problems faced during finding the information from the web. The results of the survey conducted by Kendriya Vidyalaya (KV) [2], Mysore, about the internet use by school children (among 120 students) say that around 82% children use internet. Analyzing this data, we need to provide a standard way to access the online resources effectively. Students use the internet for many reasons however; we mainly concentrate about how they use it for educational resources. 'Purpose of internet use' of school children is shown in Figure 7. The study shows that around 36% (25% for assignments +11% for educational information) use internet for educational purpose. Comparatively, the statistics of another country (Thailand) [3] states that 60% of total school children use internet for educational purpose. 22

23 Figure 7: Purpose of internet use by school children at KV, Mysore [3]. Figure 8: Problems faced by children while using Internet [2]. School children also face many problems while searching for the information from the web. According to the survey conducted, around 58% [4] find problem in accessing the relevant information from large, overloaded information present on the web as shown in Figure 8. The above analysis strongly highlights the need to provide a solution that will deal with the biggest problem i.e. getting relevant information. Clearly, there is huge scope for providing better 23

24 search facilities for Indian school children in order to increase the percentage of internet use for education. School children search in the web in different ways, according to a survey [5] conducted (among 316 students) 36% of them use search engines, 34% use subscription database and remaining type the web address directly for search. Search can be performed in various ways according to [4], results as shown in Figure 9. Among all different ways of searching, search with keywords is the most common way. Figure 9: The various ways search is performed over web. With the above data, we conclude that if schema.org markups are used in the websites, the search engines can recognize them and provide relevant results. Currently, we have many websites and technologies that have categorized information according to the content for children, related to education information and work in different ways given below: Creation of Custom Search engine. In these websites, search engine is built manually by adding various sites related to the education facilitating a 'custom search'. For example, website Kidrex [15] that is built by custom search. 24

25 Search in the fixed static database. These websites have their own database for educational resources, and perform search in that database. The website Mid-Continent library [12] searches for the educational information from its own database. Query Refinement: Today, many websites, modify the query and add keywords to the query and display the results accordingly to the child s needs. However, the websites also provide irrelevant results as they do not filter the results completely. 'Gogooglians' [13] is a website that works on this technology. This is the latest way to filter the results from the search engine. Search according to Age group: This is a technique which is very flexible used to search educational information according to diverse age groups. Website was built using custom search. 'Kids.gov' [14] is a website that was powered by 'Bing!' Search engine. This technique requires the user to know the specific websites. Thus, we need to provide a solution in which searching in common search engines be done in such a way that only relevant data is displayed. Figure 10 shows the flow diagram of retrieval the results by search engines when 'schema.org' markups implemented to educational websites. 25

26 Figure 10: The flow of Educational search when we add 'schema.org'. 26

27 4. Our approach After going through the literature survey and current scenario, the approach towards the project is as follows: -Analyze the websites and recognize all the properties that are relevant to education. -Create Software which performs the following steps. a) Takes the websites url as input to the software and generates a DOM structure corresponding to each and every webpage. b) Then it crawls through the DOM structure and find the relevant link where education related material is present and creates the in memory structure. c) In the in-memory structure it adds all the schema properties which map to the current properties at schema.org. d) After adding all the properties at schema.org it writes the modified html in a file. -A Hierarchy of the tags for the Indian educational system to be created and mapped with schema.org current properties. Then the properties which are not present in the schema.org are proposed as an extension to Schema.org -A Research paper on A study to assess and enhance educational specific search on web for school children. is written. 4.1 Software Requirement Specification (SRS) Software Platform Operating System: Ubuntu Language: Python (Version: 2.7.3) Library Used: Beautiful Soup (Version: 4.3.2) Parser Used: Python's Html Parser Header Used: bs4, urllib2, re, error, os Testing tool: Google Structured Data testing tool, Google Webmaster Tool Hardware Platform 27

28 Computer RAM: 4GB Processor: Core i5 I/O Devices: Keyboard, Mouse, Monitor 4.2 Project Scheduling The scheduling is shown in Figure 11: Y A B C A C T I V I T Y D E F G H X 1stJan 10th Jan 20th Jan 30th Jan 10th Feb 28th Feb 15th Mar 30th Mar 10th Apr 30th Apr 15th May 30th May 10th Jun 30th Jun TIME Figure 11: Gantt Chart LEGEND: A) Feasibility Study B) Literature Survey C) Design Strategy of Solution D) Implementation E) Adding schema properties to live site F) Testing at Structured Data Testing Tool G) Testing at Webmaster Tool H) Documentation 28

29 4.3 Analysis of websites We have analysed three educational websites [9] owned by IIT Bombay and, identified the tags and mapped them with schema.org. Analyses of the three websites are given below CDEEP (Center for Distance Engineering Education Programme) [16] It is an IIT Bombay s initiative which offers recorded courses to the outside students, working professional, researches etc. to avail IIT Bombay's courses. One can register online for this. After registration, the registered recorded course(s) are available by video streaming. Attributes of courses at CDEEP website mapping with properties of schema.org: Here in Table 5, we have mapped all the attributes present in CDEEP Website with the Properties of Schema.org Table 5: Mapping with CDEEP attributes with schema.org S.No Attributes/Propert 1) 2) 3) y of CDEEP Course Name Course Code Instructor 4) Department 5) 6) 7) 8) 9) 10) 11) 12) 13) Total Credits Number of Lectures Tutorial Practical Pre-requisite Description Last Updated Date Created Date Published 14) Duration/Semester Property Matching in schema.org Expected Type Text Person ->author/creator Organisatio ->department n ->description Text ->datemodified Date ->datecreated Date Date ->datepublished ->duration ISO 8601 date format 15) 16) 17) 18) Level Organisation Type Ratings ->publisher Organisation ->best Number or Text 29

30 19) url 20) related link Rating(Thing>Intangible>Rating) ->url ->relatedlink URL URL Tags need to be added in CDEEP courses (suggestions): These are few tags which are necessary to be added to the courses at the website so that it becomes more informative and more useful. a) Date Created: - It will give the date of creation. b) Date Published: - It will give the date of publishing. c) Level (Under Graduate/Post Graduate):- It is required to tell that the course is for UG or PG level which makes the course more informative. d) Ratings: - It helps both student and teacher to know the performance of the Course. Properties not found at schema.org: a) Total Credits b) Pre-requisite c) Level d) Type e) Course Code f) Number of Lectures g) Tutorial h) Practical Spoken Tutorial [8] The Spoken Tutorial website is the initiative of the Talk to a Teacher activity of the National Mission on Education through Information and Communication Technology (ICT), launched by the Ministry of Human Resources and Development, Government of India. 30

31 Attributes of Spoken Tutorial website matching with schema.org: Here in Table 6, we have mapped all the attributes present in Spoken Tutorial Website with the Properties of Schema.org. Tags need to be added in Spoken Tutorial Table 6: Mapping with Spoken Tutorial attributes with schema.org Property Matching in Expected Sl. No Attributes/Property schema.org Type SPOKEN TUTORIAL 1 Course Name Text 2 3 Language Tutorial Name Creative work->in language Text Text 4 No of visits e>rating>aggregate Rating Number ->Review Count WORKSHOP EVENTS Start Time Location 3 Topic Name WEB PAGE 1. Address 2. Main content of 3. Description Telephone Date Time >door time Person- Contac Point >address or Place Text Postal zation -> Address Web Page ->Creative Work>Web Page Element >description zation Text Integer zation 31

32 Suggestions These are few tags in Table 7 which are necessary to be added to the Tutorials at the "Spoken Tutorial website so that it becomes more informative and more useful. Table 7: Tags to be added (suggestion) SNO Attributes/Property Description Expected Type 1 Uploaded date Date when this media object was uploaded to this site. Date 2 Rating The rating for the content. Number or Text 3 Person Name Name of the person Text 4 Type 5 Level The type tags provides the suitable location of spoken tutorials in tutorial hierarchy We can search the videos based on level Tags need to be added to Workshop Event: Table 8 shows the tags which are necessary to be added to the Workshop Events at "Spoken Tutorial website so that it becomes more informative and more useful. Table 8: Tags to be added for workshop event (suggestion) S. No Attributes/Propert Description Expected Type 1 yrating The rating for the content. Number or Text 2 Academic code centre We can search the Workshop events Based on Academic centre code 32

33 3 Typical age limit We can search the Workshop events Text Based on Age limit 4 Performer Name Name of the Performing the event. Text Properties not found at schema.org: The following are the tags that have to be included in Spoken Tutorials and also not found at Schema.org website. 1. Level 2. Type 3. Academic Centre Code ekshiksha The ekshiksha[1] website is mainly developed for the school level students to learn and prepare them. Any school/institution can register with ekshiksha[1] and download content for distribution to their students through their own labs provided they agree to give valuable feedback. Mapping of attributes with schema.org property: Here in Table 9, we have mapped all the attributes present in ekshiksha website with the properties of Schema.org. Table 9: Mapping with ekshiksha attributes with schema.org No. Attributes Property Matching in schema.org Expected Type Contents Area(subject) Topic/Subtopic Title Author >name Text Text Text Person uthor 33

34 5. 6. Description Ratings >description Text - Number (Thing>Intangible>Rating>Rating Level Mode Thing-> Person Name a-first name b-middle name c-last name Value) Not in Schema.org Not in Schema.org Text ->additional Text name Text name Text additional name Qualification Text Organization/Depa rtment >worksfor (Organization ->Department) 14. Text -> Creative Work->Web Page Main Content Web Language ->maincontentofpage Element ->in Text Image Language Image Object ->primaryimageofpage Number review Count Page Rating>Aggregate Rating ->review Count 19. Contributor 34

35 Tags need to be added in ekshiksha website (suggestions): Table 10 suggests the tags which are necessary to be added to the ekshiksha[1] website so that it becomes more informative and more useful. Table 10: Tags to be added (suggestion) PROPERTIES EXPECTED SCHEMA Related Link TYPE URL Level NUMBER Not in schema.org Description Important links related to website age group Properties not found at schema.org: The following are the tags that have to be included in ekshiksha[9] and also not found at Schema.org website. Mode: It defines the mode of the content. Pre requisite: The pre knowledge and languages required to know for subjects. Level: Defines the level or standard of the content means it is useful for 9 th standard or 10th standard students. Format: It specifies the Type of the content i.e. content either in Text format of Flash/Applet Format. 35

36 5. Software Development 5.1 Design Strategy for the solution: First of all attributes found in the website were mapped with current schema.org properties. The attributes which were mapped with current schema.org were to be inserted in the websites. For this purpose HTML Processing Software was made. Figure 12: Overall System Design or architecture of the HTML Processing Software Figure 12 explains the principle on which HTML Processing Software is made. The software takes homepage url of a site or specifically the html file where schema.org properties to be inserted. Then it uses a Python s default parser which parses the page and 36

37 gives the standard errors and warnings. It also creates the DOM structure of the html page. From the DOM structure, intelligence(contents on which schema.org properties to be added) is extracted which also gives the errors and warnings. After extracting the intelligence, in-memory structure is created and extracted contents are stored in that inmemory structure. On this in-memory structure, processing is done i.e. schema.org properties are added to the contents. After processing, this is written back onto inmemory structure and reverse process is followed. At the end, the modified HTML page i.e. page containing schema.org properties is returned as output which is stored in a file. Figure 13: Structure of HTML Processing Software Figure 13 gives the basic structure of the software. It takes html page or files as input, process it and adds schema properties to it, finally store the modified HTML in a file. 37

38 Approach used to add schema properties to tag courses at CDEEP is shown in Figure 14 Figure 14: Approach used in the HTML Processing Software 38

39 5.2 Context diagram (Dfd Level 0) of the software: Figure 15: Level 0 diagram of HTML Processing Software Table 11: Data Definition Id a b Data Definition home_url Url of home page of CDEEP Original course detail Original html content of course details stored in a file c html Modified d e detail html course details stored in a file Original course html Original html content of course list stored in a file Modified course html Modified html content(with added schema properties to course Modified html content(with added schema properties) of instructor attribute) of course list stored in a file 39

40 6. Test Plan The added schema.org properties to the above 3 educational websites mentioned can tested on two tools of Google. The tools are as follows: 1. Google Structured Data Testing Tool [14]: In this tool the modified html file content are pasted and it recognizes the schema.org properties and display the list of all the schema.org properties in the webpage. URL of the website can also be passed to this tool and it gives the list of all the schema.org properties attached to the passed url. 2. Google Webmaster Tool: This is a very important tool which gives all the required things for a website testing and performance. For using this tool one has to upload Google verification file in the root directory of the website so that Google bot can recognize that this website has to be tested. It is a continuous testing going for the site. This testing is done for live site. Two other file also has to be uploaded to the root directory i.e. robots.txt which mentions the portion of the site that has to be protected from the bot to access and also specifies which bot can access the site. It also contains the link of the sitemap.xml. The other file to be uploaded to the root directory is sitemap.xml which has the preferences of the different levels of the site. It gives the performance of the website in following terms: Structured Data (It gives the schema.org properties added to the registered site) Data Highlighter (It gives the description of schema.org tagged properties on the site will appear on right hand side of the search engine) HTML improvements (It gives HTML improvements on the site.) Search Queries( It gives the average position percentage change of the queries made by the user on the search engine which is calculated on the basis of CTR i.e. click through rate which is calculated by number of clicks by number of impressions for respective query made queries made. For better understandability, a graph is also plotted for clicks and impression progress. ) Description of terms used at Webmaster Tool: Impressions, the number of times pages of the site appeared in search results. 40

41 Clicks, the number of times a user clicked the sites listed in search results for a particular query. CTR (Click Through Rate), the percentage of impressions that resulted on a clicking the site. Average position, the average top position of the site on the search results page for that query. 41

42 7. Implementation Details Language: Python(Version:2.7.3) Library Used:Beautiful Soup(Version:4.3.2) Parser Used:Python's Html Parser Header Used: bs4, urllib2, re, error, os Varibles Used: home_url,keyword,list_jsp_url,course_list_url,schema_link Software Introduction: This software reads the data at the CDEEP site and adds schema.org properties so that course description comes at knowledge panel when anyone searches for it on any search engine. Input to the Software: url of CDEEP site ( Output from the software: Original course details html file Modified html file Description of Functions: 1. process_course_detail(fname,bsoup):- This function processes the input html data and calls necessary functions to add schema properties on the html fname(it is the course code which is used as filename to save the file with.htm extension) bsoup(beautiful soup object of the page containing html data which will be be used for the processing of no return as processing is done and the object(output) is stored in the file inside the function itself 2. process_dept(fname,bsoup): -This function adds schema properties to the author attribute of each course listed in the department and calls necessary function for adding the fname(it is the course code which is used as filename to save the file with.htm extension) bsoup(beautiful soup object of the page containing html data which will be 42

43 be used for the processing of return as processing is done and the object(output) is stored in the file inside the function itself 3. itemscope(tsoup,schema_link): -This function adds schema properties i,e. itemscope and itemtype= to the body tag so that schema.org scope is tsoup(beautiful soup object of the page containing html data which will be be used for the processing of data) schema_link(schema.org link which contains the properties to be added to the page ex: return as it only modifies the soup object and changes are saved to the soup object only 4. itemprop_cname(psoup,cname,itprop):- It finds the given course name in the given html data and its corresponding tag and add 'name' property to it (ex. For course name attribute psoup(beautiful soup object of the page containing html data which will be be used for the processing of data) cname(course name on which schema property to be added) itprop(schema item property to find return as it only modifies the soup object and changes are saved to the soup object only 5. itemprop(psoup,prop,itprop): -It finds the given keyword/attribute and add respective schema property to psoup(beautiful soup object of the page containing html data which will be be used for the processing of data) prop(keyword/attribute on which schema property to be added) itprop(schema item property to find return as it only modifies the soup object and changes are saved to the soup object 43

44 only 6. b_soup_obj(data):- This function is used to create beautiful soup object of the html data passed to it and return the data created from given url by url_data() the beautiful soup object 7. url_data(url):- Function to open the passed url as parameter. It gets the content of the url and returns the content. It first sends request to the url by using urllib2.request(url function) and stores the reply. Then it passes the reply to urlopen() function to open the given url. At the end it reads the content of the url and store in an object and returns that url(url link for which html data to be fetched return the html content of the passed url 8. find_link(obj,key):- This function finds the required link for courses in the home page of CDEEP by matching the keyword passed to it. The course detail link can be accessed from this keyword. It searches in the html content by matching the regular expression as the keyword and finds the corresponding link i.e., href returns the obj(beautiful soup object on which search is to be made) key(keyword by which course link can be opened on the home page of returns the correponding url to the key word. 9. find_dept(obj):- This function finds all departments and displays them. It also opens all the department by passing the department link to dept() obj(beautiful soup object on which search is to be returns nothing. 10. dept(next_link):- It opens the department link and finds the listed courses in each department and search for the courses listed under that department. It opens each course detail link by combining necessary arguments to the jsp. Then it passes the obtained course id and 44

45 course detail object to process_course_detail() next_link(link of each department returns nothing 11. check_error(e_obj): It detects error in the html content of the link. It checks for the basic errors in the html page like error for html/title tags etc. If error is found then it prints the e_obj(beautiful object of the html page for which error to be returns nothing rather only print the errors. Steps carried in the Software: i. Start ii. Input cdeep home page url. iii. Pass the url to url_data() to fetch the html data of home page of cdeep and store the html data in an object. iv. Call b_soup_obj() to create the beautiful soup object of the html data and store the returned bsoup object. v. Call find_link() by passing the bsoup object and keyword to find the course url on the home page. vi. For the course url found call url_data() and b_soup_obj() to create course url content object. vii. Pass the course url object to find_dept() function to find departments at cdeep. viii. At find_dept() it searches for all the departments and print it and combine department name to the jsp to fetch the url of the respective department. ix. It then pass the link for each department to dept() function to further find the courses listed at each department and to open its respective course details link. x. Pass department name corresponding object to process_dept() function for adding schema property to the instructor of each course. xi. Pass course id and course detail object to process_course_deatil() for processing to be done on course detail page. xii. At process_course_deatil() oringinal html page for the respective course is saved to a file 45

46 named at its id and processing is done by passing bsoup object of course detail page to functons itemscope(), itemprop() etc. and also supplying other necessary elements such as tags on which schema properties to be added. xiii. The modified course details object is stored in file named with its id in modified directory. xiv.stop Steps carried in Keyword Insertion Software: i. START ii. First of all Standard Keyword Database for each course under CDEEP is made with the help of internet. iii. Description of a course is extracted. iv. Set of words separated by ;,. : are clubbed together v. Then keyword from standard database are mapped with the set off words and the keywords are matched then that whole set of words is treated as keyword for that respective course. vi. The extracted sets of words are inserted to webpage where course link is present so that the course can be searchable by different keywords. vii. STOP 46

47 8. Results and Discussion 8.1 Results at Structured Data Testing Tool [14] Test result of CDEEP [16] Figure 16: Test result at Structured data tool before adding Schema properties 47

48 Figure 17: Test result at Structured data tool after adding Schema properties 48

49 Figure 18: Test result at Structured data tool after adding Schema properties to all the departments listed in CDEEP 49

50 8.1.2 Result at Spoken Tutorial Figure 19: Test result at Structured data tool after adding Schema properties to all the tutorials list in Spoken-tutorial. 50

51 Figure 20: Test result at Structured data tool after adding Schema properties to all the workshop contents listed in Spoken-tutorial. 51

52 8.1.3 Results at ekshiksha Figure 21: Test result at Structured data tool after adding Schema properties. 52

53 Figure 22: Test result at Structured data tool after adding Schema properties to all the contents listed in ekshiksha. 53

54 Figure 23: Test result at Structured data tool after adding Schema properties to all the Interactive contents listed in ekshiksha. 54

55 8.2 Results at Webmaster Tool: The average position of the website gets improved by two ways, one by the increase in numbers of people who searched about the website and other by adding structured data to the website. The snapshots of search queries from webmaster tool give the information about the improvements in the website. Figure 24: Data Highlighter of CDEEP 55

56 Figure 25: Data Highlighter of courses at CDEEP Figure 24 and 25 gives the view that marked content will be displayed on knowledge panel of the search engine. 56

57 Figure 26: Structured data found in CDEEP 57

58 Figure 27: Structured data found in Spoken Tutorial site 58

59 Figure 28 Structured data before and after data 'schema.org' properties in ekshiksha Figure 26, 27 and 28 shows the structured data i.e. schema.org properties attached to the webpage of sites. It shows significant increase in structured data items in the graph with time. 59

60 Figure 29: Search queries graph for CDEEP site 60

61 Figure 30: search queries made for Spoken-tutorial site 61

62 Figure 31: search queries made for ekshiksha site Figure 29, 30 and 31 gives the analysis of search query graph for the CDEEP site. The graph shows the clicks in red and impressions in blue. There is remarkable increase in the clicks of three sites as shown in the 3 graphs. 62

63 Figure 32: Change in average position of queries posted of CDEEP site on search engine Figure 33: 'Spoken-tutorial' website improvements after adding structured data. 63

64 Figure 34: 'ekshiksha' website improvements after adding structured data. Figure 32 gives the average position change measured for queries made for contents present at CDEEP site in the search engine. In Figure 33 and 34, we can see highlighted columns CTR and average position. Also we can see the respective changes in the very next columns. These columns provide strong evidence that the structured data will help in discovering educational websites. It is observed that by adding the structured data the average position of the website and Click through Rate (CTR) is increased. CTR measures the effectiveness of the webpage. It is calculated based on the clicks and Impression. A high CTR means that it attracts and gets clicks often. A low CTR means it is under performing and should be evaluated further. A 1% CTR means for every 100 impressions there is 1 click. 8.3 Proposed Tags Our next goal is to filter the irrelevant links and provide appropriate information during the retrieval of search results from any search engine. As part of this, we have analyzed the complete Indian educational system hierarchy. We have divided the complete educational system 64

65 into two levels school and University. Further, the school level education is categorized mainly into three ways: access, assimilation and assessment. Access is the way of retrieving resources either by online or by offline and also it deals with the type of resources used for learning like textbooks, printed notes, video lectures and audio lectures etc. Assimilation relates to the mode of learning and can be done by hands on experiment, by computer training or by tutoring. Assessment is a way to test the level of understanding of the subject by the students it can be done either online or offline. It is done according to various levels, age group etc. Table 12 and 13 shows proposed tags related to education. Table 12. The tags related to education that are suggested to schema.org Properties proposed are added under schema.org/school. This has more specific properties as schema.org/student, schema.org/faculty and schema.org/teachingmethodologies. School > TeachningMethodolgies This hierarchy is created in the the current changes in the Teaching Methodologies in view, this can be extended further. Presently it has three more specific properties Access, Assimilation, Assessment. School > TeachingMethodologies > Access Property Expected Type Description modeofacces schema.org/te The mode of accessing the resources. s xt Ex: "online" or Ex: "offline" School > TeachingMethodologies > Access > Traditional The properties in this are 'Book', 'VideoLectures' and 'AudioLectures'. These can be imported from schema.org/book, schema.org/videoobject and schema.org/audioobject. School > TeachingMethodologies > Access > NonTraditional The more specific type under this hierarchy can be schema.org/discussionfourm and schema.org/correspondence. The properties in this hierarchy are discussed below. peertopeer schema.org Here there will be a teacher and a student, /Text teacher will be guiding the student. 65

Is your Google ranking suffering from outdated SEO?

Is your Google ranking suffering from outdated SEO? Is your Google ranking suffering from outdated SEO? If your competitors are using the new semantic methods of search engine optimization, and you are still relying solely on keyword placement and links,

More information

Microdata and schema.org

Microdata and schema.org Basics Microdata and schema.org l Microdata is a simple semantic markup scheme that s an alternative to RDFa l Developed by WHATWG and supported by major search companies (Goog,e, MSFT, Yahoo) l Like RDFa,

More information

Interactive e-books on Aakash Tablet

Interactive e-books on Aakash Tablet Interactive e-books on Aakash Tablet M.Tech. Dissertation Submitted in partial fulfillment of the requirements for the degree of Master of Technology in Computer Science and Engineering Submitted by Prashant

More information

Microdata and schema.org

Microdata and schema.org Basics Microdata and schema.org l Microdata is a simple seman1c markup scheme that s an alterna1ve to RDFa l Developed by WHATWG and supported by major search companies (Google, MicrosoE, Yahoo, Yandex)

More information

Open Educational Resources

Open Educational Resources IOER offers options to share career and educational resources. Resource formats Existing online resources. Digital files that get uploaded to IOER. Sets of files and/or web pages that need to be kept together

More information

Introduction. Paradigm Publishing. SNAP for Microsoft Office SNAP for Our Digital World. System Requirements

Introduction. Paradigm Publishing. SNAP for Microsoft Office SNAP for Our Digital World. System Requirements Introduction Paradigm Publishing Paradigm understands the needs of today s educators and exceeds the demand by offering the latest technological advancements for coursework settings. With the success of

More information

Best Practices for Structuring Content and Integrating e-learning and e-textbooks

Best Practices for Structuring Content and Integrating e-learning and e-textbooks Best Practices for Structuring Content and Integrating e-learning and e-textbooks The Publisher Perspective Paul Belfanti Pearson Education North America Director of Content Architecture May 30, 2013 Business

More information

"Charting the Course to Your Success!" MOC Microsoft SharePoint 2010 Site Collection and Site Administration Course Summary

Charting the Course to Your Success! MOC Microsoft SharePoint 2010 Site Collection and Site Administration Course Summary MOC 50547 Microsoft SharePoint Site Collection and Site Course Summary Description This five-day instructor-led Site Collection and Site Administrator course gives students who have SharePoint Owner permissions

More information

2018 SEO CHECKLIST. Use this checklist to ensure that you are optimizing your website by following these best practices.

2018 SEO CHECKLIST. Use this checklist to ensure that you are optimizing your website by following these best practices. 2018 SEO CHECKLIST Your website should be optimized to serve your users. This checklist gives you the best practices for Search Engine Optimization (SEO) whether you are a freelancer, small business, SEO

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

ASSESSMENT QUERIES 3 SUBMITTING YOUR WORK IN TURNITIN 4. How do I upload my case report/work in Turnitin? 4 ACCESSING YOUR FEEDBACK 5

ASSESSMENT QUERIES 3 SUBMITTING YOUR WORK IN TURNITIN 4. How do I upload my case report/work in Turnitin? 4 ACCESSING YOUR FEEDBACK 5 FAQs Contents Who should I contact with a query about assessment? 2 Who should I contact with a technical issues or issues regarding the teaching and learning materials on the module? 2 Who should I contact

More information

The CIW Web Foundations courses prepare students to take the CIW Web Foundations Associate certification exam.

The CIW Web Foundations courses prepare students to take the CIW Web Foundations Associate certification exam. xi Course Description Site Development Associate teaches you essential Webpage development skills. You will learn to develop Websites using Hypertext Markup Language version 5 (HTML5) and Cascading Style

More information

Frequently Asked Questions

Frequently Asked Questions Frequently Asked Questions WHAT IS PD IN FOCUS? PD In Focus is an award-winning online platform that features hundreds of hours of videos showing research-based teaching practices in action. Designed as

More information

HHH Instructional Computing Fall

HHH Instructional Computing Fall Quick Start Guide for School Web Lockers Teacher log-on is the same as for Infinite Campus Student log-on is the same initial log on to the network except no school year is required before their user name

More information

Usability Testing Report of College of Liberal Arts & Sciences (CLAS) Website

Usability Testing Report of College of Liberal Arts & Sciences (CLAS) Website Usability Testing Report of College of Liberal Arts & Sciences (CLAS) Website Submitted to: Ceily Hamilton, Director of Information Technology Alex Chapin, Executive Director of Academic Technology Compiled

More information

A Developer s Guide to the Semantic Web

A Developer s Guide to the Semantic Web A Developer s Guide to the Semantic Web von Liyang Yu 1. Auflage Springer 2011 Verlag C.H. Beck im Internet: www.beck.de ISBN 978 3 642 15969 5 schnell und portofrei erhältlich bei beck-shop.de DIE FACHBUCHHANDLUNG

More information

The Intelligent Process Planner and Scheduler. by Carl P. Thompson Advisor: Jeffrey W. Herrmann, Edward Lin, Mark Fleischer, Vidit Mathur

The Intelligent Process Planner and Scheduler. by Carl P. Thompson Advisor: Jeffrey W. Herrmann, Edward Lin, Mark Fleischer, Vidit Mathur UNDERGRADUATE REPORT The Intelligent Process Planner and Scheduler by Carl P. Thompson Advisor: Jeffrey W. Herrmann, Edward Lin, Mark Fleischer, Vidit Mathur U.G. 2000-1 I R INSTITUTE FOR SYSTEMS RESEARCH

More information

Massachusetts Institute of Technology 6.UAP Final Report. MOOCdb: An collaborative environment for MOOC data. December 11, 2013 Sherwin Wu

Massachusetts Institute of Technology 6.UAP Final Report. MOOCdb: An collaborative environment for MOOC data. December 11, 2013 Sherwin Wu Massachusetts Institute of Technology 6.UAP Final Report MOOCdb: An collaborative environment for MOOC data December 11, 2013 Contents 1 Introduction 2 Motivation and Previous Work 3 MOOCdb Visualizations

More information

Introduction. Paradigm Publishing. SNAP for Microsoft Office SNAP for Our Digital World

Introduction. Paradigm Publishing. SNAP for Microsoft Office SNAP for Our Digital World Introduction Paradigm Publishing Paradigm understands the needs of today s educators and exceeds the demand by offering the latest technological advancements for coursework settings. With the success of

More information

AGRICULTURE BASED ANDROID APPLICATION

AGRICULTURE BASED ANDROID APPLICATION AGRICULTURE BASED ANDROID APPLICATION Prof.Aradhana D 1, Shiva Prasad K S 2, Shrivaishnavi J K 3, P. Sowmya 4, Tina Agarwal 5 1 Department of Computer Science & Engineering Ballari Institute of Technology

More information

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.

UNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai. UNIT-V WEB MINING 1 Mining the World-Wide Web 2 What is Web Mining? Discovering useful information from the World-Wide Web and its usage patterns. 3 Web search engines Index-based: search the Web, index

More information

ANALYTICS DATA To Make Better Content Marketing Decisions

ANALYTICS DATA To Make Better Content Marketing Decisions HOW TO APPLY ANALYTICS DATA To Make Better Content Marketing Decisions AS A CONTENT MARKETER you should be well-versed in analytics, no matter what your specific roles and responsibilities are in working

More information

A Study on Website Quality Models

A Study on Website Quality Models International Journal of Scientific and Research Publications, Volume 4, Issue 12, December 2014 1 A Study on Website Quality Models R.Anusha Department of Information Systems Management, M.O.P Vaishnav

More information

International School of informatics and Management

International School of informatics and Management 1 International School of informatics and Management Subject: System Design Lab Project Name: Student Admission System Group Number: 5 Team Guide: Jyoti Khurana (Lecturer) Members: Ashok Kumar Soni Hridayesh

More information

DEVELOPING MICROSOFT SHAREPOINT SERVER 2013 ADVANCED SOLUTIONS. Course: 20489A; Duration: 5 Days; Instructor-led

DEVELOPING MICROSOFT SHAREPOINT SERVER 2013 ADVANCED SOLUTIONS. Course: 20489A; Duration: 5 Days; Instructor-led CENTER OF KNOWLEDGE, PATH TO SUCCESS Website: DEVELOPING MICROSOFT SHAREPOINT SERVER 2013 ADVANCED SOLUTIONS Course: 20489A; Duration: 5 Days; Instructor-led WHAT YOU WILL LEARN This course provides SharePoint

More information

How to do an On-Page SEO Analysis Table of Contents

How to do an On-Page SEO Analysis Table of Contents How to do an On-Page SEO Analysis Table of Contents Step 1: Keyword Research/Identification Step 2: Quality of Content Step 3: Title Tags Step 4: H1 Headings Step 5: Meta Descriptions Step 6: Site Performance

More information

A Review of Kids Tutor

A Review of Kids Tutor A Review of Kids Tutor Monali Kumbhare, Ashwini Rajput, Bhavna Daswani Department of CSE, Jhulelal Institute of Technology,Lonara Nagpur Prof. Priyanka Dudhe Assistant Professor, Department of CSE, Jhulelal

More information

Career Highlights. Learn More About Me 11/09/2012. Hamlet Batista. Why Pay for Performance When You Can Lead the World To Your Door for Free?

Career Highlights. Learn More About Me 11/09/2012. Hamlet Batista. Why Pay for Performance When You Can Lead the World To Your Door for Free? Why Pay for Performance When You Can Lead the World To Your Door for Free? Hamlet Batista Owner, Hamlet Batista Group http://hamletbatista.com Career Highlights 2002-2005: Affiliate marketer, 1 st page

More information

CANDIDATE SKILL ANALYSER

CANDIDATE SKILL ANALYSER Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 6.017 IJCSMC,

More information

SEOHUNK INTERNATIONAL D-62, Basundhara Apt., Naharkanta, Hanspal, Bhubaneswar, India

SEOHUNK INTERNATIONAL D-62, Basundhara Apt., Naharkanta, Hanspal, Bhubaneswar, India SEOHUNK INTERNATIONAL D-62, Basundhara Apt., Naharkanta, Hanspal, Bhubaneswar, India 752101. p: 305-403-9683 w: www.seohunkinternational.com e: info@seohunkinternational.com DOMAIN INFORMATION: S No. Details

More information

Meta-Content framework for back index generation

Meta-Content framework for back index generation Meta-Content framework for back index generation Tripti Sharma, Assistant Professor Department of computer science Chhatrapati Shivaji Institute of Technology. Durg, India triptisharma@csitdurg.in Sarang

More information

How to Speak Search Engine

How to Speak Search Engine How to Speak Search Engine Using Schema.org and Structured Data to Power the Search Results of the Future Want Your Search Results to Look Like This? Search engines want to provide more rich data in search

More information

Statement on Continuing Professional Education 2003*

Statement on Continuing Professional Education 2003* 01/2018 * The statement was issued in January, 2003 and amended from time to time. 1.1 The Institute of Chartered Accountants of India (ICAI), set up under the Chartered Accountants Act, 1949 (the Act),

More information

Getting the most from your websites SEO. A seven point guide to understanding SEO and how to maximise results

Getting the most from your websites SEO. A seven point guide to understanding SEO and how to maximise results Getting the most from your websites SEO A seven point guide to understanding SEO and how to maximise results About this document SEO: What is it? This document is aimed at giving a SEO: What is it? 2 SEO

More information

Modern Control Engineering Ogata 2nd Edition

Modern Control Engineering Ogata 2nd Edition We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with modern control engineering

More information

DP Project Development Pvt. Ltd.

DP Project Development Pvt. Ltd. Search Engine Optimization Training Syllabus Training that makes you focus on the correct business: Today's market is competitive and one has to be top in his field to make profits and stay in the business.

More information

How to Drive More Traffic to Your Website in By: Greg Kristan

How to Drive More Traffic to Your Website in By: Greg Kristan How to Drive More Traffic to Your Website in 2019 By: Greg Kristan In 2018, Bing Drove 30% of Organic Traffic to TM Blast By Device Breakdown The majority of my overall organic traffic comes from desktop

More information

Implementation of Enhanced Web Crawler for Deep-Web Interfaces

Implementation of Enhanced Web Crawler for Deep-Web Interfaces Implementation of Enhanced Web Crawler for Deep-Web Interfaces Yugandhara Patil 1, Sonal Patil 2 1Student, Department of Computer Science & Engineering, G.H.Raisoni Institute of Engineering & Management,

More information

Schema org/microdata Exposing Y our Your Data the Web (The Easy Way) Linked Data vs Schema.org: A Town Hall Debate about the Future of Information

Schema org/microdata Exposing Y our Your Data the Web (The Easy Way) Linked Data vs Schema.org: A Town Hall Debate about the Future of Information Schema.org/Microdata Or Exposing Your Data on the Web (The Easy Way) Linked Data vs Schema.org: A Town Hall Debate about the Future of Information Library Technology Conference Macalester College, St.

More information

THE HISTORY & EVOLUTION OF SEARCH

THE HISTORY & EVOLUTION OF SEARCH THE HISTORY & EVOLUTION OF SEARCH Duration : 1 Hour 30 Minutes Let s talk about The History Of Search Crawling & Indexing Crawlers / Spiders Datacenters Answer Machine Relevancy (200+ Factors)

More information

Gerri Holden Ministries Executive Summary Student Consultant, Kate Stepp Community Partner, Gerri Holden

Gerri Holden Ministries Executive Summary Student Consultant, Kate Stepp Community Partner, Gerri Holden Gerri Holden Ministries Executive Summary Student Consultant, Kate Stepp Community Partner, Gerri Holden I. Background Information The mission of Gerri Holden Ministries is to provide academic programs

More information

code pattern analysis of object-oriented programming languages

code pattern analysis of object-oriented programming languages code pattern analysis of object-oriented programming languages by Xubo Miao A thesis submitted to the School of Computing in conformity with the requirements for the degree of Master of Science Queen s

More information

From HyperTEXT to HyperTEC

From HyperTEXT to HyperTEC CIKM 2012 Industry Panel Keynote Speech, Nov 1 st 2012 From HyperTEXT to HyperTEC Xuedong D. Huang (XD) Microsoft Corporation Redmond, WA 98052, USA xdh@microsoft.com Talk Outline Bing It On challenges

More information

Search Engine Optimization. Rebecca Blanchette SEO & Analytics Specialist Carnegie Communications

Search Engine Optimization. Rebecca Blanchette SEO & Analytics Specialist Carnegie Communications Search Engine Optimization Rebecca Blanchette SEO & Analytics Specialist Carnegie Communications What is SEO anyway? The short answer: search engine optimization refers to the process of optimizing your

More information

Give Your DITA wings with taxonomy & modern web design. Joe Pairman

Give Your DITA wings with taxonomy & modern web design. Joe Pairman Give Your DITA wings with taxonomy & modern web design Joe Pairman What do we all want? What do we all want? ~ Free beer What do we all want? ~ Free beer ~ We want our content to be effective What do we

More information

Automated Tagging to Enable Fine-Grained Browsing of Lecture Videos

Automated Tagging to Enable Fine-Grained Browsing of Lecture Videos Automated Tagging to Enable Fine-Grained Browsing of Lecture Videos K.Vijaya Kumar (09305081) under the guidance of Prof. Sridhar Iyer June 28, 2011 1 / 66 Outline Outline 1 Introduction 2 Motivation 3

More information

Full Website Audit. Conducted by Mathew McCorry. Digimush.co.uk

Full Website Audit. Conducted by Mathew McCorry. Digimush.co.uk Full Website Audit Conducted by Mathew McCorry Digimush.co.uk 1 Table of Contents Full Website Audit 1 Conducted by Mathew McCorry... 1 1. Overview... 3 2. Technical Issues... 4 2.1 URL Structure... 4

More information

Report Prepared By: Prof. Aruna Pavate

Report Prepared By: Prof. Aruna Pavate ATHARVA EDUCATIONAL TRUST'S ATHARVA COLLEGE OF ENGINEERING (Approved by AICTE, Recognized by Government of Maharashtra & Affiliated to University of Mumbai - Estd. 1999-2000) Department of Computer Engineering

More information

Online Copywriting Tips and Search Engine Optimisation.

Online Copywriting Tips and Search Engine Optimisation. Online Copywriting Tips and Search Engine Optimisation www.finalsite.co.uk A Bit About Me. Alta Justus, Director of Marketing @ Finalsite UK Over 17 years commercial experience marketing both online and

More information

Overview of Web Mining Techniques and its Application towards Web

Overview of Web Mining Techniques and its Application towards Web Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous

More information

University of Maryland at College Park Department of Geographical Sciences GEOG 477/ GEOG777: Mobile GIS Development

University of Maryland at College Park Department of Geographical Sciences GEOG 477/ GEOG777: Mobile GIS Development University of Maryland at College Park Department of Geographical Sciences GEOG 477/ GEOG777: Mobile GIS Development Instructor: Dr. Ruibo Han Office: LeFrak Hall (LEF) 1111B Email: ruibo@umd.edu (preferred)

More information

SQA Advanced Unit specification. General information for centres. Unit title: Web Development Fundamentals. Unit code: HR7M 47

SQA Advanced Unit specification. General information for centres. Unit title: Web Development Fundamentals. Unit code: HR7M 47 SQA Advanced Unit specification General information for centres Unit title: Web Development Fundamentals Unit code: HR7M 47 Unit purpose: This Unit is designed to enable candidates to develop websites

More information

SYLLABUS. Departmental Syllabus. HTML Web Page Development. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus

SYLLABUS. Departmental Syllabus. HTML Web Page Development. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus. Departmental Syllabus SYLLABUS DATE OF LAST REVIEW: 11/6/2018 CIP CODE: 11.0801 SEMESTER: COURSE TITLE: COURSE NUMBER: HTML Web Page Development CIST-0137 CREDIT HOURS: 3 INSTRUCTOR: OFFICE LOCATION: OFFICE HOURS: TELEPHONE:

More information

Website Quality Evaluation Based on Search Engine Queries using Web Rank Position Algorithm (WRPA)

Website Quality Evaluation Based on Search Engine Queries using Web Rank Position Algorithm (WRPA) Indonesian Journal of Electrical Engineering and Computer Science Vol. 4, No. 1, October 2016, pp. 224 ~ 230 DOI: 10.11591/ijeecs.v4.i1.pp224-230 224 Website Quality Evaluation Based on Search Engine Queries

More information

Title Core TIs Optional TIs Core Labs Optional Labs. 1.1 WANs All None None None. All None None None. All None 2.2.1, 2.2.4, 2.2.

Title Core TIs Optional TIs Core Labs Optional Labs. 1.1 WANs All None None None. All None None None. All None 2.2.1, 2.2.4, 2.2. CCNA 2 Plan for Academy Student Success (PASS) CCNA 2 v3.1 Instructional Update # 2006-1 This Instructional Update has been issued to provide guidance on the flexibility that Academy instructors now have

More information

Hidden Google SEO Secrets. SEO Marketing GUIDES

Hidden Google SEO Secrets. SEO Marketing GUIDES Hidden Google SEO Secrets SEO Marketing GUIDES SEO Secrets Search Engine Optimisation If you re reading this guide then you already know about SEO but to recap: SEO is the process of improving the visibility

More information

power up your business SEO (SEARCH ENGINE OPTIMISATION)

power up your business SEO (SEARCH ENGINE OPTIMISATION) SEO (SEARCH ENGINE OPTIMISATION) SEO (SEARCH ENGINE OPTIMISATION) The visibility of your business when a customer is looking for services that you offer is important. The first port of call for most people

More information

Web Crawling. Jitali Patel 1, Hardik Jethva 2 Dept. of Computer Science and Engineering, Nirma University, Ahmedabad, Gujarat, India

Web Crawling. Jitali Patel 1, Hardik Jethva 2 Dept. of Computer Science and Engineering, Nirma University, Ahmedabad, Gujarat, India Web Crawling Jitali Patel 1, Hardik Jethva 2 Dept. of Computer Science and Engineering, Nirma University, Ahmedabad, Gujarat, India - 382 481. Abstract- A web crawler is a relatively simple automated program

More information

(Refer Slide Time: 00:50)

(Refer Slide Time: 00:50) Programming, Data Structures and Algorithms Prof. N.S. Narayanaswamy Department of Computer Science and Engineering Indian Institute of Technology Madras Module - 03 Lecture 30 Searching Unordered linear

More information

Overview. Lab 2: Information Retrieval. Assignment Preparation. Data. .. Fall 2015 CSC 466: Knowledge Discovery from Data Alexander Dekhtyar..

Overview. Lab 2: Information Retrieval. Assignment Preparation. Data. .. Fall 2015 CSC 466: Knowledge Discovery from Data Alexander Dekhtyar.. .. Fall 2015 CSC 466: Knowledge Discovery from Data Alexander Dekhtyar.. Due date: Thursday, October 8. Lab 2: Information Retrieval Overview In this assignment you will perform a number of Information

More information

Five SEO Strategies Every Company Needs to Master

Five SEO Strategies Every Company Needs to Master Five SEO Strategies Every Company Needs to Master Martin Laetsch Agenda SEO Overview Link Building Google Authorship Set a Canonical URL Microdata/Schema Responsive Design What is Search Engine Marketing?

More information

Government of Karnataka Department of Technical Education Bengaluru. Course Title: Web Programming Lab Scheme (L:T:P) : 0:2:4 Total Contact Hours: 78

Government of Karnataka Department of Technical Education Bengaluru. Course Title: Web Programming Lab Scheme (L:T:P) : 0:2:4 Total Contact Hours: 78 Government of Karnataka Department of Technical Education Bengaluru Course Title: Web Programming Lab Scheme (L:T:P) : 0:2:4 Total Contact Hours: 78 Course Code: 15CS55P Type of Course: Tutorial and Practical

More information

Blackboard course design

Blackboard course design DEO team, Academic Registry www.bristol.ac.uk/digital-education Blackboard course design Updated: 8 th Jan 2018 Contents 1. About this guide... 2 2. Essential information... 2 3. Requesting a Blackboard

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW PAPER ON IMPLEMENTATION OF DOCUMENT ANNOTATION USING CONTENT AND QUERYING

More information

Frequently Asked Questions (FAQ)

Frequently Asked Questions (FAQ) You are requested to go through all the questions & answers in this section and also the Advertisement Notification before proceeding for Registration and subsequent submission of Online Application Form

More information

Introduction to Moodle

Introduction to Moodle Introduction to Moodle Preparing for a Moodle Staff Development Session... 2 Logging in to Moodle... 2 Adding an image to your profile... 4 Navigate to and within a course... 6 Content of the basic template

More information

A tool for Entering Structural Metadata in Digital Libraries

A tool for Entering Structural Metadata in Digital Libraries A tool for Entering Structural Metadata in Digital Libraries Lavanya Prahallad, Indira Thammishetty, E.Veera Raghavendra, Vamshi Ambati MSIT Division, International Institute of Information Technology,

More information

Ministry of Higher Education and Scientific Research

Ministry of Higher Education and Scientific Research Morning Study Department of information technology Institute of Technical - Duhok. University of Polytechnic Duhok. Subject: Web Technology Course book for 2nd year. Lecturer s name: MSc. Ayman Nashwan

More information

SharePoint 2013 End User Level II

SharePoint 2013 End User Level II SharePoint 2013 End User Level II Course 55052A; 3 Days, Instructor-led Course Description This 3-day course explores several advanced topics of working with SharePoint 2013 sites. Topics include SharePoint

More information

Objectives. Connecting with Computer Science 2

Objectives. Connecting with Computer Science 2 Objectives Learn what the Internet really is Become familiar with the architecture of the Internet Become familiar with Internet-related protocols Understand how the TCP/IP protocols relate to the Internet

More information

Programme Specification

Programme Specification Programme Specification Awarding Body/Institution Teaching Institution Queen Mary University of London Queen Mary University of London Name of Final Award and Programme Title Certificate in Intellectual

More information

The Ultimate YouTube SEO Guide: Tips & Tricks on How to Increase Views and Rankings for your Online Videos

The Ultimate YouTube SEO Guide: Tips & Tricks on How to Increase Views and Rankings for your Online Videos The Ultimate YouTube SEO Guide: Tips & Tricks on How to Increase Views and Rankings for your Online Videos The Ultimate App Store Optimization Guide Summary 1. Introduction 2. Choose the right video topic

More information

Why it Really Matters to RESNET Members

Why it Really Matters to RESNET Members Welcome to SEO 101 Why it Really Matters to RESNET Members Presented by Fourth Dimension at the 2013 RESNET Conference 1. 2. 3. Why you need SEO How search engines work How people use search engines

More information

Edmodo for Teachers Guide (Taken directly from Edmodo s site.)

Edmodo for Teachers Guide (Taken directly from Edmodo s site.) Edmodo for Teachers Guide (Taken directly from Edmodo s site.) Table of Contents: 1. Teacher Sign Up 2. Create a Group 3. Manage Your Groups 4. Small Group 5. Co-Teaching 6. Student Sign up 7. Post a Message

More information

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user

More information

COBIT 5 Foundation. Lesson Plan. Mock Exam: Duration: Language:

COBIT 5 Foundation. Lesson Plan.  Mock Exam: Duration: Language: COBIT 5 Foundation Lesson Plan Delivery: e-learning Certificate: Examination (included) Accredited By: APMG Mock Exam: Duration: Language: Included in Course 15 hours, 30 minutes, self-paced English www.goodelearning.com

More information

FACULTY PROFILE. Total Experience : Academic : 7 Years and 3 Months. Degree Branch / Specialization College University

FACULTY PROFILE. Total Experience : Academic : 7 Years and 3 Months. Degree Branch / Specialization College University FACULTY PROFILE Name Designation Email ID : A.K.Kavitha : Assistant Professor : kavitha.ece@srit.org Area of Specialization : Wireless Communication Total Experience : Academic : 7 Years and 3 Industry

More information

FIRST IDEA WEB DEVELOPMENT(Pvt.)Ltd.

FIRST IDEA WEB DEVELOPMENT(Pvt.)Ltd. FIRST IDEA WEB DEVELOPMENT(Pvt.)Ltd. SEO (SEARCH ENGINE OPTIMIZATION) OCTOBER 2018 On Page SEO focuses on the content on your site, including the relevancy of your content, the format and structure of

More information

"Charting the Course... ITIL 2011 Managing Across the Lifecycle ( MALC ) Course Summary

Charting the Course... ITIL 2011 Managing Across the Lifecycle ( MALC ) Course Summary Course Summary Description ITIL is a set of best practices guidance that has become a worldwide-adopted framework for IT Service Management by many Public & Private Organizations. Since early 1990, ITIL

More information

Search Engine Optimization. Rebecca Blanchette SEO and Analytics Specialist Carnegie

Search Engine Optimization. Rebecca Blanchette SEO and Analytics Specialist Carnegie Search Engine Optimization Rebecca Blanchette SEO and Analytics Specialist Carnegie Communications @beccablanchette What Is SEO Anyway? The short answer: Search Engine Optimization refers to the process

More information

INST Database Design and Modeling - Section 0101 Spring Tentative Syllabus

INST Database Design and Modeling - Section 0101 Spring Tentative Syllabus INST 327 - Database Design and Modeling - Section 0101 Spring 2017 - Tentative Syllabus Instructors: Office: Phone: E-mail: Office Hours: Vedat G. Diker (Dr. Diker) Hornbake 4111F (301) 405-9814 vdiker@umd.edu

More information

POSTGRADUATE CERTIFICATE IN LEARNING & TEACHING - REGULATIONS

POSTGRADUATE CERTIFICATE IN LEARNING & TEACHING - REGULATIONS POSTGRADUATE CERTIFICATE IN LEARNING & TEACHING - REGULATIONS 1. The Postgraduate Certificate in Learning and Teaching (CILT) henceforth the Course - comprises two modules: an Introductory Certificate

More information

"Charting the Course to Your Success!" MOC B Programming in C# Course Summary

Charting the Course to Your Success! MOC B Programming in C# Course Summary Course Summary Description This training course teaches developers the programming skills that are required for developers to create Windows applications using the C# language. During their five days in

More information

Degree Branch / Specialization College University CSE SONA COLLEGE OF TECHNOLOGY : ASSISTANT PROFESSOR (SENIOR GRADE) ASSISTANT PROFESSOR

Degree Branch / Specialization College University CSE SONA COLLEGE OF TECHNOLOGY : ASSISTANT PROFESSOR (SENIOR GRADE) ASSISTANT PROFESSOR FACULTY PROFILE Name Designation Email ID : NISHA SOMS : ASSISTANT PROFESSOR (SENIOR GRADE) : nishasoms.cse@srit.org Area of Specialization : Wireless Ad Hoc Networks, Mobile Computing Cryptography and

More information

ISSN (Online) ISSN (Print)

ISSN (Online) ISSN (Print) Accurate Alignment of Search Result Records from Web Data Base 1Soumya Snigdha Mohapatra, 2 M.Kalyan Ram 1,2 Dept. of CSE, Aditya Engineering College, Surampalem, East Godavari, AP, India Abstract: Most

More information

TECHNICAL NOTE RECOGNITION OF APPLIED CIVIL ENGINEERING SKILLS

TECHNICAL NOTE RECOGNITION OF APPLIED CIVIL ENGINEERING SKILLS TECHNICAL NOTE RECOGNITION OF APPLIED CIVIL ENGINEERING SKILLS Authors: Andrew Taylor Communications Manager InfraTrain New Zealand Limited Tel 04 494 1883 andrew@infratrain.co.nz Alister Harlow Chairman

More information

2D Computer Animation for Visual Communication: Advanced

2D Computer Animation for Visual Communication: Advanced Higher National Unit Specification General information for centres Unit title: 2D Computer Animation for Visual Communication: Advanced Unit code: DX3D 36 Unit purpose: The Unit is designed to extend candidates

More information

GOOGLE ANALYTICS 101 INCREASE TRAFFIC AND PROFITS WITH GOOGLE ANALYTICS

GOOGLE ANALYTICS 101 INCREASE TRAFFIC AND PROFITS WITH GOOGLE ANALYTICS GOOGLE ANALYTICS 101 INCREASE TRAFFIC AND PROFITS WITH GOOGLE ANALYTICS page 2 page 3 Copyright All rights reserved worldwide. YOUR RIGHTS: This book is restricted to your personal use only. It does not

More information

Creating a Classifier for a Focused Web Crawler

Creating a Classifier for a Focused Web Crawler Creating a Classifier for a Focused Web Crawler Nathan Moeller December 16, 2015 1 Abstract With the increasing size of the web, it can be hard to find high quality content with traditional search engines.

More information

Introduction Building and Using Databases for historical research December 2012

Introduction Building and Using Databases for historical research December 2012 1. Introduction This is a non-tutor led course that can be completed at your own pace and at a time of your own choosing. We strongly recommend that you complete each module and its component sections

More information

IT2.weebly.com Applied ICT 9713

IT2.weebly.com Applied ICT 9713 Chapter 11 Database and charts You already know how to o define database record structures o enter data into a database o select subsets of data within a database o sort data within a database o produce

More information

Ohio Media Spectrum Fall 2015, Vol. 67, No. 1

Ohio Media Spectrum Fall 2015, Vol. 67, No. 1 ISearch: Bridging the Research Path from K- 12 to College and Career by Erica Clay, INFOhio Integration Librarian and Jennifer Schwelik, INFOhio elearning Specialist Abstract: Along with many other changes

More information

G. PULLAIAH COLLEGE OF ENGINEERING AND TECHNOLOGY Pasupula, Nandikotkur Road, Kurnool

G. PULLAIAH COLLEGE OF ENGINEERING AND TECHNOLOGY Pasupula, Nandikotkur Road, Kurnool G. PULLAIAH COLLEGE OF ENGINEERING AND TECHNOLOGY Pasupula, Nandikotkur Road, Kurnool-518014 BRANCH: COMPUTER SCIENCE AND ENGINEERING COURSE DESCRIPTION FORM Course Title Course Code Regulation Course

More information

Familiar Simple Easy Safe. New (Delve, Sway) Different Rich Engaging Potential for innovation. Late Majority 34% 2.5% Innovators. Early Majority 34%

Familiar Simple Easy Safe. New (Delve, Sway) Different Rich Engaging Potential for innovation. Late Majority 34% 2.5% Innovators. Early Majority 34% New (Delve, Sway) Different Rich Engaging Potential for innovation Familiar Simple Easy Safe 2.5% Innovators Early Adopters 13.5% Early Majority 34% Late Majority 34% Laggards 13.5% What is Office 365?

More information

All LJMU programmes are delivered and assessed in English

All LJMU programmes are delivered and assessed in English PROGRAMME SPECIFICATION Master of Science in Computer Forensics Awarding institution Teaching institution JACS Code Programme Duration Language of Programme Subject benchmark statement Programme accredited

More information

Marketing & Back Office Management

Marketing & Back Office Management Marketing & Back Office Management Menu Management Add, Edit, Delete Menu Gallery Management Add, Edit, Delete Images Banner Management Update the banner image/background image in web ordering Online Data

More information

Digital Repositories for Sophisticated Information Management: Emphasizing Development of Digital Repositories in India

Digital Repositories for Sophisticated Information Management: Emphasizing Development of Digital Repositories in India Digital Repositories for Sophisticated Information Management: Emphasizing Development of Digital Repositories in India Prantosh Kr. Pau1 l, R.Senthamarai 2 and K.S. Shivraj 3 1 FBAS, Bengal Engineering

More information

"Charting the Course... MOC A: SharePoint 2016 Site Collections and Site Owner Administration. Course Summary

Charting the Course... MOC A: SharePoint 2016 Site Collections and Site Owner Administration. Course Summary MOC 55234 A: 2016 Site Collections Course Summary Description This five-day instructor-led course is intended for power users and IT professionals who are tasked with working within the 2016 environment

More information

Gradintelligence student support FAQs

Gradintelligence student support FAQs Gradintelligence student support FAQs Account activation issues... 2 I have not received my activation link / I cannot find it / it has expired. Please can you send me a new one?... 2 My account is showing

More information

Introduction to Big Data

Introduction to Big Data Introduction to Big Data OVERVIEW We are experiencing transformational changes in the computing arena. Data is doubling every 12 to 18 months, accelerating the pace of innovation and time-to-value. The

More information