Vocabulary Patterns in Free-for-all Collaborative Indexing Systems

Wolfgang Maass, Tobias Kowatsch, and Timo Münster

Hochschule Furtwangen University (HFU), Robert-Gerwig-Platz 1, D-78120 Furtwangen, Germany

Abstract. In collaborative indexing systems, users generate a large amount of metadata by labelling web-based content. These labels are known as tags and form a shared vocabulary. To understand the characteristics of that vocabulary, we study structural patterns of these tags by applying the theory of self-organizing systems. We use graph-theoretic notions to model the network of tags and their valued connections, which represent the frequency rates of co-occurring tags. Empirical data is drawn from the free-for-all collaborative indexing systems Delicious, Connotea and CiteULike. First, we measure the frequency distribution of co-occurring tags. Second, we relate these tags to their rank over time. The results indicate a strong relationship among a few tags as well as a notable persistence of these tags over time. We therefore conjecture that the observed collaborative indexing systems are self-organizing systems that build a shared vocabulary. One implication is the presence of semantic domains based on high frequency rates of co-occurring tags, which reflect topics of interest among the user community. Observed over time, these semantic domains can be used to trace the historical development of, and emerging trends in, the community's interests. This enhances collaborative indexing systems in general and provides a new tool for developing community-based products and services.

Key words: Metadata, tagging, shared vocabulary, online community, collaborative software, self-organizing system

1 Introduction

Cooperative, distributed labelling of content on the World Wide Web is called collaborative indexing or social tagging.
Within a collaborative indexing system, users annotate different kinds of content, e.g. events, video clips, music, pictures, articles and references, weblogs or websites. These collaborative indexing systems facilitate mass categorization and establish so-called folksonomies, i.e. bottom-up categorizations made by a large user base. A collaborative indexing system has two basic features. First, it is used for future retrieval of self-indexed content. Second, it provides recommendations based on the co-occurrence of highly used tags within all annotations, where we call one single process of annotation an indexing task (see footnote 8). The recommendations are shown to the user in response to a tag query. For instance, content tagged with html will frequently be tagged with css as well. The data collected within an indexing task contains the name of the user, a URL linking to the content, one or more tags and time-stamp information. The data within a collaborative indexing system is therefore basically a network of users, tags and content in a given period of time. All tags together represent the shared vocabulary of the user community.

In this paper we study the structural patterns of that vocabulary and thus focus only on the partial network of tags. Analyzing this partial network requires some constructs of graph theory. We assume the shared vocabulary to be a self-organizing system in the sense of systems theory [1]. Hence, we look for stable patterns as well as specific correlations throughout the vocabulary and present the implications of these patterns. To meet the requirements of self-organizing systems by reducing external restrictions and forces, we choose the free-for-all collaborative indexing systems Delicious, Connotea and CiteULike for empirical data extraction, where any user can index any content element. Thus, indexing rights are not restricted, as identified by Marlow et al. [2].

This paper starts with related work covering collaborative indexing systems and systems theory.
Then, we state two hypotheses regarding stable patterns within the vocabulary. Afterwards, we build a model based on graph-theoretic notions, clarify the methodological approach and present the empirical data used to test the hypotheses. Subsequently, we present and discuss the results of our analysis and draw implications from them. Finally, we give an outlook on further research.

Footnote 8: There may also exist other recommender implementations, but we focus on the co-occurrence of highly used tags because this information is freely accessible on the web.

2 Related Work

A general review of collaborative indexing systems is given by Voss [3]. Mathes [4] discusses the organization of information via tags and points out that user-generated metadata is of an uncontrolled nature and fundamentally chaotic compared to a controlled vocabulary. But he also mentions that collaborative indexing systems are highly responsive to the users' needs and their vocabulary because they involve the users in the process of organization. Vander Wal [5] distinguishes between broad and narrow folksonomies depending on the number of users who tag one specific content element. He also defines the difference between pure tagging and folksonomy tagging. Voss [6] discovers power law distributions of tag frequency rates in Delicious and Wikipedia, supporting the presence of self-organizing systems. Hotho et al. [7] and Quintarelli [8] also find power law distributions in collaborative indexing systems. Lund et al. [9] measure a power law distribution of user-shared tags within Connotea. Results of Golder and Huberman [10] show regularities of dynamic structures within Delicious. Moreover, they introduce a classification of the semantics of tags, as do Zhichen et al. [11]. Wu et al. [12] identify the potential of collaborative indexing systems as a technological infrastructure for acquiring social knowledge. Millen et al. [13] study the deployment of a collaborative indexing system within a company and highlight the remarkable acceptance rate among the users as well as its personal and organizational usefulness. In addition, Damianos et al. [14], Farrell and Lau [15] as well as John and Seligmann [16] examine the potential of collaborative indexing systems for the enterprise, covering people's expertise, social networks and the integration of those systems in existing collaborative applications.

An early classification of collaborative indexing systems is given by Hammond et al. [17], who contrast scholarly and general resources with links and web pages. In a more detailed classification, Marlow et al. [2] distinguish the design of a system and present several user incentives. Heymann and Garcia-Molina [18] develop an algorithm that generates a hierarchical taxonomy from a tag network.
For the same purpose, Mika [19] uses social network analysis on the network of users, tags and content. Hotho et al. [7] develop a search algorithm for folksonomies to find communities of interest within collaborative indexing systems. Cattuto et al. [20] design a stochastic model for the analysis of indexing tasks over time consisting of tags and users. Dubinko et al. [21] visualize tags over time with data from Flickr, whereas Zhichen et al. [11] propose an algorithm for tag suggestions to support the user within an indexing task. An overview of self-organizing systems is given by Heylighen [1].

3 Motivation

As mentioned above, this paper deals with the partial network of tags. The concept of tags is central to collaborative indexing systems. The same tags used by different users to annotate similar content show a common understanding among the users. The set of all tags used by the user community represents the shared vocabulary. Users and content elements are linked to each other through tags, and tags are also directly connected to each other when they are used together within one indexing task. Figures 1 and 2 show such an indexing task and the resulting network of the tags sports, worldcup and soccer. For the current work, the value of those tag connections is an essential dimension, which is based on the frequency rate of tags co-occurring within all indexing tasks. A prerequisite for measuring this frequency is the bag-model for the aggregation of tags, in which multiple tags can be assigned to one resource by multiple users, as discussed by Marlow et al. [2].

Fig. 1. Graphical input mask for an indexing task (fields: url; description: FIFAworldcup.com The Official Site of FIFA World Cup; tags: sports worldcup soccer)

Fig. 2. Resulting network of the indexing task in Fig. 1 (nodes: worldcup, soccer, sports)

Prior work on stable patterns suggests that collaborative indexing systems are self-organizing systems [1,, 6, 8, 9]. The vocabulary, which consists of tags and is generated within all indexing tasks by all users, is a part of this system. It organizes its structure by itself, without a centralized control mechanism. The users of a collaborative indexing system generate this vocabulary in a decentralized way without even being aware of it. On its own, this system evolves over time into a more stable state.

In contrast to the aforementioned work, we explore patterns emerging from co-occurring tags. We want to know whether the power law distribution, which is common in broad folksonomies [7, 9, 8], also applies to the structure of co-occurring tags. This would represent a community's vocabulary that consists of a few tags co-occurring with high frequency rates and many tags co-occurring with low frequency rates. Such a pattern, which we call tag economics, would indicate a strong consensus on a particular subpart of the community's vocabulary, from which particular interests of the users can be identified. Based on these considerations, we hypothesize the relation of co-occurring tags as follows:

H1 Let T_i be a tag and T_j^i all tags co-occurring with T_i. Then the ranked frequency distribution of all valued connections from T_i to T_j^i follows a power law curve.
Additionally, we focus on the frequency dynamics of tags over time depending on their position in the aforementioned frequency distribution. We assume that tags co-occurring with high frequency rates (higher position on the power law curve) are more stable over time than tags co-occurring with low frequency rates. This would represent persistence of the community's interests. Conversely, when tags with high frequency rates fall to a low position, this suggests a shift in the community's common understanding. The second objective of the current work is therefore to examine the relationship of the frequency rates of co-occurring tags over time. We hypothesize this relationship as follows:

H2 The higher the frequency rates of the tags T_j^i, the more stable they are over time.

4 Model

Let an indexing task be a quadruple < user, url, timestamp, tag >. One user enters a URL with zero, one or more tags into a collaborative indexing system at a certain time. Only two entities are relevant to our hypotheses, namely timestamp and tag. Therefore, the community's vocabulary is modelled as an undirected, valued and finite graph V within a given period of time δ. This period of time is essential because the frequency of time-based indexing tasks is subject to fluctuations, which occur in the course of a day, a week or a month. Furthermore, δ can be used to directly control the size of the vocabulary V to ease the analysis. The vocabulary V consists of a set of nodes (here tags) and a set of valued links, which represent the frequency values of co-occurring tags. Hence, we also refer to this vocabulary as the network of tags. The links are undirected, since a tag i co-occurring with a tag j means that the tag j also co-occurs with the tag i. To better handle these frequency values, the vocabulary can be described by a symmetric frequency matrix F, such that the value in the ith row and jth column represents the frequency rate of the co-occurring tags i and j over all indexing tasks within δ, denoted as f(i, j). Self-references are excluded since we focus only on co-occurring tags. Thus, the diagonal values f(i, j) with i = j are always zero. Figure 3 shows an example of an undirected, valued graph of the vocabulary V, and Fig. 4 shows the corresponding frequency matrix F. Based on this graph-theoretic notion and the corresponding frequency matrix, we are able to illustrate and compute the frequency distribution of co-occurring tags.

Fig. 3. Undirected, valued graph of the vocabulary V including 5 tags

Fig. 4. Corresponding frequency matrix F of the vocabulary V (rows i, columns j)

4.1 Method

A frequency matrix F(δ_1) is built within a given period of time.
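As an illustration of this first step, the following minimal Python sketch counts f(i, j) over all indexing tasks inside a window δ. It is not the authors' implementation. The record layout (a tuple whose last element is a list of tags), the half-open window `start <= timestamp < end`, and the names `cooccurrence` and `f` are assumptions made for this example only.

```python
from collections import Counter
from itertools import combinations

def cooccurrence(tasks, start, end):
    """Count f(i, j) for all indexing tasks with start <= timestamp < end.

    Each task is assumed to be (user, url, timestamp, tags), with tags a list.
    """
    freq = Counter()
    for user, url, timestamp, tags in tasks:
        if not (start <= timestamp < end):
            continue
        # Every unordered pair of distinct tags in one task counts once.
        for a, b in combinations(sorted(set(tags)), 2):
            freq[(a, b)] += 1
    return freq

def f(freq, i, j):
    """Symmetric lookup; the diagonal f(i, i) is zero by definition."""
    if i == j:
        return 0
    return freq[(min(i, j), max(i, j))]

# Hypothetical example data; timestamps are plain integers here.
tasks = [
    ("alice", "http://example.org/a", 1, ["sports", "worldcup", "soccer"]),
    ("bob", "http://example.org/b", 2, ["soccer", "worldcup"]),
    ("carol", "http://example.org/c", 9, ["html", "css"]),
]
F = cooccurrence(tasks, start=0, end=5)
```

Storing only the pairs that actually occur keeps the structure sparse, which seems reasonable because most tag pairs never co-occur. A dense symmetric matrix would be an equally valid representation of F.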
Afterwards, the frequency values f(i, j) for each tag T_i are summed up. These cumulative frequency rates are then ranked by size and confined by a limit L. This eliminates tags T_i with low cumulative frequency values of co-occurring tags T_j^i, because they cannot contribute any meaningful values for co-occurring tags and are therefore not relevant for further calculations. Then, N tags T_i each with maximum (N_max), medium (N_med) and minimum (N_min) cumulative frequency rates are identified. For each tag T_i in this selection, the frequency distribution of all co-occurring tags T_j^i is calculated and ranked by size. Finally, the values of these frequency distributions are normalized and used to conduct a curve estimation regression based on the power model, whose equation is f(r) = β_0 r^{β_1}, with f(r) estimating the frequency rate depending on the frequency rank r of a tag T_j^i. The results of the regression statistics are used to test hypothesis 1.

The aforementioned N tags T_i with maximum (N_max), medium (N_med) and minimum (N_min) cumulative frequency rates are also used to test the second hypothesis. The frequency rates for each pair of co-occurring tags are recorded as a time series over I iterations, each lasting δ. To measure the stability between co-occurring tags T_i and T_j^i, the deviation D from the mean frequency of each tag co-occurrence in N_max, N_med and N_min is calculated over all I iterations, so that the groups can be compared afterwards.

4.2 Empirical Data

The empirical data for the analysis was extracted from the collaborative indexing systems Delicious, Connotea and CiteULike. This information is freely accessible. Indexing rights are based on a free-for-all principle [2], which supports the requirements of self-organizing systems by reducing external restrictions and forces. The content is of a textual nature in each case.
The selected collaborative indexing systems differ in the size of the community, the quantity of indexing tasks, the number of tags and the period of time in which the data was gathered. Furthermore, all indexing systems use a bag-model to aggregate tags, which is essential for our approach as mentioned in Sect. 3. Table 1 provides detailed information about the empirical data.

Table 1. Empirical data
Indexing system | Delicious | Connotea | CiteULike
Period of measurement | 9/1/6 1/1/6 | 1/1/6 1/1/6 | 9/17/6 1/1/6
Indexing tasks (It) | | |
It incl. at least tags | (6%) | (61%) | 43 (5%)
Tags incl. doublets | | |
Distinct tags | (11%) | (17%) | (37%)
Distinct users | | |
Users with at least It | (58%) | 7 (69%) | 48 (64%)
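The ranking, selection and curve-estimation steps of Sect. 4.1 might be sketched as follows. This is only an illustration under stated assumptions: the limit L is read here as the number of top-ranked tags that are kept (the text does not say whether L is a rank or a frequency threshold), the medium group is taken from the middle of the retained ranking, and the power model is fitted by ordinary least squares on log-transformed values as a stand-in for the statistics package used by the authors. The function names `neighbours`, `select_tags` and `fit_power` are hypothetical.

```python
import math
from collections import defaultdict

def neighbours(freq):
    """Map each tag T_i to {T_j: f(i, j)} from pair counts {(i, j): count}."""
    nb = defaultdict(dict)
    for (a, b), count in freq.items():
        nb[a][b] = count
        nb[b][a] = count
    return nb

def select_tags(nb, limit, n):
    """Rank tags by cumulative frequency, keep the top `limit` tags
    (assumed meaning of L), and return n tags each from the top,
    the middle and the bottom of that ranking."""
    ranked = sorted(nb, key=lambda t: sum(nb[t].values()), reverse=True)[:limit]
    mid = max((len(ranked) - n) // 2, 0)
    return ranked[:n], ranked[mid:mid + n], ranked[-n:]

def fit_power(values):
    """Fit f(r) = b0 * r**b1 to normalized, ranked values.

    Uses least squares on ln f(r) = ln b0 + b1 * ln r and returns
    (b0, b1, r_squared). Needs at least two positive values.
    """
    ranked = sorted(values, reverse=True)
    total = sum(ranked)
    xs = [math.log(r) for r in range(1, len(ranked) + 1)]
    ys = [math.log(v / total) for v in ranked]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    syy = sum((y - my) ** 2 for y in ys)
    b1 = sxy / sxx
    b0 = math.exp(my - b1 * mx)
    r2 = (sxy * sxy) / (sxx * syy) if syy else 1.0
    return b0, b1, r2

# Hypothetical pair counts for four tags.
pairs = {("a", "b"): 5, ("a", "c"): 3, ("b", "c"): 1, ("c", "d"): 1}
n_max, n_med, n_min = select_tags(neighbours(pairs), limit=4, n=1)

# Synthetic values that follow an exact power law with exponent -1.
b0, b1, r2 = fit_power([100 / r for r in range(1, 21)])
```

For a tag T_i from one of the three groups, `fit_power(neighbours(pairs)[T_i].values())` would give the exponent β_1 and the goodness of fit for its co-occurring tags T_j^i, provided that tag has at least two co-occurring tags.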

5 Results

5.1 Hypothesis 1: Power Law Distribution of Co-occurring Tags

The ranked frequency distribution f(r) of the tags T_j^i co-occurring with a tag T_i is illustrated in Fig. 5. A power law distribution is clearly apparent in the shared vocabulary, as is to be expected from a broad folksonomy like Delicious. There are many tags T_j^i with low frequency rates of co-occurrence and few with very high frequency rates. This result is corroborated by the discrete co-occurrence distribution in Fig. 6. There is a remarkable gap between tags that co-occur with only one single tag and tags that co-occur with multiple other tags. Towards the high co-occurrence rates the curve decreases rapidly, as the logarithmic scale demonstrates. Frequency rates of co-occurring tags above 1 lead to absolute frequency rates of less than ten. Similar results are obtained for the collaborative indexing systems Connotea and CiteULike.

The visual observations in Figs. 5 and 6 can be confirmed statistically. Table 2 shows the median of squared reliability (R^2), the median degree of freedom (F) and the exponent β_1 of the power law curve estimation (see footnote 9) over all corresponding tags within N_max, N_med and N_min. Compared with other curve estimation models, the power model performed best by far. In particular, data from Delicious with 7 co-occurring tags T_j^i as a median degree of freedom shows a remarkable reliability of .96 with a standard deviation of only .1 for N_max. The lower reliability values lie near .8, which is still acceptable, although their standard deviations are larger. Additionally, the data of Delicious and Connotea show that the magnitude of the exponent β_1 decreases together with the degree of freedom. This can be attributed to a flatter power law curve when fewer tags co-occur with a tag T_i.
For this reason, the CiteULike data, with its relatively low degree of freedom, can be disregarded when identifying the aforementioned effect. Based on these facts, the first hypothesis is supported by the empirical data. A power law curve of co-occurring tags is evident for tags T_i with high frequency rates, whereas the co-occurrences of middle and low ranked tags T_i show more dynamics.

Footnote 9: Statistical software used for curve estimation: SPSS, Version 15.0.1, SPSS Inc., Chicago, USA

Fig. 5. Extract from the ranked frequency rates f(r) of T_j^i tags co-occurring with 3 tags T_i (different symbols) based on Delicious, δ_1: 9/1/6 9/19/6 (axes: f(r) against r)

Fig. 6. Discrete co-occurrence distribution of tags T_i and T_j^i from Fig. 5 and the corresponding frequency values (axes: frequency against discrete co-occurrence distribution)

Table 2. Curve estimation regression statistics based on the power law f(r) = β_0 r^{β_1}; R^2: median of squared reliability, F: median degree of freedom; standard deviations are given in brackets
Indexing system | Delicious | Connotea | CiteULike
N / L | 3 / 5 | 3 / 5 | / 1
δ_1 | 9/1/6 9/19/6 | 1/1/6 9/5/6 | 9/15/6 9/5/6
R^2 / F / β_1 for N_max | .96 (.1) / 7 / -.96 (.8) | .93 (.1) / 638 / -.8 (.8) | .81 (.1) / 58 / -.46 (.18)
R^2 / F / β_1 for N_med | .8 (.8) / 15 / -.51 (.1) | .86 (.6) / 88 / -.58 (.9) | .79 (.1) / 3 / -.47 (.3)
R^2 / F / β_1 for N_min | .78 (.11) / 73 / -.4 (.1) | .84 (.11) / 55 / -.53 (.8) | .8 (.) / / -.65 (.4)

5.2 Hypothesis 2: Relation between Rank and Persistence of Tags over Time

Figure 7 shows the dynamics over time of co-occurring tags in terms of their deviations from the mean frequency. As illustrated, co-occurring tags with high frequency rates (values from N_max T1 5) are more stable and show less scatter than co-occurring tags from N_med or N_min. Figure 8 contrasts the relative frequencies of two co-occurring tags from N_max and N_med over 3 iterations with δ = 1 day. This figure illustrates that the interval of N_med shows higher variations than the interval of N_max. The basic data from all examined collaborative indexing systems, with the average deviation D from the mean frequency values over N_max, N_med and N_min, is shown in Table 3.

The results show that δ and the number of indexing tasks within δ affect the dynamics. The dynamics in Connotea and CiteULike are much lower than those in Delicious. This often causes co-occurring tags to appear only to a small degree over all iterations, so that they stabilize only at a low level with low frequency rates. Thus, a small deviation from the average frequency rate over all iterations does not always indicate a high position of co-occurring tags on the power law curve. There are also situations where tags stabilize at low frequency rates. Stability alone is therefore not a sufficient criterion for the occurrence of high frequencies. Besides the deviation, the absolute frequency rates must be considered to position a tag on the power law curve. Consequently, high frequency rates imply persistence over time, but persistence does not imply high frequency rates.
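The deviation D is not defined by a formula in the text. One plausible reading, used in the sketch below purely as an assumption, is the average absolute deviation of a co-occurrence time series from its own mean, expressed as a percentage of that mean and then averaged over all tag pairs of a group. The function names are hypothetical.

```python
def deviation_from_mean(series):
    """Average absolute deviation from the mean, relative to the mean, in percent.

    `series` holds the frequency of one tag pair in each of the I iterations.
    This definition of D is an assumption, not taken from the paper.
    """
    mean = sum(series) / len(series)
    if mean == 0:
        return 0.0
    return 100.0 * sum(abs(x - mean) for x in series) / (len(series) * mean)

def group_deviation(series_by_pair):
    """Average D over all tag pairs of one group, e.g. N_max."""
    values = [deviation_from_mean(s) for s in series_by_pair.values()]
    return sum(values) / len(values)

# Hypothetical time series over three iterations.
stable_high = [100, 100, 100]   # frequent and stable
stable_low = [1, 1, 1]          # rare but equally stable
volatile = [5, 15]              # fluctuates around a mean of 10
```

Under this reading, `stable_high` and `stable_low` both yield D = 0, which mirrors the observation above: a small deviation alone does not reveal whether a pair sits high or low on the power law curve, so the absolute frequencies have to be inspected as well.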
A change of that persistence is therefore only significant for a shared vocabulary if the deviation from the average value appears at high frequency rates. Nevertheless, the second hypothesis is also supported by the figures of Table 3, although with less explanatory power compared to the findings for hypothesis 1.

Table 3. Deviation over the average frequency
Indexing system | Delicious | Connotea | CiteULike
N / L | 5 / 3 | 3 / 1 | 15 / 5
δ | 1 day, 9/1/6 9/3/6 | 1 month, 1/1/6 8/31/6 | 1 day, 9/16/6 9/3/6
D(N_max) | 14.1% | .% | 15.1%
D(N_med) | 19.4% | 4.% | 16.1%
D(N_min) | 19.8% | 7.% | 16.7%

Fig. 7. Deviation over the average frequency in % of 75 tags T_i and all related tags T_j^i for N T 1 5 max, N T 51 5 med and N T min based on Delicious, between 9/1/6 9/3/6 (axes: deviation over the average frequency in % against T_i)

Fig. 8. Relative frequency comparison in % of 1 high (dashed) and low positioned tag T_i from Fig. 5 with δ = 1 day and 3 iterations based on Delicious, between 9/1/6 9/3/6 (axes: relative frequency in % against day)

5.3 Implications

As shown in Sect. 5.1, we observe a frequency distribution of co-occurring tags which follows a power law curve. The examined collaborative indexing systems support a distributed approach without central control mechanisms, favoring self-organization. The observed distribution of tags means that the users have a strong consensus at least on a particular subpart of the shared vocabulary, since co-occurring tags with high frequency rates build a semantic domain. This shows some sort of tag economics within a collaborative indexing system.

A further aspect indicating a self-organizing system is the resilience of the system. Accidental errors (e.g. typos) or willful sabotage of the system by users have negligible effects, because single users cannot tip the scales of a power law curve. In addition, indexing support through pre-defined tags, which has been suggested as a way to consolidate tag usage [11, ], would further support these findings by reducing the limitations of the uncontrolled vocabulary, such as polysemy, synonymy/uniformity and basic level variation problems [1, ]. Hence, we expect higher frequency rates among the top ranked tags and lower rates among the low ranked ones as fundamental effects of indexing support.

Another feature of a self-organizing system is the adaptation to environmental changes. In terms of a collaborative indexing system, these changes can be understood as a shift of the community's interest, which is reflected in a structural change of the vocabulary. Hence, semantic domains based upon co-occurring tags with high frequency rates may change. For instance, if the position of a tag T_j^i changes over time through an increase or decrease of its frequency rate with respect to T_i, then this suggests a structural change within the vocabulary and vice versa. The higher the position of this tag on the power law curve, the more significant is the structural change of the vocabulary.
When this dynamic information is monitored, one can observe the historical development of, and emerging trends in, the vocabulary based upon the time-stamps of selected indexing tasks. Those trend curves of the vocabulary suggest changes within the community's interest and are useful for the individual user when searching for content elements, users or tags in the time domain.

6 Conclusion and Future Work

In this paper we studied structural patterns of user-generated vocabularies within the free-for-all collaborative indexing systems Delicious, Connotea and CiteULike. The theory of self-organizing systems was applied to hypothesize patterns within those vocabularies. We built a model based on graph-theoretic notions consisting of tags and their valued connections. This was required to calculate the frequency distribution of tags that co-occur with others, as well as to relate those tags to their frequency rate over time. The results indicate that only a few co-occurring tags exist with high frequency rates and many with low frequency rates, thus following a power law curve. In addition, co-occurring tags with high frequency rates proved to be more stable over time than those with low rates. The results also depended on the quantity of indexing tasks. For instance, the measured values of CiteULike yielded less explanatory power than the values of Delicious. Two implications follow: the presence of semantic domains, which are based on co-occurring tags with high frequency rates, and a shift of common interests among the user community if those high rates change fundamentally over time. The resulting information can be used to trace the historical development of, and emerging trends in, the vocabulary. It would be useful not only for the individual user but also for enterprises developing products and services that depend on, or at least involve, the interests and trends of online communities.

Building on the current work, the development of algorithms for trend information and historical time series based on the frequency distribution of co-occurring tags is an interesting area for further research. A common understanding of the user community is expressed through the tag network comprised of valued links with high frequency rates. In addition, semantic domains of more than two co-occurring tags can be identified with techniques of social network analysis such as centrality measurements or clustering. This network changes dynamically in a self-organizing way over time, suggesting new topics or events of a social, academic, technical or economic nature. Defining triggers on the observed power law curve to identify those variances requires further clarification, but would be very useful in supporting users, enterprises or public organisations in upcoming decisions. Moreover, collaborative indexing systems with low indexing rates within a given period of time remain a challenge. This applies especially to systems deployed in companies. It is therefore essential to find techniques that permit greater vocabulary coherence in such minimal systems and boost the significance of the common understanding.

References

1. Heylighen, F.: The science of self-organization and adaptivity. In: The Encyclopedia of Life Support Systems, Oxford, UK, Eolss Publishers (1999)
2. Marlow, C., Naaman, M., Boyd, D., Davis, M.: HT06, tagging paper, taxonomy, Flickr, academic article, to read. In: HYPERTEXT '06: Proceedings of the seventeenth conference on Hypertext and Hypermedia, New York, ACM Press (2006)
3. Voss, J.: Tagging, folksonomy & co - renaissance of manual indexing? ArXiv Computer Science e-prints (January 2007)
4. Mathes, A.: Folksonomies - cooperative classification and communication through shared metadata. Technical report, Graduate School of Library and Information Science, University of Illinois (December 2004)
5. Vander Wal, T.: Explaining and showing broad and narrow folksonomies (February 2005)
6. Voss, J.: Collaborative thesaurus tagging the Wikipedia way. ArXiv Computer Science e-prints (April 2006)
7. Hotho, A., Jäschke, R., Schmitz, C., Stumme, G.: Information retrieval in folksonomies: Search and ranking. In Sure, Y., Domingue, J., eds.: The Semantic Web: Research and Applications. Volume 4011 of LNAI, Heidelberg, Springer (June 2006)

8. Quintarelli, E.: Folksonomies: power to the people. Incontro ISKO Italia - UniMIB Meeting (June 2005)
9. Lund, B., Hammond, T., Flack, M., Hannay, T.: Social bookmarking tools (ii): A case study - Connotea. D-Lib Magazine 11(4) (April 2005)
10. Golder, S.A., Huberman, B.A.: Usage patterns of collaborative tagging systems. Journal of Information Science 32(2) (April 2006)
11. Zhichen, X., Yun, F., Jianchang, M., Difu, S.: Towards the semantic web: Collaborative tag suggestions. In: Collaborative Web Tagging Workshop, WWW 2006, 15th International World Wide Web Conference, Edinburgh, IW3C2 (May 2006)
12. Wu, H., Zubair, M., Maly, K.: Harvesting social knowledge from folksonomies. In: HYPERTEXT '06: Proceedings of the seventeenth conference on Hypertext and Hypermedia, New York, ACM Press (2006)
13. Millen, D.R., Feinberg, J., Kerr, B.: Dogear: Social bookmarking in the enterprise. In: CHI '06: Proceedings of the SIGCHI conference on Human Factors in computing systems, New York, ACM Press (2006)
14. Damianos, L., Griffith, J., Cuomo, D.: Onomi: Social bookmarking on a corporate intranet. In: Collaborative Web Tagging Workshop, WWW 2006, 15th International World Wide Web Conference, Edinburgh, IW3C2 (May 2006)
15. Farrell, S., Lau, T.: Fringe contacts: People-tagging for the enterprise. In: Collaborative Web Tagging Workshop, WWW 2006, 15th International World Wide Web Conference, Edinburgh, IW3C2 (May 2006)
16. John, A., Seligmann, D.: Collaborative tagging and expertise in the enterprise. In: Collaborative Web Tagging Workshop, WWW 2006, 15th International World Wide Web Conference, Edinburgh, IW3C2 (May 2006)
17. Hammond, T., Hannay, T., Lund, B., Scott, J.: Social bookmarking tools (i): A general review. D-Lib Magazine 11(4) (April 2005)
18. Heymann, P., Garcia-Molina, H.: Collaborative creation of communal hierarchical taxonomies in social tagging systems. Technical report, Computer Science Department, Stanford University (April 2006)
19. Mika, P.: Ontologies are us: A unified model of social networks and semantics. In: 4th International Semantic Web Conference (ISWC 2005). (2005)
20. Cattuto, C., Loreto, V., Pietronero, L.: Collaborative tagging and semiotic dynamics. ArXiv Computer Science e-prints (May 2006)
21. Dubinko, M., Kumar, R., Magnani, J., Novak, J., Raghavan, P., Tomkins, A.: Visualizing tags over time. In: WWW '06: Proceedings of the 15th international conference on World Wide Web, New York, ACM Press (2006) 193
22. Furnas, G.W., Landauer, T.K., Gomez, L.M., Dumais, S.T.: The vocabulary problem in human-system communication. Communications of the ACM 30(11) (1987)


More information

Open Research Online The Open University s repository of research publications and other research outputs

Open Research Online The Open University s repository of research publications and other research outputs Open Research Online The Open University s repository of research publications and other research outputs Social Web Communities Conference or Workshop Item How to cite: Alani, Harith; Staab, Steffen and

More information

Keywords. Collaborative tagging, knowledge discovery, Web 2.0, social bookmarking. 1. Introduction

Keywords. Collaborative tagging, knowledge discovery, Web 2.0, social bookmarking. 1. Introduction Krešimir Zauder, Jadranka Lasi Lazi, Mihaela Banek Zorica Faculty of humanities and social sciences, Department of information sciences Ivana Lu i a 3, 10000 Zagreb Croatia kzauder@ffzg.hr, jlazic@ffzg.hr,

More information

Sloppy Tags and Metacrap? Quality of User Contributed Tags in Collaborative Social Tagging Systems

Sloppy Tags and Metacrap? Quality of User Contributed Tags in Collaborative Social Tagging Systems Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2009 Proceedings Americas Conference on Information Systems (AMCIS) 2009 Sloppy Tags and Metacrap? Quality of User Contributed Tags

More information

Classifying Users and Identifying User Interests in Folksonomies

Classifying Users and Identifying User Interests in Folksonomies Classifying Users and Identifying User Interests in Folksonomies Elias Zavitsanos 1, George A. Vouros 1, and Georgios Paliouras 2 1 Department of Information and Communication Systems Engineering University

More information

Towards Social Semantic Suggestive Tagging

Towards Social Semantic Suggestive Tagging Towards Social Semantic Suggestive Tagging Fabio Calefato, Domenico Gendarmi, Filippo Lanubile University of Bari, Dipartimento di Informatica, Via Orabona, 4, 70126 - Bari, Italy {calefato,gendarmi,lanubile}@di.uniba.it

More information

ConTag: A Semantic Tag Recommendation System

ConTag: A Semantic Tag Recommendation System Proceedings of I-MEDIA 07 and I-SEMANTICS 07 Graz, Austria, September 5-7, 2007 ConTag: A Semantic Tag Recommendation System Benjamin Adrian 1,2, Leo Sauermann 2, Thomas Roth-Berghofer 1,2 1 (Knowledge-Based

More information

HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging

HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging 007 International Conference on Convergence Information Technology HFCT: A Hybrid Fuzzy Clustering Method for Collaborative Tagging Lixin Han,, Guihai Chen Department of Computer Science and Engineering,

More information

ANNUAL REPORT Visit us at project.eu Supported by. Mission

ANNUAL REPORT Visit us at   project.eu Supported by. Mission Mission ANNUAL REPORT 2011 The Web has proved to be an unprecedented success for facilitating the publication, use and exchange of information, at planetary scale, on virtually every topic, and representing

More information

Collaborative Tagging: Providing User Created Organizational Structure for Web 2.0

Collaborative Tagging: Providing User Created Organizational Structure for Web 2.0 Pregledni članak Collaborative Tagging: Providing User Created Organizational Structure for Web 2.0 Mihaela Banek Zorica Department of Information Sciences Faculty of Humanities and Social Sciences Ivana

More information

THE DYNAMICS OF COLLABORATIVE TAGGING: AN ANALYSIS OF TAG VOCABULARY APPLICATION IN KNOWLEDGE REPRESENTATION, DISCOVERY AND RETRIEVAL 1.

THE DYNAMICS OF COLLABORATIVE TAGGING: AN ANALYSIS OF TAG VOCABULARY APPLICATION IN KNOWLEDGE REPRESENTATION, DISCOVERY AND RETRIEVAL 1. ASAC 2009 Niagara Falls, Ontario Joyline Makani (student) Louise Spiteri Faculty of Management Dalhousie University THE DYNAMICS OF COLLABORATIVE TAGGING: AN ANALYSIS OF TAG VOCABULARY APPLICATION IN KNOWLEDGE

More information

Telling Experts from Spammers Expertise Ranking in Folksonomies

Telling Experts from Spammers Expertise Ranking in Folksonomies 32 nd Annual ACM SIGIR 09 Boston, USA, Jul 19-23 2009 Telling Experts from Spammers Expertise Ranking in Folksonomies Michael G. Noll (Albert) Ching-Man Au Yeung Christoph Meinel Nicholas Gibbins Nigel

More information

Tag Recommendations in Folksonomies

Tag Recommendations in Folksonomies Tag Recommendations in Folksonomies Robert Jäschke 1, Leandro Marinho 2, Andreas Hotho 1, Lars Schmidt-Thieme 2 and Gerd Stumme 1 1: Knowledge & Data Engineering Group (KDE), University of Kassel, Wilhelmshöher

More information

The impact of resource title on tags in collaborative tagging systems

The impact of resource title on tags in collaborative tagging systems The impact of resource title on tags in collaborative tagging systems ABSTRACT Marek Lipczak Faculty of Computer Science Dalhousie University Halifax, Canada, B3H W5 lipczak@cs.dal.ca Collaborative tagging

More information

A Case Study on Emergent Semantics in Communities

A Case Study on Emergent Semantics in Communities A Case Study on Emergent Semantics in Communities Elke Michlmayr Women s Postgraduate College for Internet Technologies (WIT), Institute for Software Technology and Interactive Systems Vienna University

More information

Personalized Information Access in a Wiki Using Structured Tagging

Personalized Information Access in a Wiki Using Structured Tagging Personalized Information Access in a Wiki Using Structured Tagging AnmolV.Singh,AndreasWombacher, and Karl Aberer School of Computer and Communication Sciences, École Polytechnique Fédérale de Lausanne

More information

@toread and Cool: Subjective, Affective and Associative Factors in Tagging

@toread and Cool: Subjective, Affective and Associative Factors in Tagging @toread and Cool: Subjective, Affective and Associative Factors in Tagging Margaret E. I. Kipp Palmer School of Library and Information Science College of Information and Computer Science Long Island University

More information

Semantic Annotation for Semantic Social Networks. Using Community Resources

Semantic Annotation for Semantic Social Networks. Using Community Resources Semantic Annotation for Semantic Social Networks Using Community Resources Lawrence Reeve and Hyoil Han College of Information Science and Technology Drexel University, Philadelphia, PA 19108 lhr24@drexel.edu

More information

Author Keywords Search behavior, domain expertise, exploratory search.

Author Keywords Search behavior, domain expertise, exploratory search. Exploiting Knowledge-in-the-head and Knowledge-in-thesocial-web: Effects of Domain Expertise on Exploratory Search in Individual and Social Search Environments Ruogu Kang, Wai-Tat Fu, & Thomas George Kannampallil

More information

Use of Content Tags in Managing Advertisements for Online Videos

Use of Content Tags in Managing Advertisements for Online Videos Use of Content Tags in Managing Advertisements for Online Videos Chia-Hsin Huang, H. T. Kung, Chia-Yung Su Harvard School of Engineering and Applied Sciences, Cambridge, MA 02138, USA {jashing, htk, cysu}@eecs.harvard.edu

More information

Search Computing: Business Areas, Research and Socio-Economic Challenges

Search Computing: Business Areas, Research and Socio-Economic Challenges Search Computing: Business Areas, Research and Socio-Economic Challenges Yiannis Kompatsiaris, Spiros Nikolopoulos, CERTH--ITI NEM SUMMIT Torino-Italy, 28th September 2011 Media Search Cluster Search Computing

More information

HYBRIDIZED MODEL FOR EFFICIENT MATCHING AND DATA PREDICTION IN INFORMATION RETRIEVAL

HYBRIDIZED MODEL FOR EFFICIENT MATCHING AND DATA PREDICTION IN INFORMATION RETRIEVAL International Journal of Mechanical Engineering & Computer Sciences, Vol.1, Issue 1, Jan-Jun, 2017, pp 12-17 HYBRIDIZED MODEL FOR EFFICIENT MATCHING AND DATA PREDICTION IN INFORMATION RETRIEVAL BOMA P.

More information

GroupMe! - Where Semantic Web meets Web 2.0

GroupMe! - Where Semantic Web meets Web 2.0 GroupMe! - Where Semantic Web meets Web 2.0 Fabian Abel, Mischa Frank, Nicola Henze, Daniel Krause, Daniel Plappert, and Patrick Siehndel IVS Semantic Web Group, University of Hannover, Hannover, Germany

More information

Rules for Inducing Hierarchies from Social Tagging Data

Rules for Inducing Hierarchies from Social Tagging Data Rules for Inducing Hierarchies from Social Tagging Data iconference 2018, Sheffield, UK, March 25-28, 2018 Hang Dong, Wei Wang, Frans Coenen Department of Computer Science, University of Liverpool Social

More information

Exploring Social Annotations for Web Document Classification

Exploring Social Annotations for Web Document Classification Exploring Social Annotations for Web Document Classification ABSTRACT Michael G. Noll Hasso-Plattner-Institut, University of Potsdam 14440 Potsdam, Germany michael.noll@hpi.uni-potsdam.de Social annotation

More information

Power Tags as Tools for Social Knowledge Organization Systems

Power Tags as Tools for Social Knowledge Organization Systems Power Tags as Tools for Social Knowledge Organization Systems Isabella Peters Abstract Web services are popular which allow users to collaboratively index and describe web resources with folksonomies.

More information

int.ere.st: Building a Tag Sharing Service with the SCOT Ontology

int.ere.st: Building a Tag Sharing Service with the SCOT Ontology int.ere.st: Building a Tag Sharing Service with the SCOT Ontology HakLae Kim, John G. Breslin Digital Enterprise Research Institute National University of Ireland, Galway IDA Business Park, Lower Dangan

More information

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany

<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany Information Systems University of Koblenz Landau, Germany On Understanding the Sharing of Conceptualizations, Klaas Dellschaft What is an ontology? Gruber 93 (slightly adapted by Borst): An Ontology is

More information

Drawing Bipartite Graphs as Anchored Maps

Drawing Bipartite Graphs as Anchored Maps Drawing Bipartite Graphs as Anchored Maps Kazuo Misue Graduate School of Systems and Information Engineering University of Tsukuba 1-1-1 Tennoudai, Tsukuba, 305-8573 Japan misue@cs.tsukuba.ac.jp Abstract

More information

Development of an Ontology-Based Portal for Digital Archive Services

Development of an Ontology-Based Portal for Digital Archive Services Development of an Ontology-Based Portal for Digital Archive Services Ching-Long Yeh Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd. 3rd Sec. Taipei, 104, Taiwan chingyeh@cse.ttu.edu.tw

More information

Exploring Characteristics of Social Classification

Exploring Characteristics of Social Classification Wayne State University School of Library and Information Science Faculty Research Publications School of Library and Information Science 11-4-2006 Exploring Characteristics of Social Classification Xia

More information

arxiv: v1 [cs.dl] 23 Feb 2012

arxiv: v1 [cs.dl] 23 Feb 2012 Analyzing Tag Distributions in Folksonomies for Resource Classification Arkaitz Zubiaga, Raquel Martínez, and Víctor Fresno arxiv:1202.5477v1 [cs.dl] 23 Feb 2012 NLP & IR Group @ UNED Abstract. Recent

More information

Learning User Profiles from Tagging Data and Leveraging them for Personal(ized) Information Access

Learning User Profiles from Tagging Data and Leveraging them for Personal(ized) Information Access Learning User Profiles from Tagging Data and Leveraging them for Personal(ized) Information Access ABSTRACT Elke Michlmayr Women s Postgraduate College for Internet Technologies (WIT), Vienna University

More information

Web Science Your Business, too!

Web Science Your Business, too! Web Science & Technologies University of Koblenz Landau, Germany Web Science Your Business, too! Agenda What is Web Science? An explanation by analogy What do we do about it? Understanding collective effects

More information

An Improvement of Search Results Access by Designing a Search Engine Result Page with a Clustering Technique

An Improvement of Search Results Access by Designing a Search Engine Result Page with a Clustering Technique An Improvement of Search Results Access by Designing a Search Engine Result Page with a Clustering Technique 60 2 Within-Subjects Design Counter Balancing Learning Effect 1 [1 [2www.worldwidewebsize.com

More information

Small World Folksonomies: Clustering in Tri-Partite Hypergraphs

Small World Folksonomies: Clustering in Tri-Partite Hypergraphs Small World Folksonomies: Clustering in Tri-Partite Hypergraphs Christoph Schmitz November 30, 2006 Abstract Many recent Web 2.0 resource sharing applications can be subsumed under the folksonomy moniker.

More information

highest cosine coecient [5] are returned. Notice that a query can hit documents without having common terms because the k indexing dimensions indicate

highest cosine coecient [5] are returned. Notice that a query can hit documents without having common terms because the k indexing dimensions indicate Searching Information Servers Based on Customized Proles Technical Report USC-CS-96-636 Shih-Hao Li and Peter B. Danzig Computer Science Department University of Southern California Los Angeles, California

More information

Semantic Web Systems Ontologies Jacques Fleuriot School of Informatics

Semantic Web Systems Ontologies Jacques Fleuriot School of Informatics Semantic Web Systems Ontologies Jacques Fleuriot School of Informatics 15 th January 2015 In the previous lecture l What is the Semantic Web? Web of machine-readable data l Aims of the Semantic Web Automated

More information

Knowledge Discovery and Data Mining

Knowledge Discovery and Data Mining Knowledge Discovery and Data Mining Computer Science 591Y Department of Computer Science University of Massachusetts Amherst February 3, 2005 Topics Tasks (Definition, example, and notes) Classification

More information

Facilitating Exploratory Search by Model-Based Navigational Cues

Facilitating Exploratory Search by Model-Based Navigational Cues Facilitating Exploratory Search by Model-Based Navigational Cues Wai-Tat Fu, Thomas G. Kannampallil, and Ruogu Kang Applied Cognitive Science Lab University of Illinois at Urbana-Champaign 405 N. Mathews

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

TGI Modules for Social Tagging System

TGI Modules for Social Tagging System TGI Modules for Social Tagging System Mr. Tambe Pravin M. Prof. Shamkuwar Devendra O. M.E. (2 nd Year) Department Of Computer Engineering Department Of Computer Engineering SPCOE, Otur SPCOE, Otur Pune,

More information

Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005

Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005 Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005 Abstract Deciding on which algorithm to use, in terms of which is the most effective and accurate

More information

A Content-Based Method to Enhance Tag Recommendation

A Content-Based Method to Enhance Tag Recommendation Proceedings of the Twenty-First International Joint Conference on Artificial Intelligence (IJCAI-09) A Content-Based Method to Enhance Tag Recommendation Yu-Ta Lu, Shoou-I Yu, Tsung-Chieh Chang, Jane Yung-jen

More information

Tag Based Image Search by Social Re-ranking

Tag Based Image Search by Social Re-ranking Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule

More information

Can Social Tags Help You Find What You Want?

Can Social Tags Help You Find What You Want? Razikin, K., Goh, D.H., Chua, A., and Lee, C.S. (2008) Can social tags help you find what you want? In proceedings of the 12th European Conference on Research and Advanced Technology for Digital Libraries

More information

The Semantics of Semantic Interoperability: A Two-Dimensional Approach for Investigating Issues of Semantic Interoperability in Digital Libraries

The Semantics of Semantic Interoperability: A Two-Dimensional Approach for Investigating Issues of Semantic Interoperability in Digital Libraries The Semantics of Semantic Interoperability: A Two-Dimensional Approach for Investigating Issues of Semantic Interoperability in Digital Libraries EunKyung Chung, eunkyung.chung@usm.edu School of Library

More information

Opus: University of Bath Online Publication Store

Opus: University of Bath Online Publication Store Patel, M. (2004) Semantic Interoperability in Digital Library Systems. In: WP5 Forum Workshop: Semantic Interoperability in Digital Library Systems, DELOS Network of Excellence in Digital Libraries, 2004-09-16-2004-09-16,

More information

Oleksandr Kuzomin, Bohdan Tkachenko

Oleksandr Kuzomin, Bohdan Tkachenko International Journal "Information Technologies Knowledge" Volume 9, Number 2, 2015 131 INTELLECTUAL SEARCH ENGINE OF ADEQUATE INFORMATION IN INTERNET FOR CREATING DATABASES AND KNOWLEDGE BASES Oleksandr

More information

Content-based Dimensionality Reduction for Recommender Systems

Content-based Dimensionality Reduction for Recommender Systems Content-based Dimensionality Reduction for Recommender Systems Panagiotis Symeonidis Aristotle University, Department of Informatics, Thessaloniki 54124, Greece symeon@csd.auth.gr Abstract. Recommender

More information

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2 Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 1907-1911 1907 Web-Based Data Mining in System Design and Implementation Open Access Jianhu

More information

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN: Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,

More information

RSDC 09: Tag Recommendation Using Keywords and Association Rules

RSDC 09: Tag Recommendation Using Keywords and Association Rules RSDC 09: Tag Recommendation Using Keywords and Association Rules Jian Wang, Liangjie Hong and Brian D. Davison Department of Computer Science and Engineering Lehigh University, Bethlehem, PA 18015 USA

More information

Kristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok

Kristina Lerman University of Southern California. This lecture is partly based on slides prepared by Anon Plangprasopchok Kristina Lerman University of Southern California This lecture is partly based on slides prepared by Anon Plangprasopchok Social Web is a platform for people to create, organize and share information Users

More information

Ermergent Semantics in BibSonomy

Ermergent Semantics in BibSonomy Ermergent Semantics in BibSonomy Andreas Hotho Robert Jäschke Christoph Schmitz Gerd Stumme Applications of Semantic Technologies Workshop. Dresden, 2006-10-06 Agenda Introduction Folksonomies BibSonomy

More information

Recent Design Optimization Methods for Energy- Efficient Electric Motors and Derived Requirements for a New Improved Method Part 3

Recent Design Optimization Methods for Energy- Efficient Electric Motors and Derived Requirements for a New Improved Method Part 3 Proceedings Recent Design Optimization Methods for Energy- Efficient Electric Motors and Derived Requirements for a New Improved Method Part 3 Johannes Schmelcher 1, *, Max Kleine Büning 2, Kai Kreisköther

More information

Organizing Information. Organizing information is at the heart of information science and is important in many other

Organizing Information. Organizing information is at the heart of information science and is important in many other Dagobert Soergel College of Library and Information Services University of Maryland College Park, MD 20742 Organizing Information Organizing information is at the heart of information science and is important

More information

Semantics to the Bookmarks: A Review of Social Semantic Bookmarking Systems

Semantics to the Bookmarks: A Review of Social Semantic Bookmarking Systems Proceedings of I-KNOW 09 and I-SEMANTICS 09 2-4 September 2009, Graz, Austria 445 Semantics to the Bookmarks: A Review of Social Semantic Bookmarking Systems Simone Braun (FZI Research Center for Information

More information

Using Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results

Using Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results Using Text Elements by Context to Display Search Results in Information Retrieval Systems Model and Research results Offer Drori SHAAM Information Systems The Hebrew University of Jerusalem offerd@ {shaam.gov.il,

More information

How Social Is Social Bookmarking?

How Social Is Social Bookmarking? How Social Is Social Bookmarking? Sudha Ram, Wei Wei IEEE International Workshop on Business Applications of Social Network Analysis (BASNA 2010) December 15 th, 2010. Bangalore, India Agenda Social Bookmarking

More information

What s New in Spotfire DXP 1.1. Spotfire Product Management January 2007

What s New in Spotfire DXP 1.1. Spotfire Product Management January 2007 What s New in Spotfire DXP 1.1 Spotfire Product Management January 2007 Spotfire DXP Version 1.1 This document highlights the new capabilities planned for release in version 1.1 of Spotfire DXP. In this

More information

Agenda. Web 2.0: User Generated Metadata. Why metadata? Introduction. Why metadata? Example for Annotations

Agenda. Web 2.0: User Generated Metadata. Why metadata? Introduction. Why metadata? Example for Annotations Web 2.0: User Generated Metadata Mathias Lux Klagenfurt University, Austria Dept. for Information Technology mlux@itec.uni-klu.ac.at Agenda Introduction Examples for User Generated Metadata What is User

More information

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2 1 Department of Electronics & Comp. Sc, RTMNU, Nagpur, India 2 Department of Computer Science, Hislop College, Nagpur,

More information

Modeling for the Web

Modeling for the Web Modeling for the Web Web 2.0 and Web 3.0 Why ontologies? Copyright 2008 STI INNSBRUCK Motivation 2 1 Motivation (cont d) File Sharing: Flickr (Images) YouTube (Videos) Wikipedia (Online Encyclopedia) Blogs

More information

Social Tagging. Kristina Lerman. USC Information Sciences Institute Thanks to Anon Plangprasopchok for providing material for this lecture.

Social Tagging. Kristina Lerman. USC Information Sciences Institute Thanks to Anon Plangprasopchok for providing material for this lecture. Social Tagging Kristina Lerman USC Information Sciences Institute Thanks to Anon Plangprasopchok for providing material for this lecture. essembly Bugzilla delicious Social Web essembly Bugzilla Social

More information

Data analysis using Microsoft Excel

Data analysis using Microsoft Excel Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data

More information

Multi-Aspect Tagging for Collaborative Structuring

Multi-Aspect Tagging for Collaborative Structuring Multi-Aspect Tagging for Collaborative Structuring Katharina Morik and Michael Wurst University of Dortmund, Department of Computer Science Baroperstr. 301, 44221 Dortmund, Germany morik@ls8.cs.uni-dortmund

More information

A service based on Linked Data to classify Web resources using a Knowledge Organisation System

A service based on Linked Data to classify Web resources using a Knowledge Organisation System A service based on Linked Data to classify Web resources using a Knowledge Organisation System A proof of concept in the Open Educational Resources domain Abstract One of the reasons why Web resources

More information

Mapping Cognitive Models to Social Semantic Spaces Collaborative Development of Project Ontologies

Mapping Cognitive Models to Social Semantic Spaces Collaborative Development of Project Ontologies Mapping Cognitive Models to Social Semantic Spaces Collaborative Development of Project Ontologies Thomas Riechert 1, Steffen Lohmann 2 1 University of Leipzig Department of Computer Science Johannisgasse

More information

Lecture 1: Introduction and Motivation Markus Kr otzsch Knowledge-Based Systems

Lecture 1: Introduction and Motivation Markus Kr otzsch Knowledge-Based Systems KNOWLEDGE GRAPHS Introduction and Organisation Lecture 1: Introduction and Motivation Markus Kro tzsch Knowledge-Based Systems TU Dresden, 16th Oct 2018 Markus Krötzsch, 16th Oct 2018 Course Tutors Knowledge

More information

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.

More information

International Journal of Scientific & Engineering Research Volume 8, Issue 5, May ISSN

International Journal of Scientific & Engineering Research Volume 8, Issue 5, May ISSN International Journal of Scientific & Engineering Research Volume 8, Issue 5, May-2017 106 Self-organizing behavior of Wireless Ad Hoc Networks T. Raghu Trivedi, S. Giri Nath Abstract Self-organization

More information

KNOWLEDGE GRAPHS. Lecture 1: Introduction and Motivation. TU Dresden, 16th Oct Markus Krötzsch Knowledge-Based Systems

KNOWLEDGE GRAPHS. Lecture 1: Introduction and Motivation. TU Dresden, 16th Oct Markus Krötzsch Knowledge-Based Systems KNOWLEDGE GRAPHS Lecture 1: Introduction and Motivation Markus Krötzsch Knowledge-Based Systems TU Dresden, 16th Oct 2018 Introduction and Organisation Markus Krötzsch, 16th Oct 2018 Knowledge Graphs slide

More information

Graph Structure Over Time

Graph Structure Over Time Graph Structure Over Time Observing how time alters the structure of the IEEE data set Priti Kumar Computer Science Rensselaer Polytechnic Institute Troy, NY Kumarp3@rpi.edu Abstract This paper examines

More information

A Comparison of Text-Categorization Methods applied to N-Gram Frequency Statistics

A Comparison of Text-Categorization Methods applied to N-Gram Frequency Statistics A Comparison of Text-Categorization Methods applied to N-Gram Frequency Statistics Helmut Berger and Dieter Merkl 2 Faculty of Information Technology, University of Technology, Sydney, NSW, Australia hberger@it.uts.edu.au

More information

AN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES

AN OVERVIEW OF SEARCHING AND DISCOVERING WEB BASED INFORMATION RESOURCES Journal of Defense Resources Management No. 1 (1) / 2010 AN OVERVIEW OF SEARCHING AND DISCOVERING Cezar VASILESCU Regional Department of Defense Resources Management Studies Abstract: The Internet becomes

More information

A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition

A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition A Multiclassifier based Approach for Word Sense Disambiguation using Singular Value Decomposition Ana Zelaia, Olatz Arregi and Basilio Sierra Computer Science Faculty University of the Basque Country ana.zelaia@ehu.es

More information

Prof. Ahmet Süerdem Istanbul Bilgi University London School of Economics

Prof. Ahmet Süerdem Istanbul Bilgi University London School of Economics Prof. Ahmet Süerdem Istanbul Bilgi University London School of Economics Media Intelligence Business intelligence (BI) Uses data mining techniques and tools for the transformation of raw data into meaningful

More information

Tag-Based Contextual Collaborative Filtering

Tag-Based Contextual Collaborative Filtering Tag-Based Contextual Collaborative Filtering Reyn Nakamoto Shinsuke Nakajima Jun Miyazaki Shunsuke Uemura Abstract In this paper, we introduce a new Collaborative Filtering (CF) model which takes into

More information

Proceedings. of the ISSI 2011 conference. 13th International Conference of the International Society for Scientometrics & Informetrics

Proceedings. of the ISSI 2011 conference. 13th International Conference of the International Society for Scientometrics & Informetrics Proceedings of the ISSI 2011 conference 13th International Conference of the International Society for Scientometrics & Informetrics Durban, South Africa, 04-07 July 2011 Edited by Ed Noyons, Patrick Ngulube

More information
