信息检索与搜索引擎 Introduction to Information Retrieval GESC1007


1 信息检索与搜索引擎 Introduction to Information Retrieval GESC1007 Philippe Fournier-Viger Full professor School of Natural Sciences and Humanities Spring

2 Last week: What is Information Retrieval ( 信息检索 )? We discussed the «Boolean retrieval model ( 布尔检索模型 )»: searching documents using terms and Boolean operators (e.g. AND, OR, NOT).

3 Course schedule ( 日程安排 )
- Lecture 1: Introduction; Boolean retrieval ( 布尔检索模型 )
- Lecture 2: Term vocabulary and posting lists
- Lecture 3: Dictionaries and tolerant retrieval
- Lecture 4: Index construction and compression
- Lecture 5: Scoring, weighting, and the vector space model
- Lecture 6: Computing scores, and a complete search system
- Lecture 7: Evaluation in information retrieval
- Lecture 8: Web search engines, advanced topics, and conclusion

4 An exercise: This is an exercise that you can do at home if you want to review what we learned last week.
b. Draw the dictionary (also called the inverted index representation) for this collection.
c. What results are returned for these queries?
- schizophrenia AND drug
- for AND NOT (drug OR approach)

5 Introduction: To be able to search for documents quickly, we need to create an index ( 索引 ). What kind of index? A term-document matrix ( 关联矩阵 ), or a dictionary ( 词典 ) (also called inverted index 倒排索引 ). There are four steps to create an index.

6 How to create an index? Step 1: collect the documents to be indexed (Book1, Book2, Book3, ..., Book100).

7 How to create an index? Step 2: tokenize the text ( 标记文本 ), i.e. separate it into words. For example, the text of Book1, «The city of Shenzhen is located in China», becomes token1 («The»), token2 («city»), ..., token7 («in»), token8 («China»).

8 How to create an index? Step 3: Linguistic preprocessing ( 语言的预处理 ). Keep only the terms that are useful for indexing documents: «The city of Shenzhen is located in China» is reduced to «city», «Shenzhen», «located», «China». During that step, words can also be transformed if necessary: friends → friend, wolves → wolf, eaten → eat.

9 How to create an index? Step 4: Create the dictionary, which lists, for each term, the documents that contain it.
City: Book1, Book2, Book10, Book7, ...
Shenzhen: ...
Located: Book1, Book3, Book5, Book9, ...
China: Book1, Book20, Book34

10 How to create an index? The index has been created! It can then be used to search documents.
China: Book1, Book20, Book34
City: Book1, Book2, Book7, Book20, ...
Located: Book1, Book3, Book5, Book9, ...
Shenzhen: ...
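
The four steps above can be sketched in a few lines of Python. This is only an illustration, not the course's implementation: the document contents for Book2, the stop-word list and the function names are assumptions, and lower-casing is applied as a simple form of the preprocessing discussed later.

```python
import re
from collections import defaultdict

# Illustrative stop-word list (an assumption, not taken from the slides).
STOP_WORDS = {"the", "of", "is", "in"}

def tokenize(text):
    """Step 2: split the text into lower-cased word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(documents):
    """Steps 3-4: drop stop words, then map each term to a sorted list of documents."""
    index = defaultdict(set)
    for name, text in documents.items():
        for token in tokenize(text):
            if token not in STOP_WORDS:
                index[token].add(name)
    # Sort the terms and each posting list so that lists can be compared quickly.
    return {term: sorted(names) for term, names in sorted(index.items())}

# Step 1: collect the documents (the text of Book2 is hypothetical).
documents = {
    "Book1": "The city of Shenzhen is located in China",
    "Book2": "The city is large",
}
index = build_index(documents)
```

Here `index["city"]` lists both books, while `index["china"]` lists only Book1.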

11 CHAPTER 2: TERM VOCABULARY AND POSTING LISTS (p. 56)

12 In Chapter 2, we will discuss:
- Reading documents (2.1)
- Tokenization ( 标记化 ) and linguistic processing (2.2)
- Posting lists (2.3)
- An extended model to handle phrase and proximity queries (2.4), e.g. «City of Shenzhen»

13 2.1 Reading digital documents: Data ( 数据 ) stored in computers are represented as bits ( 比特 ). To read documents, an IR system must convert these bits into characters, e.g. the characters of «Hello».

14 Reading documents (2): How to convert from bits to characters? There exist several encodings ( 文本编码 ), such as ASCII and UTF-8, which map groups of bits to characters such as H, E, L, L, O.
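
As a small illustration of this conversion, the sketch below decodes five bytes into the characters H, E, L, L, O using Python's built-in codecs. The variable names are only for illustration; the Chinese sample string is an added example showing that UTF-8 uses several bytes for such characters.

```python
# Five bytes as they might be stored in a file.
raw = bytes([0x48, 0x45, 0x4C, 0x4C, 0x4F])

# The same data written as bits, eight bits per byte.
bits = " ".join(f"{b:08b}" for b in raw)

# ASCII and UTF-8 agree on these characters.
text_ascii = raw.decode("ascii")
text_utf8 = raw.decode("utf-8")

# In UTF-8, each of these Chinese characters takes three bytes.
chinese_bytes = "索引".encode("utf-8")
```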

15 Reading documents (3): An IR system will only extract the relevant content ( 相关内容 ) from a document (e.g. the text). For example, in a webpage ( 网页 ), the text ( 文本 ) is kept and the pictures ( 图片 ) can be ignored.

16 Reading documents (4): In this course, we consider English documents. English is read from left to right. Some other languages are more complex to read, e.g. Arabic ( 阿拉伯语 ) mixes left-to-right and right-to-left text, and some vowels ( 元音 ) are not written. Creating an index is difficult for such languages!

17 Reading documents (5): Some IR systems process each document individually, e.g. indexing each e-mail individually. Some IR systems process documents as groups, e.g. indexing all e-mails for a given day together.

18 Reading documents (6): It is also important to choose the granularity ( 粒度 ) carefully. Should we index a book as a single document? It can be a bad idea! For example, suppose we search for books about «food from China», but «food» appears only in the first chapter and «China» appears only in the last chapter. Then this book is not about food from China, yet it would be returned. Or should we index each chapter of the book separately?

19 2.2 Tokenization (1): After reading a document, the next step is tokenization ( 标记化 ). This means splitting a text into pieces called tokens ( 标记 ), while throwing away some characters such as punctuation ( 标点符号 ). A text → Tokenization ( 标记化 ) → Token1, Token2, ..., Token7.

20 Tokenization (2): A token is a sequence of characters ( 字符 ) appearing at a specific location in a document. Two tokens that are identical are said to be of the same type. Example: «This house is close to my house.» The two «house» tokens are of the same type.

21 Tokenization (3): Naive approach to tokenization ( 幼稚的方法 ): remove punctuation, then split the text at whitespace ( 空格 ).
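
A minimal sketch of this naive approach is given below, assuming that "punctuation" means the ASCII punctuation characters; the function name is only illustrative. It also shows the difference between tokens and types.

```python
import string

def naive_tokenize(text):
    """Remove ASCII punctuation, then split the text at whitespace."""
    cleaned = text.translate(str.maketrans("", "", string.punctuation))
    return cleaned.split()

tokens = naive_tokenize("This house is close to my house.")
types = set(tokens)  # identical tokens collapse into one type

# The same function applied to a harder sentence loses the apostrophes.
hard = naive_tokenize("Mr. O'Neill and his friends aren't")
```

The first sentence gives 7 tokens but only 6 types, because «house» occurs twice. The second call yields «ONeill» and «arent», which hints at why this approach is too simple.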

22 Tokenization (3): This approach has some problems, e.g. «Mr. O'Neill and his friends aren't». How should we tokenize «O'Neill» and «aren't»? Several tokenizations are possible. Which one is better?

23 Tokenization (4): In general, choosing how to tokenize a text influences how we can search for documents, e.g. «Mr. O'Neill and his friends aren't». If «aren't» is considered to be a single token, then a person who searches for the term «are» may not find the document.

24 Tokenization (5): e.g. «Mr. O'Neill and his friends aren't». If «aren't» is considered to be two tokens («are» and «n't»), then a person who searches for «aren't» may not find the document. Solution: 1 - Tokenize the documents. 2 - Tokenize the queries of users in the same way.

25 Tokenization (6): In general, tokenization is different for each language. For this reason, it is useful to first identify the language of a document before performing tokenization and indexing. In Chinese, a difficulty is that there are no whitespaces ( 空格 ) between words, e.g. 我喜欢这节课 («I like this class»).

26 Tokenization (6): Word segmentation ( 分词 ) is the process of dividing a text into words. In Chinese, there are some ambiguities ( 歧义 ), e.g. 和尚: «monk», or «and» + «still»? Simple solution: find the longest words. Other solutions: use Markov models and other techniques.
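
The «find the longest words» idea can be sketched as a greedy left-to-right longest-match segmenter. This is a simplified illustration: the vocabulary below is a tiny hypothetical word list, and the function name is an assumption. Real systems use much larger dictionaries or statistical models.

```python
def segment_longest_match(text, vocabulary):
    """Greedy segmentation: at each position, take the longest word found in the vocabulary.

    A single character is accepted as a fallback so that the loop always advances.
    """
    max_len = max((len(word) for word in vocabulary), default=1)
    words = []
    i = 0
    while i < len(text):
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if length == 1 or candidate in vocabulary:
                words.append(candidate)
                i += length
                break
    return words

# Hypothetical vocabulary, for illustration only.
vocabulary = {"我", "喜欢", "这节课", "和尚", "和", "尚"}
sentence = segment_longest_match("我喜欢这节课", vocabulary)
ambiguous = segment_longest_match("和尚", vocabulary)
```

With this vocabulary, 和尚 is kept as one word («monk») because the longest match wins, which is exactly the choice the simple solution makes.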

27 Tokenization (7): In English, there are whitespaces between words. But splitting a text at whitespace may cause problems: «San Francisco» is the name of a city (it should not be considered as two tokens); «1st January 2016» is a date; «Hunan University» should be considered as a single token.

28 Tokenization (8): A solution: for a given query such as «Hunan University», a search engine can retrieve documents using all the different tokenizations (e.g. «Hunan University» and «HunanUniversity») and combine the results.

29 Tokenization (9): In many languages, there are some unusual tokens, e.g. B-52 is an aircraft; C++ and C# are programming languages ( 编程语言 ); M*A*S*H is the name of a TV show ( 电视节目 ); a URL identifies a web page. It is important to consider these special tokens.

30 Tokenization (10): Some tokens can be ignored because it is unlikely that someone will search for them: amounts of money (e.g. 56 元) and numbers. Advantage: this reduces the size of the dictionary. Disadvantage: we cannot search for the tokens that are ignored.

31 Removing common words: In text documents, there are some words that are very common and may not be useful for retrieving documents (in English, e.g. «the», «of», «is»). Such words are called «stop words» ( 停用词 ).

32 Removing common words (2): These words can be ignored when indexing documents. In general, this will not cause problems when searching for documents. However, stop words are useful when searching for phrases ( 短语 ) (consecutive words), e.g. «Airplane tickets to Beijing» is more precise than «Airplane AND tickets AND Beijing».

33 Removing common words (3): Removing stop words results in a smaller index, but does not make a big difference in terms of performance (speed). Most Web search engines do not remove stop words. Instead, they use other strategies to cope with common words, based on statistics about words.
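
Stop-word removal can be sketched as a simple filter. The stop-word list below is a short illustrative sample (an assumption, not the list used in the course), and the function name is hypothetical.

```python
# A short illustrative stop-word list (not the list from the lecture).
STOP_WORDS = {"a", "an", "and", "are", "in", "is", "of", "the", "to"}

def remove_stop_words(tokens):
    """Keep only the tokens that are not stop words (comparison ignores case)."""
    return [token for token in tokens if token.lower() not in STOP_WORDS]

kept = remove_stop_words("The city of Shenzhen is located in China".split())
phrase = remove_stop_words("Airplane tickets to Beijing".split())
```

The second call drops «to», which shows why a phrase query can no longer be matched exactly once stop words are removed from the index.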

34-35 Normalization - 规范化: When a person (user, 用户 ) enters a query ( 查询 ) such as «cars shenzhen» in a search engine, the IR system will also «tokenize» the query. It is possible that the tokens obtained from the query do not match the tokens from the documents.

36 Normalization - 规范化 (3): Example: «cars» is used instead of «car», but these two tokens refer to the same object. «car» is used instead of «automobile», but these two tokens have the same meaning (they are synonyms - 同义词 ).

37 Normalization - 规范化 (4): Normalization ( 规范化 ) is the process of converting tokens to a standard form so that matches will occur despite small differences: cars → car; car → automobile; windows → window (but «Windows» may also be the operating system).

38 Normalization: accents and diacritics. Diacritic ( 变音符 ): a sign written above or below a letter that indicates a difference in pronunciation, e.g. à, é, ê. Should we just ignore them? In some languages, they are important. In Spanish: peña = a cliff, pena = sorrow.

39 Normalization: accents and diacritics. But it is possible that users will not type the diacritics, because they may be lazy or may not know how to type them on the computer. Thus, a strategy is to remove them, so that peña (a cliff) and pena (sorrow) are both treated as «pena».

40 Capitalization: Lower-case letters ( 小写 ): a, b, c, d, ... Upper-case letters ( 大写 ): A, B, C, D, ... A common strategy is to transform everything to lower-case letters: Ferrari → ferrari, Australia → australia. This can be a good idea because people often do not type upper-case letters when searching for documents.

41 Capitalization: But sometimes capitalization is important. Bush: a person named «Bush» ( 布什 ); bush: a bush ( 灌木 ). C.A.T: a company; cat: an animal ( 猫 ).

42 Capitalization: A good solution for English: convert only the first letter of a sentence to a lower-case letter. «Saturday, Jim went out to eat something.» → «saturday Jim went out to eat something». This is not a perfect solution, but it works most of the time. However, as mentioned, users may not type the upper-case letters anyway. Thus, transforming everything to lower-case is often the best solution.
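
The two normalization strategies discussed above (removing diacritics and converting to lower-case) can be sketched with Python's standard `unicodedata` module. The function names are illustrative. The important point is that the same function is applied to document tokens and to query tokens.

```python
import unicodedata

def strip_diacritics(token):
    """Decompose accented letters, then drop the combining marks (ñ becomes n)."""
    decomposed = unicodedata.normalize("NFD", token)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

def normalize(token):
    """Standard form used for both documents and queries."""
    return strip_diacritics(token).lower()

document_token = normalize("Peña")
query_token = normalize("pena")
matches = document_token == query_token
```

The cost of this choice is visible in the same example: after normalization, «Bush» and «bush» can no longer be told apart.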

43 Other issues in English: British spelling vs American spelling (colour, color). Dates written in different formats: 3/12/16, 3rd March 2016, Mar. 3,

44 Lemmatization: Sometimes the same word may have different forms: organize, organizes, organizing. Lemmatization: converting a word to a common base form called the lemma: am, are, is → be; car, cars, car's, cars' → car. The lemma of «car», «cars», ... is «car».

45 Lemmatization (2): How to perform lemmatization? A simple way, called stemming, consists of removing the end of words: cars → car, airplanes → airplane. But it may give some incorrect results: saw → s. The result should be «see»!

46 Lemmatization (2): If we want to perform lemmatization in a better way, it is necessary to analyze how the words are used in the text. This can be quite complicated. There exists software (free or commercial) to analyze texts and perform stemming for different languages. For English: Porter Stemmer martin/porterstemmer/

47 Example: applying the Porter Stemmer.

48 Lemmatization (3): In some cases, lemmatization can help to provide better results when searching for documents. But in some other cases, it does not help and leads to worse results. Thus, lemmatization is not always used in practice. Example of a problem:

49 Lemmatization (5): Example: the Porter Stemmer converts all these words: operate, operating, operates, operation, operative, operatives, operational to «oper». But these words have different meanings.
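
To make the idea of «removing the end of words» concrete, here is a toy suffix-stripping stemmer. It is not the Porter algorithm: the suffix list and the minimum stem length are assumptions chosen only so that the examples from the slides behave in a similar way.

```python
# Toy suffix list (an assumption); the real Porter Stemmer uses many ordered rules.
SUFFIXES = ("ational", "atives", "ative", "ation", "ating", "ates", "ate", "s")

def naive_stem(word, min_stem=3):
    """Strip the longest matching suffix, keeping at least min_stem characters."""
    w = word.lower()
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if w.endswith(suffix) and len(w) - len(suffix) >= min_stem:
            return w[:-len(suffix)]
    return w

family = ["operate", "operating", "operates", "operation",
          "operative", "operatives", "operational"]
stems = {naive_stem(word) for word in family}
```

All seven words collapse to the single stem «oper», which increases the number of matches but hides the differences in meaning.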

50 Lemmatization (6): In general, applying lemmatization allows users to find more documents using an information retrieval system. But these documents may be less relevant. In other words, lemmatization may increase recall but decrease precision.

51 Precision ( 准确率 ): What fraction of the returned results are relevant to the information need? Example: A person searches for webpages about Beijing. The search engine returns 5 relevant webpages and 5 irrelevant webpages. Precision = 5 / 10 = 0.5 (50 %).

52 Recall ( 召回 ): What fraction of the relevant documents in a collection were returned by the system? Example: A database contains 1000 documents about HITSZ. The user searches for documents about HITSZ. Only 100 documents about HITSZ are retrieved. Recall = 100 / 1000 = 0.1 (10 %).
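
The two measures can be written as short functions. The sketch below simply restates the definitions and reuses the numbers of the two examples; the function and parameter names are illustrative.

```python
def precision(relevant_returned, total_returned):
    """Fraction of the returned results that are relevant."""
    return relevant_returned / total_returned

def recall(relevant_returned, total_relevant):
    """Fraction of the relevant documents that were returned."""
    return relevant_returned / total_relevant

beijing_precision = precision(relevant_returned=5, total_returned=5 + 5)
hitsz_recall = recall(relevant_returned=100, total_relevant=1000)
```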

53 2.3 HOW TO SEARCH FASTER USING A DICTIONARY

54 Introduction: Last week, we saw how we can use a dictionary to search for documents. Example:

55-61 Example: QUERY: CITY AND CHINA
Dictionary (terms: City, Shenzhen, Located, China):
City: Book1, Book2, Book10, ..., Book20
Located: Book1, Book3, ...
China: Book1, Book20, ...
We need to compute the intersection ( 交集 ) of the two lists for City and China. To do that, we compare both lists, posting by posting.
RESULT: Book1, Book20
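
The posting-by-posting comparison can be sketched as follows, assuming that each posting list is sorted and that documents are identified by numbers (Book1 is 1, Book20 is 20, and so on). The function name is illustrative.

```python
def intersect(p1, p2):
    """Intersect two sorted posting lists by walking through both at the same time."""
    answer = []
    i = j = 0
    while i < len(p1) and j < len(p2):
        if p1[i] == p2[j]:
            answer.append(p1[i])   # the document contains both terms
            i += 1
            j += 1
        elif p1[i] < p2[j]:
            i += 1                 # advance in the list with the smaller document number
        else:
            j += 1
    return answer

city = [1, 2, 10, 20]
china = [1, 20, 34]
result = intersect(city, china)
```

Each list is read at most once, so the number of comparisons grows with the total length of the two lists.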

62 How to search faster? There are some techniques that allow faster search. One such technique is to use skip pointers. We will see the main idea (without the details).

63-66 Example: QUERY: CITY AND CHINA, using the same dictionary as in the previous example. We again need to compute the intersection ( 交集 ) of the lists for City and China. RESULT: Book1, Book20

67 Skip pointers: The idea is to use some «shortcuts» (arrows) to skip some entries when comparing lists. By doing this, we can compare lists of documents faster (we don't need to read the lists completely). This is just the main idea. We will not discuss the technical details! This idea only works for queries using the AND operator (it does not work for OR).
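
For readers who want to see the idea in code, here is one possible sketch. It is an assumption-laden simplification: posting lists are sorted Python lists of document numbers, and a «shortcut» is simulated by jumping a fixed number of entries (about the square root of the list length, a common heuristic) instead of storing real pointers.

```python
import math

def intersect_with_skips(p1, p2):
    """Intersect two sorted posting lists, jumping ahead with simulated skip pointers."""
    skip1 = math.isqrt(len(p1)) or 1   # distance covered by one skip in p1
    skip2 = math.isqrt(len(p2)) or 1   # distance covered by one skip in p2
    answer = []
    i = j = 0
    while i < len(p1) and j < len(p2):
        if p1[i] == p2[j]:
            answer.append(p1[i])
            i += 1
            j += 1
        elif p1[i] < p2[j]:
            # Skip only if the skip target does not go past the value we look for.
            if i % skip1 == 0 and i + skip1 < len(p1) and p1[i + skip1] <= p2[j]:
                while i % skip1 == 0 and i + skip1 < len(p1) and p1[i + skip1] <= p2[j]:
                    i += skip1
            else:
                i += 1
        else:
            if j % skip2 == 0 and j + skip2 < len(p2) and p2[j + skip2] <= p1[i]:
                while j % skip2 == 0 and j + skip2 < len(p2) and p2[j + skip2] <= p1[i]:
                    j += skip2
            else:
                j += 1
    return answer

skipped = intersect_with_skips([2, 4, 8, 16, 19, 23, 28, 43],
                               [1, 2, 3, 5, 8, 41, 51, 60, 71])
```

A skip is taken only when the entry it points to is not larger than the document number being searched for, so no common document can be missed.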

68 2.4 PHRASE QUERIES

69 Phrase query ( 精确查询 ): a query where words must appear consecutively (one after the other) in documents, e.g. «Harbin Institute of Technology». This query is written with quotation marks (" "). It will find all documents containing these words one after the other. This type of query is not supported by all Web search engines.

70 Phrase query (2): Some Web search engines will instead consider the proximity between words in documents. Documents where the words from a query appear closer together will be preferred to other documents. How to answer a phrase query?

71 Biword indexes: A solution is to consider each pair of consecutive terms in a document as a term. «I walked in Beijing» → «I walked», «walked in», «in Beijing». Those terms are called «biwords». The biwords can be used to create an index that we call a «biword index».

72 Illustration of a biword index:
I walked: Book1, Book5, Book10, ..., Book20
walked in: Book1, Book7, ...
in Beijing: Book1, Book12, ...

73 Biword indexes: Using a biword index, we can search using the biwords. A query: «Harbin Institute» AND «Institute of» AND «of Technology». This query would work pretty well. But it could still find documents where the words of the phrase «Harbin Institute of Technology» do not appear consecutively.

74 Biword indexes: How to solve this problem? A solution is to generalize the concept of biword index to more than two words (e.g. three words). Then, we may find more relevant documents. But a problem is that the index would become much larger (there will be more entries in the dictionary).
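
A biword index can be sketched as follows. The documents are hypothetical, tokenization is a plain lower-case split at whitespace, and the function names are illustrative. Book3 is built on purpose to show the false match described above.

```python
from collections import defaultdict

def biwords(text):
    """Return each pair of consecutive words as one term."""
    words = text.lower().split()
    return [f"{first} {second}" for first, second in zip(words, words[1:])]

def build_biword_index(documents):
    """Map each biword to the set of documents containing it."""
    index = defaultdict(set)
    for name, text in documents.items():
        for biword in biwords(text):
            index[biword].add(name)
    return index

def phrase_candidates(index, phrase):
    """AND together the posting sets of all biwords of the phrase."""
    postings = [index.get(biword, set()) for biword in biwords(phrase)]
    return set.intersection(*postings) if postings else set()

# Hypothetical documents.
documents = {
    "Book1": "I walked in Beijing",
    "Book2": "in Beijing I walked",
    "Book3": "Harbin Institute news from the Institute of Physics "
             "and the School of Technology",
}
index = build_biword_index(documents)
walked = phrase_candidates(index, "walked in Beijing")
false_match = phrase_candidates(index, "Harbin Institute of Technology")
```

Book3 is returned for the second query although the phrase never appears in it, because each of the three biwords occurs somewhere in the document.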

75-76 Positional indexes ( 位置索引 ): A better solution is to use another type of index called a positional index: a dictionary where the positions of terms in documents are stored.
Dictionary (terms: City, Shenzhen, Located, China); entries for City and Shenzhen:
City: Book1 (3, 25, 38); Book20 (4, 100, 1000)
Shenzhen: Book1 (2, 24, 35); Book20 (3, 500)
This indicates that «Shenzhen» appears as the 2nd, 24th and 35th word in Book1, and as the 3rd and 500th word in Book20.

77 Positional indexes: Positional indexes can be used to answer phrase queries.

78 Example: Phrase query «Shenzhen City». In Book1, «Shenzhen» is at position 2 and «City» at position 3; in Book20, «Shenzhen» is at position 3 and «City» at position 4. Result: Book1 and Book20.

79 Positional indexes: Positional indexes can also be used to answer proximity queries, e.g. «Shenzhen (within five words of) City».
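
The example above can be reproduced with a small sketch. The positions are those shown on the slide; the nested-dictionary layout and the function names are assumptions made for illustration, and only two-word queries are handled.

```python
# term -> document -> sorted positions of the term in that document
positional_index = {
    "city": {"Book1": [3, 25, 38], "Book20": [4, 100, 1000]},
    "shenzhen": {"Book1": [2, 24, 35], "Book20": [3, 500]},
}

def within(index, first, second, k, ordered=False):
    """Documents where `second` occurs within k words of `first`.

    With ordered=True, `second` must come after `first`.
    """
    docs_first = index.get(first, {})
    docs_second = index.get(second, {})
    results = []
    for doc in sorted(docs_first.keys() & docs_second.keys()):
        for p1 in docs_first[doc]:
            distances = [p2 - p1 for p2 in docs_second[doc]]
            if ordered:
                found = any(0 < d <= k for d in distances)
            else:
                found = any(0 < abs(d) <= k for d in distances)
            if found:
                results.append(doc)
                break
    return results

def phrase_query(index, first, second):
    """A two-word phrase: `second` must appear immediately after `first`."""
    return within(index, first, second, k=1, ordered=True)

phrase_result = phrase_query(positional_index, "shenzhen", "city")
proximity_result = within(positional_index, "shenzhen", "city", k=5)
reversed_result = phrase_query(positional_index, "city", "shenzhen")
```

The phrase query finds both books, while the reversed phrase «City Shenzhen» finds none, which a plain AND query could not distinguish.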

80 Conclusion: Today, we have discussed in more detail how indexes are created: tokenization, normalization, lemmatization. The PPT slides are on the website.

81 References Manning, C. D., Raghavan, P., Schütze, H. Introduction to information retrieval. Cambridge: Cambridge University Press,


TDS - 3. Battery Compartment. LCD Screen. Power Button. Hold Button. Body. Sensor. HM Digital, Inc.

TDS - 3. Battery Compartment. LCD Screen. Power Button. Hold Button. Body. Sensor. HM Digital, Inc. TDS - 3 Battery Compartment LCD Screen Power Button Hold Button Body Sensor Dual Range Measures from 0~999ppm, with a resolution of 1 ppm. From 1,000 to 9,990ppm, the resolution is 10 ppm, indicated by

More information

IR System Components. Lecture 2: Data structures and Algorithms for Indexing. IR System Components. IR System Components

IR System Components. Lecture 2: Data structures and Algorithms for Indexing. IR System Components. IR System Components IR System Components Lecture 2: Data structures and Algorithms for Indexing Information Retrieval Computer Science Tripos Part II Document Collection Ronan Cummins 1 Natural Language and Information Processing

More information

Mini Wireless Keyboard 迷你无线键盘

Mini Wireless Keyboard 迷你无线键盘 R R User Manual Mini Wireless Keyboard 迷你无线键盘 Model( 型号 ):KP-810-09 Rev. 1.0 www.ip azzpo rt.com Introduction: This Mini Wireless Keyboard consists of keyboard, mouse touchpad, and LED (optional) which

More information

Outline of the course

Outline of the course Outline of the course Introduction to Digital Libraries (15%) Description of Information (30%) Access to Information (30%) User Services (10%) Additional topics (15%) Buliding of a (small) digital library

More information

Presentation Title. By Author The MathWorks, Inc. 1

Presentation Title. By Author The MathWorks, Inc. 1 Presentation Title By Author 2014 The MathWorks, Inc. 1 4G LTE 轻松入门 陈建平 MathWorks 中国 2014 The MathWorks, Inc. 2 大纲 4G 综述 LTE 系统工具箱的应用 黄金参考模型 点到点链路级仿真 信号发生和分析 信号信息恢复 4G 系统的并行仿真加速 3 无线标准的演化 * *Although ETSI

More information

IDT-288-K001 SPECIFICATION MANUAL INSERTION CARD READER PRODUCT SPECIFICATION. Date 2013/06/22 Manual Insertion. Ver. 1.0 Card Reader Page 1/11

IDT-288-K001 SPECIFICATION MANUAL INSERTION CARD READER PRODUCT SPECIFICATION. Date 2013/06/22 Manual Insertion. Ver. 1.0 Card Reader Page 1/11 Card Reader Page 1/11 IDT-288-K001 MANUAL INSERTION CARD READER PRODUCT SPECIFICATION Card Reader Page 2/11 Revisions Version Date Content 1.0 2013.06.22 Initial release Card Reader Page 3/11 Thanks for

More information

[ 电子书 ]Spark for Data Science PDF 下载 Spark 大数据博客 -

[ 电子书 ]Spark for Data Science PDF 下载 Spark 大数据博客 - [ 电子书 ]Spark for Data Science PDF 下载 昨天分享了 [ 电子书 ]Apache Spark 2 for Beginners pdf 下载, 这本书很适合入门学习 Spark, 虽然书名上写着是 Apache Spark 2, 但是其内容介绍几乎和 Spark 2 毫无关系, 今天要分享的图书也是一本适合入门的 Spark 电子书, 也是 Packt 出版,2016

More information

More on indexing and text operations CE-324: Modern Information Retrieval Sharif University of Technology

More on indexing and text operations CE-324: Modern Information Retrieval Sharif University of Technology More on indexing and text operations CE-324: Modern Information Retrieval Sharif University of Technology M. Soleymani Fall 2015 Most slides have been adapted from: Profs. Manning, Nayak & Raghavan (CS-276,

More information

PRODUCT SPECIFICATION

PRODUCT SPECIFICATION 带 Page 1/9 PRODUCT CRT-570-(KX2) MB-ANL CREATOR (CHINA) TECH CO., LTD Add: 2F, M-10 Building, Center Area, Hi-tech Industrial Park, Shenzhen, China TEL:+86 755 26710691 FAX:+86 755 26710105 Http://www.china-creator.com

More information

NyearBluetoothPrint SDK. Development Document--Android

NyearBluetoothPrint SDK. Development Document--Android NyearBluetoothPrint SDK Development Document--Android (v0.98) 2018/09/03 --Continuous update-- I Catalogue 1. Introduction:... 3 2. Relevant knowledge... 4 3. Direction for use... 4 3.1 SDK Import... 4

More information

Information Retrieval and Web Search Engines

Information Retrieval and Web Search Engines Information Retrieval and Web Search Engines Lecture 4: Indexing April 27, 2010 Wolf-Tilo Balke and Joachim Selke Institut für Informationssysteme Technische Universität Braunschweig Recap: Inverted Indexes

More information

#MDCC Swift 链式语法应 用 陈乘

#MDCC Swift 链式语法应 用 陈乘 #MDCC 2016 Swift 链式语法应 用 陈乘 方 @ENJOY 关于我 Swift 开发者 ENJOY ios 客户端负责 人 两年年 Swift 实际项 目开发经验 微博 ID: webfrogs Twitter: nswebfrog Writing code is always easy, the hard part is reading it. 链式语法? 链式语法 可以连续不不断地进

More information

CS 6320 Natural Language Processing

CS 6320 Natural Language Processing CS 6320 Natural Language Processing Information Retrieval Yang Liu Slides modified from Ray Mooney s (http://www.cs.utexas.edu/users/mooney/ir-course/slides/) 1 Introduction of IR System components, basic

More information

nbns-list netbios-type network next-server option reset dhcp server conflict 1-34

nbns-list netbios-type network next-server option reset dhcp server conflict 1-34 目录 1 DHCP 1-1 1.1 DHCP 公共命令 1-1 1.1.1 dhcp dscp 1-1 1.1.2 dhcp enable 1-1 1.1.3 dhcp select 1-2 1.2 DHCP 服务器配置命令 1-3 1.2.1 address range 1-3 1.2.2 bims-server 1-4 1.2.3 bootfile-name 1-5 1.2.4 class 1-6

More information

PubMed 简介. PubMed 是美国国立医学图书馆 (NLM) 所属的国家生物技术信息中心 (NCBI) 开发的因特网生物医学信息检索系统

PubMed 简介. PubMed 是美国国立医学图书馆 (NLM) 所属的国家生物技术信息中心 (NCBI) 开发的因特网生物医学信息检索系统 生物医学文献数据库 PubMed PubMed 简介 PubMed 是美国国立医学图书馆 (NLM) 所属的国家生物技术信息中心 (NCBI) 开发的因特网生物医学信息检索系统 PubMed 主要提供基于 Web 的 Medline 数据库检索服务, 其中包括医学文献的订购, 全文在线阅读的链接, 专家信息的查询, 期刊检索以及相关书籍的链接等 数据每周二至周六每天更新 详见 http://baike.so.com/doc/1870897-1978857.html

More information

Data Structures and Algorithms Dr. Naveen Garg Department of Computer Science and Engineering Indian Institute of Technology, Delhi.

Data Structures and Algorithms Dr. Naveen Garg Department of Computer Science and Engineering Indian Institute of Technology, Delhi. Data Structures and Algorithms Dr. Naveen Garg Department of Computer Science and Engineering Indian Institute of Technology, Delhi Lecture 18 Tries Today we are going to be talking about another data

More information

大数据基准测试 : 原理 方法和应用. 詹剑锋 中国科学院计算技术研究所中国科学院大学 可信云服务大会, 北京 INSTITUTE OF COMPUTING TECHNOLOGY

大数据基准测试 : 原理 方法和应用. 詹剑锋   中国科学院计算技术研究所中国科学院大学 可信云服务大会, 北京 INSTITUTE OF COMPUTING TECHNOLOGY 大数据基准测试 : 原理 方法和应用 詹剑锋 http://prof.ict.ac.cn/bigdatabench 中国科学院计算技术研究所中国科学院大学 2015.7.31 2015 可信云服务大会, 北京 INSTITUTE OF COMPUTING TECHNOLOGY Outline 原理 方法 BigDataBench 计量的意义 科学和人类日常生活的基础 牛顿 ( 力 ) 开尔文 ( 温度

More information

Multiprotocol Label Switching The future of IP Backbone Technology

Multiprotocol Label Switching The future of IP Backbone Technology Multiprotocol Label Switching The future of IP Backbone Technology Computer Network Architecture For Postgraduates Chen Zhenxiang School of Information Science and Technology. University of Jinan (c) Chen

More information

Chapter 10 Java and SQL. Wang Yang

Chapter 10 Java and SQL. Wang Yang Chapter 10 Java and SQL Wang Yang wyang@njnet.edu.cn Outline Concern Data - File & IO vs. Database &SQL Database & SQL How Connect Java to SQL - Java Model for Database Java Database Connectivity (JDBC)

More information

CS347. Lecture 2 April 9, Prabhakar Raghavan

CS347. Lecture 2 April 9, Prabhakar Raghavan CS347 Lecture 2 April 9, 2001 Prabhakar Raghavan Today s topics Inverted index storage Compressing dictionaries into memory Processing Boolean queries Optimizing term processing Skip list encoding Wild-card

More information

Information Retrieval

Information Retrieval Introduction to Information Retrieval CS4611: Information Retrieval Professor M. P. Schellekens Assistant: Ang Gao Slides adapted from P. Nayak and P. Raghavan Information Retrieval Lecture 2: The term

More information

Information Retrieval and Web Search

Information Retrieval and Web Search Information Retrieval and Web Search Text processing Instructor: Rada Mihalcea (Note: Some of the slides in this slide set were adapted from an IR course taught by Prof. Ray Mooney at UT Austin) IR System

More information

Command Dictionary CUSTOM

Command Dictionary CUSTOM 命令模式 CUSTOM [(filename)] [parameters] Executes a "custom-designed" command which has been provided by special programming using the GHS Programming Interface. 通过 GHS 程序接口, 执行一个 用户设计 的命令, 该命令由其他特殊程序提供 参数说明

More information

Recap of the previous lecture. This lecture. A naïve dictionary. Introduction to Information Retrieval. Dictionary data structures Tolerant retrieval

Recap of the previous lecture. This lecture. A naïve dictionary. Introduction to Information Retrieval. Dictionary data structures Tolerant retrieval Ch. 2 Recap of the previous lecture Introduction to Information Retrieval Lecture 3: Dictionaries and tolerant retrieval The type/token distinction Terms are normalized types put in the dictionary Tokenization

More information

Technology: Anti-social Networking 科技 : 反社交网络

Technology: Anti-social Networking 科技 : 反社交网络 Technology: Anti-social Networking 科技 : 反社交网络 1 Technology: Anti-social Networking 科技 : 反社交网络 The Growth of Online Communities 社交网络使用的增长 Read the text below and do the activity that follows. 阅读下面的短文, 然后完成练习

More information

Information Retrieval

Information Retrieval Information Retrieval Natural Language Processing: Lecture 12 30.11.2017 Kairit Sirts Homework 4 things that seemed to work Bidirectional LSTM instead of unidirectional Change LSTM activation to sigmoid

More information

Today s topics CS347. Inverted index storage. Inverted index storage. Processing Boolean queries. Lecture 2 April 9, 2001 Prabhakar Raghavan

Today s topics CS347. Inverted index storage. Inverted index storage. Processing Boolean queries. Lecture 2 April 9, 2001 Prabhakar Raghavan Today s topics CS347 Lecture 2 April 9, 2001 Prabhakar Raghavan Inverted index storage Compressing dictionaries into memory Processing Boolean queries Optimizing term processing Skip list encoding Wild-card

More information

Cyber Security Introduction

Cyber Security Introduction Cyber Security Introduction 2018.10.17 Huiping Sun( ) sunhp@ss.pku.edu.cn Course Overview 教师信息 sunhp@ss.pku.edu.cn https://huipingsun.github.io 1 1530E Course Overview 课程基本信息 14 17 (3204) 10 17 11 14 11

More information

Introduction to Information Retrieval

Introduction to Information Retrieval Mustafa Jarrar: Lecture Notes on Information Retrieval University of Birzeit, Palestine 2014 Introduction to Information Retrieval Dr. Mustafa Jarrar Sina Institute, University of Birzeit mjarrar@birzeit.edu

More information

计算机科学与技术专业本科培养计划. Undergraduate Program for Specialty in Computer Science & Technology

计算机科学与技术专业本科培养计划. Undergraduate Program for Specialty in Computer Science & Technology 计算机科学与技术学院 计算机科学与技术学院下设 6 个研究所 : 计算科学理论研究所 数据工程研究所 并行分布式计算研究所 数据存储研究所 数字媒体研究所 信息安全研究所 ;2 个中心 : 嵌入式软件与系统工程中心和教学中心 外存储系统国家专业实验室 教育部信息存储系统重点实验室 中国教育科研网格主结点 国家高性能计算中心 ( 武汉 ) 服务计算技术与系统教育部重点实验室 湖北省数据库工程技术研究中心

More information

More about Posting Lists

More about Posting Lists More about Posting Lists 1 FASTER POSTINGS MERGES: SKIP POINTERS/SKIP LISTS 2 Sec. 2.3 Recall basic merge Walk through the two postings simultaneously, in time linear in the total number of postings entries

More information

Chapter 4. Processing Text

Chapter 4. Processing Text Chapter 4 Processing Text Processing Text Modifying/Converting documents to index terms Convert the many forms of words into more consistent index terms that represent the content of a document What are

More information

Lenovo S850. Quick Start Guide v1.0. Read this guide carefully before using your smartphone.

Lenovo S850. Quick Start Guide v1.0. Read this guide carefully before using your smartphone. Lenovo S80 Quick Start Guide v.0 Read this guide carefully before using your smartphone. Technical specifications Model and system Model: Lenovo S80 System: Android. Dimensions and weight Height:. mm (.6

More information

Information Retrieval. (M&S Ch 15)

Information Retrieval. (M&S Ch 15) Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion

More information

Lenovo S580. Quick Start Guide v1.0. Read this guide carefully before using your smartphone.

Lenovo S580. Quick Start Guide v1.0. Read this guide carefully before using your smartphone. Lenovo S0 Quick Start Guide v.0 Read this guide carefully before using your smartphone. Technical specifications Model and system Model: Lenovo S0 Dimensions and weight Height:. mm (.0 inches) Depth:.mm

More information

display portal server display portal user display portal user count display portal web-server

display portal server display portal user display portal user count display portal web-server 目录 1 Portal 1-1 1.1 Portal 配置命令 1-1 1.1.1 aaa-fail nobinding enable 1-1 1.1.2 aging-time 1-1 1.1.3 app-id (Facebook authentication server view) 1-2 1.1.4 app-id (QQ authentication server view) 1-3 1.1.5

More information

Research of Attitude Measuring System Using Single Camera for Non-cooperative Spacecraft 基于单目相机的空间非合作目标姿态测量

Research of Attitude Measuring System Using Single Camera for Non-cooperative Spacecraft 基于单目相机的空间非合作目标姿态测量 系统与设计 Research of Attitude Measuring System Using Single Camera for Non-cooperative Spacecraft LI You-wen,ZHANG Xi-tao,ZHANG Xue-feng (Luoyang Opto-Electro Technology Development Center, Luoyang 471009,

More information

Introduction to Computer Science

Introduction to Computer Science Introduction to Computer Science 郝建业副教授 软件学院 http://www.escience.cn/people/jianye/index.html Lecturer Jianye HAO ( 郝建业 ) Email: jianye.hao@tju.edu.cn Tutor: Li Shuxin ( 李姝昕 ) Email: 957005030@qq.com Outline

More information

Keygen Codes For Photoshop Cs6 ->>> DOWNLOAD

Keygen Codes For Photoshop Cs6 ->>> DOWNLOAD Keygen Codes For Photoshop Cs6 ->>> DOWNLOAD 1 / 5 2 / 5 Photoshop Cs6 Crack Serial Ke....op_sp_fanyi{font-size:1em;word-break:normal;}.op_sp_fanyi.op_sp_fanyi_read{display: inline-block;*display: inline;*zoom:1;marginleft:4px;*position:relative;*top:-2px;}.op_sp_fanyi_how_read,.op_sp_fanyi_mp3_play{display:block;w

More information

Overview. Lecture 3: Index Representation and Tolerant Retrieval. Type/token distinction. IR System components

Overview. Lecture 3: Index Representation and Tolerant Retrieval. Type/token distinction. IR System components Overview Lecture 3: Index Representation and Tolerant Retrieval Information Retrieval Computer Science Tripos Part II Ronan Cummins 1 Natural Language and Information Processing (NLIP) Group 1 Recap 2

More information

Logitech G302 Daedalus Prime Setup Guide 设置指南

Logitech G302 Daedalus Prime Setup Guide 设置指南 Logitech G302 Daedalus Prime Setup Guide 设置指南 Logitech G302 Daedalus Prime Contents / 目录 English................. 3 简体中文................. 6 2 Logitech G302 Daedalus Prime 1 On 2 USB Your Daedalus Prime

More information

GUJARAT TECHNOLOGICAL UNIVERSITY

GUJARAT TECHNOLOGICAL UNIVERSITY GUJARAT TECHNOLOGICAL UNIVERSITY INFORMATION TECHNOLOGY DATA COMPRESSION AND DATA RETRIVAL SUBJECT CODE: 2161603 B.E. 6 th SEMESTER Type of course: Core Prerequisite: None Rationale: Data compression refers

More information

Made by: Ali Ibrahim. Supervisor: MR. Ali Jnaide. Class: 12

Made by: Ali Ibrahim. Supervisor: MR. Ali Jnaide. Class: 12 Made by: Ali Ibrahim Supervisor: MR. Ali Jnaide Class: 12 pg. 1 Introduction: As recently as the 1990s, studies showed that most people preferred getting information from other people rather than from

More information

Indexing and Query Processing. What will we cover?

Indexing and Query Processing. What will we cover? Indexing and Query Processing CS 510 Winter 2007 1 What will we cover? Key concepts and terminology Inverted index structures Organization, creation, maintenance Compression Distribution Answering queries

More information

Digital Asset Management 数字媒体资源管理理 2. Introduction to Digital Media Format

Digital Asset Management 数字媒体资源管理理 2. Introduction to Digital Media Format Digital Asset Management 数字媒体资源管理理 2. Introduction to Digital Media Format 任课 老老师 : 张宏鑫 2017-09-27 Outline Image format and coding methods Audio format and coding methods Video format and coding methods

More information

Efficiency. Efficiency: Indexing. Indexing. Efficiency Techniques. Inverted Index. Inverted Index (COSC 488)

Efficiency. Efficiency: Indexing. Indexing. Efficiency Techniques. Inverted Index. Inverted Index (COSC 488) Efficiency Efficiency: Indexing (COSC 488) Nazli Goharian nazli@cs.georgetown.edu Difficult to analyze sequential IR algorithms: data and query dependency (query selectivity). O(q(cf max )) -- high estimate-

More information

基于项目信息类专业教育实验班本科培养计划 (2+2)

基于项目信息类专业教育实验班本科培养计划 (2+2) 基于项目信息类专业教育实验班本科培养计划 (+) Undergraduate Program for Advanced Project-based Information Science Education (+) 一培养目标 Ⅰ. Program Objectives This program is designed in a unique learning-via-doing approach

More information

计算机组成原理第二讲 第二章 : 运算方法和运算器 数据与文字的表示方法 (1) 整数的表示方法. 授课老师 : 王浩宇

计算机组成原理第二讲 第二章 : 运算方法和运算器 数据与文字的表示方法 (1) 整数的表示方法. 授课老师 : 王浩宇 计算机组成原理第二讲 第二章 : 运算方法和运算器 数据与文字的表示方法 (1) 整数的表示方法 授课老师 : 王浩宇 haoyuwang@bupt.edu.cn 1 Today: Bits, Bytes, and Integers Representing information as bits Bit-level manipulations Integers Representation: unsigned

More information

Division of Science and Technology

Division of Science and Technology BNU-HKBU UNITED INTERNATIONAL COLLEGE UNDERGRADUATE HANDBOOK 2008 Division of Science and Technology Computer Science and Technology Programme Com puter Science and Technology Program m e Contents 1.

More information

DataCube Data Analysis With Mongodb.

DataCube Data Analysis With Mongodb. DataCube Data Analysis With Mongodb SongheYang@taobao datacube.taobao.com shu.taobao.com Project 数据化运营平台 首页 客户信息 类目信息 潜在用户 产品监控 运营工具 Agenda Offline Data Analysis Realtime Data Analysis New Aggregation

More information

Privacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras

Privacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras Privacy and Security in Online Social Networks Department of Computer Science and Engineering Indian Institute of Technology, Madras Lecture - 25 Tutorial 5: Analyzing text using Python NLTK Hi everyone,

More information

Chapter 2. Architecture of a Search Engine

Chapter 2. Architecture of a Search Engine Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them

More information

A CAN Bus Based Control System for Joint Modular Robot

A CAN Bus Based Control System for Joint Modular Robot A CAN Bus Based Control System for Joint Modular Robot Qingkai Han* a, Li Wang a, Yunpeng Ren b, Tianxia Zhang a, Bangchun Wen a a School of Mechanical Engineering, Northeastern Univeristy, Shenyang, China

More information

Britannica Academic Online Edition 大不列顛百科全书网络学术版

Britannica Academic Online Edition 大不列顛百科全书网络学术版 Britannica Academic Online Edition 大不列顛百科全书网络学术版 The Complete Digital Resource Deep use of online resources 2013 The Complete Digital Resource High profile contributors Current content Collaborative content

More information

Natural Language Processing

Natural Language Processing Natural Language Processing Information Retrieval Potsdam, 14 June 2012 Saeedeh Momtazi Information Systems Group based on the slides of the course book Outline 2 1 Introduction 2 Indexing Block Document

More information