Phd. studies at Mälardalen University

Size: px
Start display at page:

Download "Phd. studies at Mälardalen University"

Transcription

1 Phd. studies at Mälardalen University Christopher Engström Mälardalen University, School of Education, Culture and Communication (UKK), Mathematics/Applied mathematics

2 Contents What do you do as a Phd. student?

3 Contents What do you do as a Phd. student? My education before Phd. studies.

4 Contents What do you do as a Phd. student? My education before Phd. studies. Examples of some mathematical problems.

5 What do you do as a Phd. Student? 80/20 Phd studies/faculty work

6 What do you do as a Phd. Student? 80/20 Phd studies/faculty work Research

7 What do you do as a Phd. Student? 80/20 Phd studies/faculty work Research Studies

8 What do you do as a Phd. Student? 80/20 Phd studies/faculty work Research Studies Teaching

9 Courses Mathematical Modelling Mathematical Modelling, (A) Biological Systems Optimization Applied Mathematics Engineering Mathematics Num. Methods for Differential Eq. Discrete Mathematics Simulation Tools Time Series Analysis Finite Element Method Monte Carlo and Empirical Methods for Statistical Inference Calculus in One Variable Calculus in Several Variables Linear Algebra Matrix theory Mathematical Statistics Markov Processes Mathematical Communication Mathematical Structures Algebra Stationary Stochastic Processes Systems and Transforms Analytic Functions

10 Courses Signals and Communications Biology, Introductory Course Image Analysis Multi-spectral Imaging Microeconomics for Technicians Automatic Control, Basic Course Physics Electromagnetic Field Theory Numerical Methods in CAGD Programming Algorithms and Data structures Algorithm Implementation C programming Algorithm Theory Database Technology Functional programming Computer Graphics

11 Natural language processing Problem formulation: Given a collection of abstracts of scientific articles in some field of research: Find words or short phrases which convey knowledge of the domain. Annotation done manually by experts: time consuming and costly.

12 Natural language processing Problem formulation: Given a collection of abstracts of scientific articles in some field of research: Find words or short phrases which convey knowledge of the domain. Annotation done manually by experts: time consuming and costly. Annotation done automatically: not good enough.

13 Natural language processing Problem formulation: Given a collection of abstracts of scientific articles in some field of research: Find words or short phrases which convey knowledge of the domain. Annotation done manually by experts: time consuming and costly. Annotation done automatically: not good enough. Applications in for example medicine.

14 Natural language processing Problem formulation: Given a collection of abstracts of scientific articles in some field of research: Find words or short phrases which convey knowledge of the domain. Annotation done manually by experts: time consuming and costly. Annotation done automatically: not good enough. Applications in for example medicine. Work together with Thierry Hamon, Paris-Nord University, Computer Science Laboratory for Mechanics and Engineering Sciences.

15 Example of annotated abstract To address potential mechanisms by which estrogens suppress erythropoiesis, we have examined their effects on GATA 1, an erythroid transcription factor that participates in the regulation of the majority of erythroid cell specific genes and is necessary for full maturation of erythrocytes.

16 Natural language processing Texts Term extraction rule-based approaches Candidate terms My work in the Term ranking step: Term ranking frequency term length C-Value frequency: How often does a term appear? Ranked candidate terms Validation by terminologists

17 Natural language processing Texts Term extraction rule-based approaches Candidate terms My work in the Term ranking step: Term ranking frequency term length C-Value frequency: How often does a term appear? term length: Longer terms less important? Ranked candidate terms Validation by terminologists

18 Natural language processing Texts Term extraction rule-based approaches Candidate terms My work in the Term ranking step: Term ranking frequency term length C-Value frequency: How often does a term appear? term length: Longer terms less important? Ranked candidate terms Validation by terminologists C-value: Long terms not part of other terms are preferred.

19 Parsing tree of a term modifier component head component modifier component full (of) head component maturation erythrocytes Figure: Parsing tree of the term full maturation of erythrocytes

20 C-Value Original definition [Frantzi et al. 1997]: log 2 ( t + 1) f (t) if t is not included in a term C Value(t) = log 2 ( t + 1) (f (t) 1 f (t )) otherwise P(T t ) t T t

21 C-Value Original definition [Frantzi et al. 1997]: log 2 ( t + 1) f (t) if t is not included in a term C Value(t) = log 2 ( t + 1) (f (t) 1 f (t )) otherwise P(T t ) t T t Parametrised C-Value ( ) t + 1 log 2 t α f (t), if t is not included in a term (Roo R ( ) t + 1 log 2 t α f (t) c H H 1/β H f (t ) βh, C Value t T t = if t is a Head term ( ) t + 1 log 2 t α f (t) c M M 1/β M f (t ) βm, t T t if t is a Modifier term

22 F-measure and average precision for different set of parameters Frequency and the C-Value: better average precision for the very first terms Then, all the C-Value* models outperform the frequency and the C-Value ranking Similar ranking for all the C-Value* models

23 Network analysis in cancer research, background Out of inferred gene expression networks, can we predict which genes are important for the development of a cancer? Work together with (mainly) Holger Weishaupt, Uppsala University, Rudbecklaboratoriet.

24 Network analysis in cancer research, background Out of inferred gene expression networks, can we predict which genes are important for the development of a cancer? Work together with (mainly) Holger Weishaupt, Uppsala University, Rudbecklaboratoriet. Finding relevant genes makes it probable that you could develop medication for that cancer type rather than using other more intrusive methods.

25 Network analysis in cancer research Omics data Samples Gene correlations Genes Adjacency matrix Genes Genes Genes Genes Centralities Inferred network

26 Network analysis in cancer research Problems: Low quality and bad conservation of centralities for generated networks compared to reference networks.

27 Network analysis in cancer research Problems: Low quality and bad conservation of centralities for generated networks compared to reference networks. What centrality mesures can be used to find driver genes for the cancer?

28 Network analysis in cancer research Problems: Low quality and bad conservation of centralities for generated networks compared to reference networks. What centrality mesures can be used to find driver genes for the cancer? Low amount of good reference networks for training and verification.

29 PageRank Used to rank homepages based on the webs linkstructure.

30 PageRank Used to rank homepages based on the webs linkstructure. Needs to be both very fast but also of good quality.

31 PageRank Used to rank homepages based on the webs linkstructure. Needs to be both very fast but also of good quality.

32 PageRank Used to rank homepages based on the webs linkstructure. Needs to be both very fast but also of good quality. Our main considerations: Can the rank of different types of graphs and/or subgraphs be calculated more efficiently?

33 PageRank Used to rank homepages based on the webs linkstructure. Needs to be both very fast but also of good quality. Our main considerations: Can the rank of different types of graphs and/or subgraphs be calculated more efficiently? Generalizations and variations of the definition of PageRank.

34 PageRank Used to rank homepages based on the webs linkstructure. Needs to be both very fast but also of good quality. Our main considerations: Can the rank of different types of graphs and/or subgraphs be calculated more efficiently? Generalizations and variations of the definition of PageRank. How does the rank change if you introduce some changes to the graph?

35 PageRank Defined as the (right) eigenvector of matrix M: M = c(a + gw ) + (1 c)we (1)

36 PageRank Defined as the (right) eigenvector of matrix M: M = c(a + gw ) + (1 c)we (1) Defines as a sum of random walks on a graph: R j = W j + ( ) W i P(v i v j ) (P(v j v j )) k v i V,v i v j k=0 (2)

37 Calculating PageRank Power iteration: R n+1 = MR n

38 Calculating PageRank Power iteration: Better: R n+1 = MR n R n+1 = ca R n + dw d = 1 ca R n 1

39 Calculating PageRank Power iteration: Better: R n+1 = MR n R n+1 = ca R n + dw d = 1 ca R n 1 Power series: { Pn+1 = cap n = W P 0 R (3) n = n k=0 P k

40 Different types of vertices Definition For the vertices of a simple directed graph with no loops we define 5 distinct groups G 1, G 2,..., G 5 1. G 1 : Vertices with no outgoing or incoming edges.

41 Different types of vertices Definition For the vertices of a simple directed graph with no loops we define 5 distinct groups G 1, G 2,..., G 5 1. G 1 : Vertices with no outgoing or incoming edges. 2. G 2 : Vertices with no outgoing edges and at least one incoming edge (also called dangling nodes).

42 Different types of vertices Definition For the vertices of a simple directed graph with no loops we define 5 distinct groups G 1, G 2,..., G 5 1. G 1 : Vertices with no outgoing or incoming edges. 2. G 2 : Vertices with no outgoing edges and at least one incoming edge (also called dangling nodes). 3. G 3 : Vertices with at least one outgoing edge, but no incoming edges (also called root nodes).

43 Different types of vertices Definition For the vertices of a simple directed graph with no loops we define 5 distinct groups G 1, G 2,..., G 5 1. G 1 : Vertices with no outgoing or incoming edges. 2. G 2 : Vertices with no outgoing edges and at least one incoming edge (also called dangling nodes). 3. G 3 : Vertices with at least one outgoing edge, but no incoming edges (also called root nodes). 4. G 4 : Vertices with at least one outgoing and incoming edge, but which is not part of any directed cycle (no path from the vertex back to itself).

44 Different types of vertices Definition For the vertices of a simple directed graph with no loops we define 5 distinct groups G 1, G 2,..., G 5 1. G 1 : Vertices with no outgoing or incoming edges. 2. G 2 : Vertices with no outgoing edges and at least one incoming edge (also called dangling nodes). 3. G 3 : Vertices with at least one outgoing edge, but no incoming edges (also called root nodes). 4. G 4 : Vertices with at least one outgoing and incoming edge, but which is not part of any directed cycle (no path from the vertex back to itself). 5. G 5 : Vertices that is part of at least one cycle.

45 PageRank for different types of vertices Vertices with no incomming or outgoing edges: Theorem We let e g G 1 and e i be any other vertex, then R g = W g, and P(v g v i ) = 0. (3)

46 PageRank for different types of vertices Vertices with no incomming or outgoing edges: Theorem We let e g G 1 and e i be any other vertex, then R g = W g, and P(v g v i ) = 0. (3) Vertices with no outgoing edges: Theorem We let e g G 2 and e i be any other vertex, then R g = v i G, v i v g W i P(v i v g ) + v g, and P(v g v i ) = 0. (4)

47 PageRank for different types of vertices Vertices with no incomming edges: Theorem We let e g G 3 and e i be any other vertex, then R g = W g, and v i G, v i v g P(v g v i ) > 0. (5)

48 PageRank for different types of vertices Useful when calculating the rank of other vertices: Theorem Given R g = W g, v g G 3. We can write the PageRank of another general vertex e i as R i = W i + W g ca gi + v j G v j v i,v g ( ) (P(v i v i )) k k=0 (W j + W g ca gj )P(v j v i ) where ca gi is the one-step probability to go from v g to v i.

49 PageRank, graph reordering Figure: Non-zero values of adjacency matrix for the Web graph before and after sorting vertices according to level and component.

50 PageRank iterations/component Figure: Number of iterations needed per SCC. The horizontal line denotes the result where the whole graph is considered a single component.

51 Thank you all for listening! Questions?

PAGERANK IN EVOLVING NETWORKS AND APPLICATIONS OF GRAPHS IN NATURAL LANGUAGE PROCESSING AND BIOLOGY

PAGERANK IN EVOLVING NETWORKS AND APPLICATIONS OF GRAPHS IN NATURAL LANGUAGE PROCESSING AND BIOLOGY Mälardalen University Press Dissertations No. 217 PAGERANK IN EVOLVING NETWORKS AND APPLICATIONS OF GRAPHS IN NATURAL LANGUAGE PROCESSING AND BIOLOGY Christopher Engström 2016 School of Education, Culture

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400 Statistics (STAT) 1 Statistics (STAT) STAT 1200: Introductory Statistical Reasoning Statistical concepts for critically evaluation quantitative information. Descriptive statistics, probability, estimation,

More information

Mathematical Methods and Computational Algorithms for Complex Networks. Benard Abola

Mathematical Methods and Computational Algorithms for Complex Networks. Benard Abola Mathematical Methods and Computational Algorithms for Complex Networks Benard Abola Division of Applied Mathematics, Mälardalen University Department of Mathematics, Makerere University Second Network

More information

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 http://intelligentoptimization.org/lionbook Roberto Battiti

More information

Mathematical Analysis of Google PageRank

Mathematical Analysis of Google PageRank INRIA Sophia Antipolis, France Ranking Answers to User Query Ranking Answers to User Query How a search engine should sort the retrieved answers? Possible solutions: (a) use the frequency of the searched

More information

1 Random Walks on Graphs

1 Random Walks on Graphs Lecture 7 Com S 633: Randomness in Computation Scribe: Ankit Agrawal In the last lecture, we looked at random walks on line and used them to devise randomized algorithms for 2-SAT and 3-SAT For 2-SAT we

More information

Social Network Analysis

Social Network Analysis Social Network Analysis Giri Iyengar Cornell University gi43@cornell.edu March 14, 2018 Giri Iyengar (Cornell Tech) Social Network Analysis March 14, 2018 1 / 24 Overview 1 Social Networks 2 HITS 3 Page

More information

Application of PageRank Algorithm on Sorting Problem Su weijun1, a

Application of PageRank Algorithm on Sorting Problem Su weijun1, a International Conference on Mechanics, Materials and Structural Engineering (ICMMSE ) Application of PageRank Algorithm on Sorting Problem Su weijun, a Department of mathematics, Gansu normal university

More information

Lecture #3: PageRank Algorithm The Mathematics of Google Search

Lecture #3: PageRank Algorithm The Mathematics of Google Search Lecture #3: PageRank Algorithm The Mathematics of Google Search We live in a computer era. Internet is part of our everyday lives and information is only a click away. Just open your favorite search engine,

More information

Link Analysis and Web Search

Link Analysis and Web Search Link Analysis and Web Search Moreno Marzolla Dip. di Informatica Scienza e Ingegneria (DISI) Università di Bologna http://www.moreno.marzolla.name/ based on material by prof. Bing Liu http://www.cs.uic.edu/~liub/webminingbook.html

More information

Chapter 4. Relations & Graphs. 4.1 Relations. Exercises For each of the relations specified below:

Chapter 4. Relations & Graphs. 4.1 Relations. Exercises For each of the relations specified below: Chapter 4 Relations & Graphs 4.1 Relations Definition: Let A and B be sets. A relation from A to B is a subset of A B. When we have a relation from A to A we often call it a relation on A. When we have

More information

Analysis of Biological Networks. 1. Clustering 2. Random Walks 3. Finding paths

Analysis of Biological Networks. 1. Clustering 2. Random Walks 3. Finding paths Analysis of Biological Networks 1. Clustering 2. Random Walks 3. Finding paths Problem 1: Graph Clustering Finding dense subgraphs Applications Identification of novel pathways, complexes, other modules?

More information

Math 778S Spectral Graph Theory Handout #2: Basic graph theory

Math 778S Spectral Graph Theory Handout #2: Basic graph theory Math 778S Spectral Graph Theory Handout #: Basic graph theory Graph theory was founded by the great Swiss mathematician Leonhard Euler (1707-178) after he solved the Königsberg Bridge problem: Is it possible

More information

The link prediction problem for social networks

The link prediction problem for social networks The link prediction problem for social networks Alexandra Chouldechova STATS 319, February 1, 2011 Motivation Recommending new friends in in online social networks. Suggesting interactions between the

More information

B.Sc. & B.Sc. (Hons) with Major in Applied Mathematics

B.Sc. & B.Sc. (Hons) with Major in Applied Mathematics B.Sc. & B.Sc. (Hons) with Major in Applied Mathematics Graduation Requirements for students admitted in AY2017/18 or after To be awarded a B.Sc. or B.Sc.(Hons) with primary major in Applied Mathematics,

More information

Computer vision: models, learning and inference. Chapter 10 Graphical Models

Computer vision: models, learning and inference. Chapter 10 Graphical Models Computer vision: models, learning and inference Chapter 10 Graphical Models Independence Two variables x 1 and x 2 are independent if their joint probability distribution factorizes as Pr(x 1, x 2 )=Pr(x

More information

Lecture 1: Examples, connectedness, paths and cycles

Lecture 1: Examples, connectedness, paths and cycles Lecture 1: Examples, connectedness, paths and cycles Anders Johansson 2011-10-22 lör Outline The course plan Examples and applications of graphs Relations The definition of graphs as relations Connectedness,

More information

A Modified Algorithm to Handle Dangling Pages using Hypothetical Node

A Modified Algorithm to Handle Dangling Pages using Hypothetical Node A Modified Algorithm to Handle Dangling Pages using Hypothetical Node Shipra Srivastava Student Department of Computer Science & Engineering Thapar University, Patiala, 147001 (India) Rinkle Rani Aggrawal

More information

CS6200 Information Retreival. The WebGraph. July 13, 2015

CS6200 Information Retreival. The WebGraph. July 13, 2015 CS6200 Information Retreival The WebGraph The WebGraph July 13, 2015 1 Web Graph: pages and links The WebGraph describes the directed links between pages of the World Wide Web. A directed edge connects

More information

Spanning and weighted spanning trees. A different kind of optimization. (graph theory is cool.)

Spanning and weighted spanning trees. A different kind of optimization. (graph theory is cool.) A different kind of optimization (graph theory is cool.) A different kind of optimization (graph theory is cool.) Definitions and examples Graphs A graph is a collection of vertices (that look like dots

More information

Program Changes Communications Engineering

Program Changes Communications Engineering Department of Systems & Computer Engineering 1/12 Program Changes Communications Engineering Department of Systems and Computer Engineering, Carleton University, Canada Why Are We Here? Substantial changes

More information

The University of Jordan. Accreditation & Quality Assurance Center. Curriculum for Doctorate Degree

The University of Jordan. Accreditation & Quality Assurance Center. Curriculum for Doctorate Degree Accreditation & Quality Assurance Center Curriculum for Doctorate Degree 1. Faculty King Abdullah II School for Information Technology 2. Department Computer Science الدكتوراة في علم الحاسوب (Arabic).3

More information

Week 10: DTMC Applications Randomized Routing. Network Performance 10-1

Week 10: DTMC Applications Randomized Routing. Network Performance 10-1 Week 10: DTMC Applications Randomized Routing Network Performance 10-1 Random Walk: Probabilistic Routing Random neighbor selection e.g. in ad-hoc/sensor network due to: Scalability: no routing table (e.g.

More information

Bowie State University

Bowie State University Bowie State University Department of Mathematics Master of Science in Applied and Computational Mathematics Certificate in Applied and Computational Mathematics Department of Mathematics Crawford Science

More information

Samuel Coolidge, Dan Simon, Dennis Shasha, Technical Report NYU/CIMS/TR

Samuel Coolidge, Dan Simon, Dennis Shasha, Technical Report NYU/CIMS/TR Detecting Missing and Spurious Edges in Large, Dense Networks Using Parallel Computing Samuel Coolidge, sam.r.coolidge@gmail.com Dan Simon, des480@nyu.edu Dennis Shasha, shasha@cims.nyu.edu Technical Report

More information

The PageRank Citation Ranking

The PageRank Citation Ranking October 17, 2012 Main Idea - Page Rank web page is important if it points to by other important web pages. *Note the recursive definition IR - course web page, Brian home page, Emily home page, Steven

More information

Agenda. Math Google PageRank algorithm. 2 Developing a formula for ranking web pages. 3 Interpretation. 4 Computing the score of each page

Agenda. Math Google PageRank algorithm. 2 Developing a formula for ranking web pages. 3 Interpretation. 4 Computing the score of each page Agenda Math 104 1 Google PageRank algorithm 2 Developing a formula for ranking web pages 3 Interpretation 4 Computing the score of each page Google: background Mid nineties: many search engines often times

More information

PageRank and related algorithms

PageRank and related algorithms PageRank and related algorithms PageRank and HITS Jacob Kogan Department of Mathematics and Statistics University of Maryland, Baltimore County Baltimore, Maryland 21250 kogan@umbc.edu May 15, 2006 Basic

More information

Fast Iterative Solvers for Markov Chains, with Application to Google's PageRank. Hans De Sterck

Fast Iterative Solvers for Markov Chains, with Application to Google's PageRank. Hans De Sterck Fast Iterative Solvers for Markov Chains, with Application to Google's PageRank Hans De Sterck Department of Applied Mathematics University of Waterloo, Ontario, Canada joint work with Steve McCormick,

More information

Information Networks: PageRank

Information Networks: PageRank Information Networks: PageRank Web Science (VU) (706.716) Elisabeth Lex ISDS, TU Graz June 18, 2018 Elisabeth Lex (ISDS, TU Graz) Links June 18, 2018 1 / 38 Repetition Information Networks Shape of the

More information

The Course Structure for the MCA Programme

The Course Structure for the MCA Programme The Course Structure for the MCA Programme SEMESTER - I MCA 1001 Problem Solving and Program Design with C 3 (3-0-0) MCA 1003 Numerical & Statistical Methods 4 (3-1-0) MCA 1007 Discrete Mathematics 3 (3-0-0)

More information

( It will be applied from Fall)

( It will be applied from Fall) İZMİR INSTITUTE OF TECHNOLOGY GRADUATE SCHOOL OF ENGINEERING AND SCIENCES DEPARTMENT OF COMPUTER ENGINEERING MASTER OF SCIENCE PROGRAM IN COMPUTER ENGINEERING Core Courses ECTS *CENG 590 Seminar (0-2)

More information

REGULAR GRAPHS OF GIVEN GIRTH. Contents

REGULAR GRAPHS OF GIVEN GIRTH. Contents REGULAR GRAPHS OF GIVEN GIRTH BROOKE ULLERY Contents 1. Introduction This paper gives an introduction to the area of graph theory dealing with properties of regular graphs of given girth. A large portion

More information

Web search before Google. (Taken from Page et al. (1999), The PageRank Citation Ranking: Bringing Order to the Web.)

Web search before Google. (Taken from Page et al. (1999), The PageRank Citation Ranking: Bringing Order to the Web.) ' Sta306b May 11, 2012 $ PageRank: 1 Web search before Google (Taken from Page et al. (1999), The PageRank Citation Ranking: Bringing Order to the Web.) & % Sta306b May 11, 2012 PageRank: 2 Web search

More information

Lecture 8: Linkage algorithms and web search

Lecture 8: Linkage algorithms and web search Lecture 8: Linkage algorithms and web search Information Retrieval Computer Science Tripos Part II Ronan Cummins 1 Natural Language and Information Processing (NLIP) Group ronan.cummins@cl.cam.ac.uk 2017

More information

Spectral Methods for Network Community Detection and Graph Partitioning

Spectral Methods for Network Community Detection and Graph Partitioning Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman Department of Physics, University of Michigan Presenters: Yunqi Guo Xueyin Yu Yuanqi Li 1 Outline: Community Detection

More information

Introduction to Graphical Models

Introduction to Graphical Models Robert Collins CSE586 Introduction to Graphical Models Readings in Prince textbook: Chapters 10 and 11 but mainly only on directed graphs at this time Credits: Several slides are from: Review: Probability

More information

COMP Page Rank

COMP Page Rank COMP 4601 Page Rank 1 Motivation Remember, we were interested in giving back the most relevant documents to a user. Importance is measured by reference as well as content. Think of this like academic paper

More information

Data mining --- mining graphs

Data mining --- mining graphs Data mining --- mining graphs University of South Florida Xiaoning Qian Today s Lecture 1. Complex networks 2. Graph representation for networks 3. Markov chain 4. Viral propagation 5. Google s PageRank

More information

How to organize the Web?

How to organize the Web? How to organize the Web? First try: Human curated Web directories Yahoo, DMOZ, LookSmart Second try: Web Search Information Retrieval attempts to find relevant docs in a small and trusted set Newspaper

More information

10/10/13. Traditional database system. Information Retrieval. Information Retrieval. Information retrieval system? Information Retrieval Issues

10/10/13. Traditional database system. Information Retrieval. Information Retrieval. Information retrieval system? Information Retrieval Issues COS 597A: Principles of Database and Information Systems Information Retrieval Traditional database system Large integrated collection of data Uniform access/modifcation mechanisms Model of data organization

More information

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge Centralities (4) By: Ralucca Gera, NPS Excellence Through Knowledge Some slide from last week that we didn t talk about in class: 2 PageRank algorithm Eigenvector centrality: i s Rank score is the sum

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu How to organize the Web? First try: Human curated Web directories Yahoo, DMOZ, LookSmart Second

More information

Special-topic lecture bioinformatics: Mathematics of Biological Networks

Special-topic lecture bioinformatics: Mathematics of Biological Networks Special-topic lecture bioinformatics: Leistungspunkte/Credit points: 5 (V2/Ü1) This course is taught in English language. The material (from books and original literature) are provided online at the course

More information

PageRank Algorithm Abstract: Keywords: I. Introduction II. Text Ranking Vs. Page Ranking

PageRank Algorithm Abstract: Keywords: I. Introduction II. Text Ranking Vs. Page Ranking IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 19, Issue 1, Ver. III (Jan.-Feb. 2017), PP 01-07 www.iosrjournals.org PageRank Algorithm Albi Dode 1, Silvester

More information

Link Structure Analysis

Link Structure Analysis Link Structure Analysis Kira Radinsky All of the following slides are courtesy of Ronny Lempel (Yahoo!) Link Analysis In the Lecture HITS: topic-specific algorithm Assigns each page two scores a hub score

More information

Web consists of web pages and hyperlinks between pages. A page receiving many links from other pages may be a hint of the authority of the page

Web consists of web pages and hyperlinks between pages. A page receiving many links from other pages may be a hint of the authority of the page Link Analysis Links Web consists of web pages and hyperlinks between pages A page receiving many links from other pages may be a hint of the authority of the page Links are also popular in some other information

More information

Doctoral School: József Hatvany Doctoral School for Computer Science and Engineering

Doctoral School: József Hatvany Doctoral School for Computer Science and Engineering University of Miskolc Faculty of Mechanical Engineering and Doctoral School: József Hatvany Doctoral School for Computer Science and Engineering Name of the Study Program: Computer Science and Engineering

More information

The PageRank Computation in Google, Randomized Algorithms and Consensus of Multi-Agent Systems

The PageRank Computation in Google, Randomized Algorithms and Consensus of Multi-Agent Systems The PageRank Computation in Google, Randomized Algorithms and Consensus of Multi-Agent Systems Roberto Tempo IEIIT-CNR Politecnico di Torino tempo@polito.it This talk The objective of this talk is to discuss

More information

Algebraic Graph Theory- Adjacency Matrix and Spectrum

Algebraic Graph Theory- Adjacency Matrix and Spectrum Algebraic Graph Theory- Adjacency Matrix and Spectrum Michael Levet December 24, 2013 Introduction This tutorial will introduce the adjacency matrix, as well as spectral graph theory. For those familiar

More information

THE CHINESE UNIVERSITY OF HONG KONG Department of Information Engineering

THE CHINESE UNIVERSITY OF HONG KONG Department of Information Engineering THE CHINESE UNIVERSITY OF HONG KONG Department of Information Engineering I. Suggested Study Plan for MIEG (entrants via ENGG Faculty in 08) University Core Requirements English 9 Chinese 6 Foundation

More information

Graph Theory Problem Ideas

Graph Theory Problem Ideas Graph Theory Problem Ideas April 15, 017 Note: Please let me know if you have a problem that you would like me to add to the list! 1 Classification Given a degree sequence d 1,...,d n, let N d1,...,d n

More information

Trees Rooted Trees Spanning trees and Shortest Paths. 12. Graphs and Trees 2. Aaron Tan November 2017

Trees Rooted Trees Spanning trees and Shortest Paths. 12. Graphs and Trees 2. Aaron Tan November 2017 12. Graphs and Trees 2 Aaron Tan 6 10 November 2017 1 10.5 Trees 2 Definition Definition Definition: Tree A graph is said to be circuit-free if, and only if, it has no circuits. A graph is called a tree

More information

Link Analysis. CSE 454 Advanced Internet Systems University of Washington. 1/26/12 16:36 1 Copyright D.S.Weld

Link Analysis. CSE 454 Advanced Internet Systems University of Washington. 1/26/12 16:36 1 Copyright D.S.Weld Link Analysis CSE 454 Advanced Internet Systems University of Washington 1/26/12 16:36 1 Ranking Search Results TF / IDF or BM25 Tag Information Title, headers Font Size / Capitalization Anchor Text on

More information

Introduction to Mathematical Programming IE406. Lecture 16. Dr. Ted Ralphs

Introduction to Mathematical Programming IE406. Lecture 16. Dr. Ted Ralphs Introduction to Mathematical Programming IE406 Lecture 16 Dr. Ted Ralphs IE406 Lecture 16 1 Reading for This Lecture Bertsimas 7.1-7.3 IE406 Lecture 16 2 Network Flow Problems Networks are used to model

More information

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology 9/9/ I9 Introduction to Bioinformatics, Clustering algorithms Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Outline Data mining tasks Predictive tasks vs descriptive tasks Example

More information

PageRank. CS16: Introduction to Data Structures & Algorithms Spring 2018

PageRank. CS16: Introduction to Data Structures & Algorithms Spring 2018 PageRank CS16: Introduction to Data Structures & Algorithms Spring 2018 Outline Background The Internet World Wide Web Search Engines The PageRank Algorithm Basic PageRank Full PageRank Spectral Analysis

More information

UNIVERSITY OF SOUTH ALABAMA COMPUTER SCIENCE

UNIVERSITY OF SOUTH ALABAMA COMPUTER SCIENCE UNIVERSITY OF SOUTH ALABAMA COMPUTER SCIENCE 1 Computer Science CSC 108 Intro to Computer Science 3 cr An introduction to the major areas of computer science, such as computing systems, the binary number

More information

DSCI 575: Advanced Machine Learning. PageRank Winter 2018

DSCI 575: Advanced Machine Learning. PageRank Winter 2018 DSCI 575: Advanced Machine Learning PageRank Winter 2018 http://ilpubs.stanford.edu:8090/422/1/1999-66.pdf Web Search before Google Unsupervised Graph-Based Ranking We want to rank importance based on

More information

CS281 Section 9: Graph Models and Practical MCMC

CS281 Section 9: Graph Models and Practical MCMC CS281 Section 9: Graph Models and Practical MCMC Scott Linderman November 11, 213 Now that we have a few MCMC inference algorithms in our toolbox, let s try them out on some random graph models. Graphs

More information

Heat Kernels and Diffusion Processes

Heat Kernels and Diffusion Processes Heat Kernels and Diffusion Processes Definition: University of Alicante (Spain) Matrix Computing (subject 3168 Degree in Maths) 30 hours (theory)) + 15 hours (practical assignment) Contents 1. Solving

More information

Acyclic Chromatic Number Of Central Graph

Acyclic Chromatic Number Of Central Graph Volume 118 No. 10 2018, 183-190 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu doi: 10.12732/ijpam.v118i10.60 ijpam.eu Acyclic Chromatic Number Of Central

More information

Large-Scale Networks. PageRank. Dr Vincent Gramoli Lecturer School of Information Technologies

Large-Scale Networks. PageRank. Dr Vincent Gramoli Lecturer School of Information Technologies Large-Scale Networks PageRank Dr Vincent Gramoli Lecturer School of Information Technologies Introduction Last week we talked about: - Hubs whose scores depend on the authority of the nodes they point

More information

Scale Free Network Growth By Ranking. Santo Fortunato, Alessandro Flammini, and Filippo Menczer

Scale Free Network Growth By Ranking. Santo Fortunato, Alessandro Flammini, and Filippo Menczer Scale Free Network Growth By Ranking Santo Fortunato, Alessandro Flammini, and Filippo Menczer Motivation Network growth is usually explained through mechanisms that rely on node prestige measures, such

More information

Partitioning and Partitioning Tools. Tim Barth NASA Ames Research Center Moffett Field, California USA

Partitioning and Partitioning Tools. Tim Barth NASA Ames Research Center Moffett Field, California USA Partitioning and Partitioning Tools Tim Barth NASA Ames Research Center Moffett Field, California 94035-00 USA 1 Graph/Mesh Partitioning Why do it? The graph bisection problem What are the standard heuristic

More information

Structured System Theory

Structured System Theory Appendix C Structured System Theory Linear systems are often studied from an algebraic perspective, based on the rank of certain matrices. While such tests are easy to derive from the mathematical model,

More information

Anomaly Detection in Very Large Graphs Modeling and Computational Considerations

Anomaly Detection in Very Large Graphs Modeling and Computational Considerations Anomaly Detection in Very Large Graphs Modeling and Computational Considerations Benjamin A. Miller, Nicholas Arcolano, Edward M. Rutledge and Matthew C. Schmidt MIT Lincoln Laboratory Nadya T. Bliss ASURE

More information

SECOND SEMESTER BCA : Syllabus Copy

SECOND SEMESTER BCA : Syllabus Copy BCA203T: DATA STRUCTURES SECOND SEMESTER BCA : Syllabus Copy Unit-I Introduction and Overview: Definition, Elementary data organization, Data Structures, data structures operations, Abstract data types,

More information

Using PageRank in Feature Selection

Using PageRank in Feature Selection Using PageRank in Feature Selection Dino Ienco, Rosa Meo, and Marco Botta Dipartimento di Informatica, Università di Torino, Italy fienco,meo,bottag@di.unito.it Abstract. Feature selection is an important

More information

On Page Rank. 1 Introduction

On Page Rank. 1 Introduction On Page Rank C. Hoede Faculty of Electrical Engineering, Mathematics and Computer Science University of Twente P.O.Box 217 7500 AE Enschede, The Netherlands Abstract In this paper the concept of page rank

More information

Computing the Betti Numbers of Arrangements. Saugata Basu School of Mathematics & College of Computing Georgia Institute of Technology.

Computing the Betti Numbers of Arrangements. Saugata Basu School of Mathematics & College of Computing Georgia Institute of Technology. 1 Computing the Betti Numbers of Arrangements Saugata Basu School of Mathematics & College of Computing Georgia Institute of Technology. 2 Arrangements in Computational Geometry An arrangement in R k is

More information

Stochastic Simulation: Algorithms and Analysis

Stochastic Simulation: Algorithms and Analysis Soren Asmussen Peter W. Glynn Stochastic Simulation: Algorithms and Analysis et Springer Contents Preface Notation v xii I What This Book Is About 1 1 An Illustrative Example: The Single-Server Queue 1

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu How to organize the Web? First try: Human curated Web directories Yahoo, DMOZ, LookSmart Second

More information

computational Fluid Dynamics - Prof. V. Esfahanian

computational Fluid Dynamics - Prof. V. Esfahanian Three boards categories: Experimental Theoretical Computational Crucial to know all three: Each has their advantages and disadvantages. Require validation and verification. School of Mechanical Engineering

More information

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs Advanced Operations Research Techniques IE316 Quiz 1 Review Dr. Ted Ralphs IE316 Quiz 1 Review 1 Reading for The Quiz Material covered in detail in lecture. 1.1, 1.4, 2.1-2.6, 3.1-3.3, 3.5 Background material

More information

Outline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity

Outline. Possible solutions. The basic problem. How? How? Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Outline Relevance Feedback, Query Expansion, and Inputs to Ranking Beyond Similarity Lecture 10 CS 410/510 Information Retrieval on the Internet Query reformulation Sources of relevance for feedback Using

More information

Relevance Feedback and Query Reformulation. Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price. Outline

Relevance Feedback and Query Reformulation. Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price. Outline Relevance Feedback and Query Reformulation Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price IR on the Internet, Spring 2010 1 Outline Query reformulation Sources of relevance

More information

Using Spam Farm to Boost PageRank p. 1/2

Using Spam Farm to Boost PageRank p. 1/2 Using Spam Farm to Boost PageRank Ye Du Joint Work with: Yaoyun Shi and Xin Zhao University of Michigan, Ann Arbor Using Spam Farm to Boost PageRank p. 1/2 Roadmap Introduction: Link Spam and PageRank

More information

Introduction III. Graphs. Motivations I. Introduction IV

Introduction III. Graphs. Motivations I. Introduction IV Introduction I Graphs Computer Science & Engineering 235: Discrete Mathematics Christopher M. Bourke cbourke@cse.unl.edu Graph theory was introduced in the 18th century by Leonhard Euler via the Königsberg

More information

Bachelor of Science in Electrical Engineering - Computer Engineering Concentration

Bachelor of Science in Electrical Engineering - Computer Engineering Concentration Bachelor of Science in Electrical Engineering - Computer Engineering Concentration 1 Bachelor of Science in Electrical Engineering - Computer Engineering Concentration Learn more about the Bachelor of

More information

Master's Programme, Computer Science, 120 credits Masterprogram, datalogi credits

Master's Programme, Computer Science, 120 credits Masterprogram, datalogi credits Programme syllabus Master's Programme, Computer Science, 120 credits Masterprogram, datalogi 120.0 credits Valid for students admitted to the education from autumn 18 (HT - Autumn term; VT - Spring term).

More information

Loopy Belief Propagation

Loopy Belief Propagation Loopy Belief Propagation Research Exam Kristin Branson September 29, 2003 Loopy Belief Propagation p.1/73 Problem Formalization Reasoning about any real-world problem requires assumptions about the structure

More information

Let G = (V, E) be a graph. If u, v V, then u is adjacent to v if {u, v} E. We also use the notation u v to denote that u is adjacent to v.

Let G = (V, E) be a graph. If u, v V, then u is adjacent to v if {u, v} E. We also use the notation u v to denote that u is adjacent to v. Graph Adjacent Endpoint of an edge Incident Neighbors of a vertex Degree of a vertex Theorem Graph relation Order of a graph Size of a graph Maximum and minimum degree Let G = (V, E) be a graph. If u,

More information

Multigrid Methods for Markov Chains

Multigrid Methods for Markov Chains Multigrid Methods for Markov Chains Hans De Sterck Department of Applied Mathematics, University of Waterloo collaborators Killian Miller Department of Applied Mathematics, University of Waterloo, Canada

More information

Incoming, Outgoing Degree and Importance Analysis of Network Motifs

Incoming, Outgoing Degree and Importance Analysis of Network Motifs Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 6, June 2015, pg.758

More information

CONVERGENCE RANK AND ITS APPLICATIONS

CONVERGENCE RANK AND ITS APPLICATIONS D. Cirne, Int. J. of Design & Nature and Ecodynamics. Vol. 11, No. 3 (2016) 198 207 CONVERGENCE RANK AND ITS APPLICATIONS D. CIRNE mparticle Inc., U.S.A. ABSTRACT In this paper, we explore an algorithm

More information

Mathematics in Orbit

Mathematics in Orbit Mathematics in Orbit Dan Kalman American University Slides and refs at www.dankalman.net Outline Basics: 3D geospacial models Keyhole Problem: Related Rates! GPS: space-time triangulation Sensor Diagnosis:

More information

Diffusion and Clustering on Large Graphs

Diffusion and Clustering on Large Graphs Diffusion and Clustering on Large Graphs Alexander Tsiatas Thesis Proposal / Advancement Exam 8 December 2011 Introduction Graphs are omnipresent in the real world both natural and man-made Examples of

More information

Understanding Networks through Physical Metaphors. Daniel A. Spielman

Understanding Networks through Physical Metaphors. Daniel A. Spielman Understanding Networks through Physical Metaphors Daniel A. Spielman Graphs and Networks Analysis by Physical Metaphors Algorithms (what I usually do) A Social Network Graph A Social Network Graph vertex

More information

CENTRALITIES. Carlo PICCARDI. DEIB - Department of Electronics, Information and Bioengineering Politecnico di Milano, Italy

CENTRALITIES. Carlo PICCARDI. DEIB - Department of Electronics, Information and Bioengineering Politecnico di Milano, Italy CENTRALITIES Carlo PICCARDI DEIB - Department of Electronics, Information and Bioengineering Politecnico di Milano, Italy email carlo.piccardi@polimi.it http://home.deib.polimi.it/piccardi Carlo Piccardi

More information

On the construction of Tanner graphs

On the construction of Tanner graphs On the construction of Tanner graphs Jesús Martínez Mateo Universidad Politécnica de Madrid Outline Introduction Low-density parity-check (LDPC) codes LDPC decoding Belief propagation based algorithms

More information

EECS 1028 M: Discrete Mathematics for Engineers

EECS 1028 M: Discrete Mathematics for Engineers EECS 1028 M: Discrete Mathematics for Engineers Suprakash Datta Office: LAS 3043 Course page: http://www.eecs.yorku.ca/course/1028 Also on Moodle S. Datta (York Univ.) EECS 1028 W 18 1 / 15 Graphs: Motivations

More information

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS Overview of Networks Instructor: Yizhou Sun yzsun@cs.ucla.edu January 10, 2017 Overview of Information Network Analysis Network Representation Network

More information

1. BACHELOR OF SCIENCE IN COMPUTER SCIENCE EDUCATION

1. BACHELOR OF SCIENCE IN COMPUTER SCIENCE EDUCATION 1. BACHELOR OF SCIENCE IN COMPUTER SCIENCE EDUCATION ENTRANCE REQUIREMENTS The entrance requirements shall be the requirements stipulated in the Special Regulation. A. SGCSE/IGCSE Admission In addition

More information

STRUCTURES AND STRATEGIES FOR STATE SPACE SEARCH

STRUCTURES AND STRATEGIES FOR STATE SPACE SEARCH Slide 3.1 3 STRUCTURES AND STRATEGIES FOR STATE SPACE SEARCH 3.0 Introduction 3.1 Graph Theory 3.2 Strategies for State Space Search 3.3 Using the State Space to Represent Reasoning with the Predicate

More information

College of Sciences Department of Mathematics and Computer Science. Assessment Plan Mathematics

College of Sciences Department of Mathematics and Computer Science. Assessment Plan Mathematics College of Sciences Department of athematics and Computer Science Programs: Assessment Plan athematics athematics offers the following instructional programs: 1. Bachelor of Science in athematics Option

More information

PPI Network Alignment Advanced Topics in Computa8onal Genomics

PPI Network Alignment Advanced Topics in Computa8onal Genomics PPI Network Alignment 02-715 Advanced Topics in Computa8onal Genomics PPI Network Alignment Compara8ve analysis of PPI networks across different species by aligning the PPI networks Find func8onal orthologs

More information

MASTER OF SCIENCE IN COMPUTER AND INFORMATION SCIENCE

MASTER OF SCIENCE IN COMPUTER AND INFORMATION SCIENCE DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE Ping Zhang, Ph.D., Interim Department Chairperson Math and Science Building, 1 st Floor 1000 ASU Drive #30 Phone: 601-877-6430 FAX: 601-877-6631 Degrees Offered

More information

Getting Students Excited About Learning Mathematics

Getting Students Excited About Learning Mathematics Getting Students Excited About Learning Mathematics Introduction Jen Mei Chang Department of Mathematics and Statistics California State University, Long Beach jchang9@csulb.edu It wasn t so long ago when

More information