Scholarly Big Data: Leverage for Science
|
|
- Chrystal Lucinda Atkinson
- 5 years ago
- Views:
Transcription
1 Scholarly Big Data: Leverage for Science C. Lee Giles The Pennsylvania State University University Park, PA, USA Funded in part by NSF, Allen Institute for Artificial Intelligence (AI2), Dow Chemical, & the Qatar Foundation.
2 What is Scholarly Big Data All academic/research documents (journal & conference papers, books, theses, TRs) Related data: Academic/researcher/group/lab web homepages Funding agency and organization grants, records, reports Research laboratories reports Patents Associated data presentations experimental data (very large) images, video, figures, tables, etc. course materials Social networks Examples: Google Scholar, Microsoft Academic Search, Publishers/repositories, CiteSeer, ArnetMiner, funding agencies, universities, Mendeley, ResearchGate, Semantic Scholar, LibGen, Sci- Hub, others
3 Scholarly Big Data Most of the data that is available in the era of scholarly big data does not look like this Or even like this It looks more like this with Semantics (tags and labels) Courtesy Lise Getoor NIPS 12
4 Where do you get this data? Web (Wayback machine, crawl - Heritrix) Repositories (arxiv, Cern, PubMed, us) Bibliographic resources (PubMed, DBLP) Funding sources/laboratories Publishers Data aggregators (Web of science) Patents API s (Microsoft Academic) How much is there & how much available?
5 Who is interested in scholarly big data Scholars, scientists/engineers Economists Policy makers Funding agencies (government, foundations, etc) Educators Social scientists Business Governments Science of Science
6 Scholarly Big Data Research Directions Data creation, management, collections Search and access, data mining and information extraction NER, entity disambiguation Data integration and linking Data integrity and cleaning Large scale experiments Knowledge discovery Collaboration and sharing Visualization Privacy & security not so much New social networks collaboration; teams sociology & policy of science Many uses of AI & machine learning Ng, ICML 2012
7 Applications of scholarly big data New discoveries, directions & trends in research DARPA Big Mechanism Scientific, technical and scholarly trends Science and technology innovation Evaluation of science, technology and scholarly investments - science of science Individual, group and organization evaluation Collaboration opportunities, building teams Moneyball for scholar/scientists
8 IARPA FUSE Program
9 IARPA FUSE Program
10 Scholarly Big Data Workshop
11
12 Big Scholar Workshop
13
14 Semantic Scholar
15 Semantic data in CiteSeerX 27
16 Automatic Metadata Information Extraction (IE) - CiteSeerX Header title, authors, affiliations, abst Table Converter IE Figure Databases Search index PDF Text Formulae Body Citations Many other open source academic document metadata extractors available recent JCDL workshop, metadata hackathon, JCDL tutorial 2016
17 Tool for entity extraction for scholarly documents - PDFMEF Wu, et.al ACM K-Cap 2015 Header Title Authors Year Conference Journal Full text Citations Filtering Figures Tables Algorithms
18 Download CiteSeerX Tools
19 Highlights of AI/ML Technologies in CiteSeerX Document Classification Document Deduplication and Citation Graph Metadata Extraction Header Extraction Citation Extraction Table Extraction Figure Extraction Algorithm Extraction Author Disambiguation Wu, et.al IAAI 2014
20 TableSeer Table extraction & search engine Liu, et al, AAAI07, JCDL06,
21 Must scale!! Efficient Large Scale Author Disambiguation CiteSeer X & PubMed Motivation Correct attribution Manually curated databases still have errors DBLP, medical records Entity disambiguation problem documents Actors, entities Determine the real identity of the authors using metadata of the research papers, including co-authors, affiliation, physical address, address, information from crawling such as host server, etc. Entity normalization Challenges Accuracy Scalability Expandability Han, et.al JCDL 2004 Huang, et.al PKDD 2006 Treeratpituk, et.al JCDL 2009 Khabsa, et.al JCDL 2015 Key features Learn distance function Random Forest others DBSCAN clustering Ameliorate labeling inconsistency (transitivity problem) Efficient solution to find name clusters N logn scaling Recently all of PubMed authors, 80M mentions
22 Chem X Seer
23 csseer.ist.psu.edu Expert search for authors H-H Chen, JCDL 2014
24 Experimental Collaborator recommendation system CollabSeer currently supports 400k authors HH Chen, JCDL 2011
25 Al-Zaidy, AAAI 2016 Figure Extraction Bar Chart User traffic increases significantly then really drops off Chart Data Extraction Data Feature Extraction Bar Chart Chart Data Values Chart structured as semantic graph Indexed text Text summaries User queries
26 Automated Figure Data Extraction and Search Large amount of results in digital documents are recorded in figures, time series, experimental results (eg., NMR spectra, income growth) Extraction for purposes of: Further modeling using presented data Indexing, meta-data creation for storage & search on figures for data reuse Current extraction done manually! Documents Extracted Plot Extracted Info. Document Index Merged Index Plot Index Digital Library User
27 Automatic Citation (or paper) Recommendation Built on millions of papers Never miss a citation and know about the latest work Several recommendations models Huang, AAAI 2015 Huang, CIKM 2013 He, WWW 2010
28 Big Data Scholarly Document Size Large # of academic/research documents, all containing a great deal of data & related semantics Many millions of documents 50M records Microsoft Academic (2013) 25M records, 10 million authors, 3 times mentions PubMed Google scholar (english) estimated to be ~100M records Total online estimate ~120M records ~25 million full documents freely available 100s of millions of authors, affiliations, locations, dates Billions of citation mentions 100s millions of tables, figures, math, formulae, etc. Related & linked data Raw data > petabytes Khabsa, Giles, PLoSONE, 14
29 Challenges Scalable methods for extraction and search Tables, figures, formula, equations, methodologies, etc. How do we effectively integrate and utilize this data for search and research? Natural language generation What does the data mean (semantics) Ontologies for scholarly data Scholarly knowledge vault(s) Big Mechanism approaches and knowledge discovery and relations Monetization?
30 The future ain t what it used to be. Yogi Berra, catcher, NY Yankees. For more information clgiles.ist.psu.edu giles@ist.psu.edu gitbhub.com/seerlabs
A Web Service for Scholarly Big Data Information Extraction
A Web Service for Scholarly Big Data Information Extraction Kyle Williams, Lichi Li, Madian Khabsa, Jian Wu, Patrick C. Shih and C. Lee Giles Information Sciences and Technology Computer Science and Engineering
More informationSemantic Scholar. ICSTI Towards a More Efficient Review of Research Literature 11 September
Semantic Scholar ICSTI Towards a More Efficient Review of Research Literature 11 September 2018 Allen Institute for Artificial Intelligence (https://allenai.org/) Non-profit Research Institute in Seattle,
More informationCITESEERX DATA: SEMANTICIZING SCHOLARLY PAPERS
CITESEERX DATA: SEMANTICIZING SCHOLARLY PAPERS Jian Wu, IST, Pennsylvania State University Chen Liang, IST, Pennsylvania State University Huaiyu Yang, EECS, Vanderbilt University C. Lee Giles, IST & CSE
More informationCiteSeer x : A Scholarly Big Dataset
CiteSeer x : A Scholarly Big Dataset Cornelia Caragea 1,4, Jian Wu 2,5, Alina Ciobanu 3,6, Kyle Williams 2,5, Juan Fernández-Ramírez 1,7, Hung-Hsuan Chen 1,5, Zhaohui Wu 1,5, and Lee Giles 1,2,5 1 Computer
More informationExtracting Algorithms by Indexing and Mining Large Data Sets
Extracting Algorithms by Indexing and Mining Large Data Sets Vinod Jadhav 1, Dr.Rekha Rathore 2 P.G. Student, Department of Computer Engineering, RKDF SOE Indore, University of RGPV, Bhopal, India Associate
More informationPANDA: A Platform for Academic Knowledge Discovery and Acquisition
PANDA: A Platform for Academic Knowledge Discovery and Acquisition Zhaoan Dong 1 ; Jiaheng Lu 2,1 ; Tok Wang Ling 3 1.Renmin University of China 2.University of Helsinki 3.National University of Singapore
More informationWeb of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION
Web of Science EXTERNAL RELEASE DOCUMENTATION Platform Release 5.27 Nina Chang Product Release Date: December 10, 2017 Document Version: 1.0 Date of issue: December 7, 2017 RELEASE OVERVIEW The following
More informationAbstract and Index and Web Discovery Services IEEE Partners
Abstract and Index and Web Discovery Services IEEE Partners Introduction This document is intended to provide a general overview of the abstract and indexing services and web discovery services that take
More informationYour Research Social Media: Leverage the Mendeley platform for your needs
Your Research Social Media: Leverage the Mendeley platform for your needs Thelmal Huang, Elsevier Email: th.huang@elsevier.com Cell Phone: 0930660745 www.mendeley.com All But a the lot time of the we state
More informationScholarly collaboration platforms
Scholarly collaboration platforms STM Meeting 22 April 2015 Washington, DC Mark Ware @mrkwr Question: Which social network do researchers know & use almost as much as Google Scholar? Source: Reprinted
More informationWho is Citing Your Work?
Who is Citing Your Work? Thursday Topic Series Office of Faculty Affairs 12 November 2009 BERNARD BECKER MEDICAL LIBRARY Washington University School of Medicine Knowing How Your Research Was Used: How
More informationOutline. Eg. 1: DBLP. Motivation. Eg. 2: ACM DL Portal. Eg. 2: DBLP. Digital Libraries (DL) often have many errors that negatively affect:
Outline Effective and Scalable Solutions for Mixed and Split Citation Problems in Digital Libraries Dongwon Lee, Byung-Won On Penn State University, USA Jaewoo Kang North Carolina State University, USA
More informationFinding Topic-centric Identified Experts based on Full Text Analysis
Finding Topic-centric Identified Experts based on Full Text Analysis Hanmin Jung, Mikyoung Lee, In-Su Kang, Seung-Woo Lee, Won-Kyung Sung Information Service Research Lab., KISTI, Korea jhm@kisti.re.kr
More informationCyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)
Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation, Integration Alan Blatecky Director OCI 1 1 Framing the
More informationMake the most of your access to ScienceDirect
1 Make the most of your access to ScienceDirect Present Future 2 ScienceDirect Training Deck We re here to help you make the most of your access to ScienceDirect. ScienceDirect offers researchers the latest
More informationNext-Generation Scholarly Discovery. Director Portfolio Strategy Microsoft Research
Next-Generation Scholarly Discovery Director Portfolio Strategy Microsoft Research Outreach. Collaboration. Innovation. http://research.microsoft.com/collaboration/ Explore over 38.8 million publications
More informationReflections on Three Decades in Internet Time
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States of America License. Reflections on Three Decades in Internet Time Christine Borgman, Paul
More informationSearchable. Readable. Relatable. E-Journal Platform for Japanese Academic Societies
J-STAGE J-GLOBAL NBDC Searchable. Readable. Relatable. E-Journal Platform for Japanese Academic Societies J-STAGE https://www.jstage.jst.go.jp/browse/-char/en/ contact@jstage.jst.go.jp J-STAGE is the e-journal
More informationApplication of Big Data Technology to Library data:a review
Application of Big Data Technology to Library data:a review A.Kaladhar Research Scholar Dept. of Library and Information Science JNTUK Kakinada (A.P) Email:librarian@svecw.edu.in and B.R. Doraswamy Naick
More informationData publication and discovery with Globus
Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,
More informationThe library s role in promoting the sharing of scientific research data
The library s role in promoting the sharing of scientific research data Katherine Akers Biomedical Research/Research Data Specialist Shiffman Medical Library Wayne State University Funding agency requirements
More informationSome Big Data Challenges
Some Big Data Challenges 2,500,000,000,000,000,000 Bytes (2.5 x 10 18 ) of data are created every day! (2012) or 8,000,000,000,000,000,000 (8 exabytes) of new data were stored globally by enterprises in
More informationANNUAL REPORT Visit us at project.eu Supported by. Mission
Mission ANNUAL REPORT 2011 The Web has proved to be an unprecedented success for facilitating the publication, use and exchange of information, at planetary scale, on virtually every topic, and representing
More informationWEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS
1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,
More informationThe Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts. Leslie Carr
The Semantic Institution: An Agenda for Publishing Authoritative Scholarly Facts Leslie Carr http://id.ecs.soton.ac.uk/people/60 What s the Web For? To share information 1. Ad hoc home pages 2. Structured
More informationSemantic Web Technology Evaluation Ontology (SWETO): A Test Bed for Evaluating Tools and Benchmarking Applications
Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 5-22-2004 Semantic Web Technology Evaluation Ontology (SWETO): A Test
More informationRole of Social Media and Semantic WEB in Libraries
Role of Social Media and Semantic WEB in Libraries By Dr. Anwar us Saeed Email: anwarussaeed@yahoo.com Layout Plan Where Library streams merge the WEB Recent Evolution of the WEB Social WEB Semantic WEB
More informationWeb of Science. Platform Release Nina Chang Product Release Date: March 25, 2018 EXTERNAL RELEASE DOCUMENTATION
Web of Science EXTERNAL RELEASE DOCUMENTATION Platform Release 5.28 Nina Chang Product Release Date: March 25, 2018 Document Version: 1.0 Date of issue: March 22, 2018 RELEASE OVERVIEW The following features
More informationNSF gateway to Scientific literature
NSF gateway to Scientific literature Workshop on Proposal Writing National Science Foundation 19 June 2012 Sunethra Perera Outline NSF Literature Local Literature at the NSF Local Literature at Other institutions
More informationAmerican Institute of Physics
American Institute of Physics (http://journals.aip.org/)* Founded in 1931, the American Institute of Physics (AIP) is a not-for-profit scholarly society established for the purpose of promoting the advancement
More informationDATA MINING II - 1DL460
DATA MINING II - 1DL460 Spring 2016 A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt16 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationGoogle indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites
Access IT Training 2003 Google indexed 3,3 billion of pages http://searchenginewatch.com/3071371 2005 Google s index contains 8,1 billion of websites http://blog.searchenginewatch.com/050517-075657 Estimated
More informationParticular experience in design and implementation of a Current Research Information System in Russia: national specificity
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 00 (2014) 000 000 www.elsevier.com/locate/procedia CRIS 2014 Particular experience in design and implementation of a Current
More informationA Service-Oriented Architecture for Digital Libraries
A Service-Oriented Architecture for Digital Libraries Yves Petinot 1,2, C. Lee Giles 1,2,3, Vivek Bhatnagar 2,3, Pradeep B. Teregowda 2, Hui Han 1,3, Isaac Councill 3 1 Department of Computer Science and
More informationChapter 50 Tracing Related Scientific Papers by a Given Seed Paper Using Parscit
Chapter 50 Tracing Related Scientific Papers by a Given Seed Paper Using Parscit Resmana Lim, Indra Ruslan, Hansin Susatya, Adi Wibowo, Andreas Handojo and Raymond Sutjiadi Abstract The project developed
More informationIntroduction to Data Management for Ocean Science Research
Introduction to Data Management for Ocean Science Research Cyndy Chandler Biological and Chemical Oceanography Data Management Office 12 November 2009 Ocean Acidification Short Course Woods Hole, MA USA
More informationScience 2.0 VU Big Science, e-science and E- Infrastructures + Bibliometric Network Analysis
W I S S E N n T E C H N I K n L E I D E N S C H A F T Science 2.0 VU Big Science, e-science and E- Infrastructures + Bibliometric Network Analysis Elisabeth Lex KTI, TU Graz WS 2015/16 u www.tugraz.at
More informationScientific databases
SCID 305 : Generic Skills in Science Research Scientific databases Suang Udomvaraphunt Academic IT Stang Monkolsuk library and Information Division Faculty of Science Stang Mongkolsuk Library http://stang.sc.mahidol.ac.th
More informationScience 2.0 VU Processing Science 2.0 Data, Content Mining
W I S S E N n T E C H N I K n L E I D E N S C H A F T Science 2.0 VU Processing Science 2.0 Data, Content Mining Elisabeth Lex KTI, TU Graz WS 2015/16 u www.tugraz.at Agenda Repetition from last time:
More informationScuola di dottorato in Scienze molecolari Information literacy in chemistry 2015 SCOPUS
SCOPUS ORIGINAL RESEARCH INFORMATION IN SCIENCE is published (stored) in PRIMARY LITERATURE it refers to the first place a scientist will communicate to the general audience in a publicly accessible document
More informationAutomatic Identification of Research Articles from Crawled Documents
Automatic Identification of Research Articles from Crawled Documents Cornelia Caragea 1,3, Jian Wu 2,4, Kyle Williams 2,4, Sujatha Das G. 1,3, Madian Khabsa 1,4, Pradeep Teregowda 1,4, and C. Lee Giles
More informationInformation Extraction from Research Papers by Data Integration and Data Validation from Multiple Header Extraction Sources
, October 24-26, 2012, San Francisco, USA Information Extraction from Research Papers by Data Integration and Data Validation from Multiple Header Extraction Sources Ozair Saleem, Seemab Latif Abstract
More informationCiteSeerX is a digital library search engine providing
CiteSeerX: AI in a Digital Library Search Engine Jian Wu, Kyle William, Hung-Hsuan Chen, Madian Khabsa, Cornelia Caragea, Suppawong Tuarob, Alexander Ororbia, Douglas Jordan,Prasenjit Mitra, C. Lee Giles
More informationXETA: extensible metadata System
XETA: extensible metadata System Abstract: This paper presents an extensible metadata system (XETA System) which makes it possible for the user to organize and extend the structure of metadata. We discuss
More informationAn Automatic Extraction of Educational Digital Objects and Metadata from institutional Websites
An Automatic Extraction of Educational Digital Objects and Metadata from institutional Websites Kajal K. Nandeshwar 1, Praful B. Sambhare 2 1M.E. IInd year, Dept. of Computer Science, P. R. Pote College
More informationColorado PROFILES. An Introduction
Colorado PROFILES An Introduction About Profiles Profiles is a research networking and expertise mining software tool. It not only shows traditional directory information, but also illustrates how each
More informationOpenAIRE Open Knowledge Infrastructure for Europe
Birgit Schmidt University of Göttingen State and University Library OpenAIRE Open Knowledge Infrastructure for Europe ERC Workshop, 6-7 February 2013, Brussels OpenAIRE Characteristics A policy driven
More informationHierarchical Location and Topic Based Query Expansion
Hierarchical Location and Topic Based Query Expansion Shu Huang 1 Qiankun Zhao 2 Prasenjit Mitra 1 C. Lee Giles 1 Information Sciences and Technology 1 AOL Research Lab 2 Pennsylvania State University
More informationBig Data Integration for Data Enthusiasts. Jayant Madhavan Structured Data Research Google Inc.
for Data Enthusiasts Jayant Madhavan Structured Data Research Google Inc. Big Data Challenge Running computations over ginormous datasets Petabytes, Exabytes, maybe more! Only one aspect of the challenge!
More informationOverview. Data-mining. Commercial & Scientific Applications. Ongoing Research Activities. From Research to Technology Transfer
Data Mining George Karypis Department of Computer Science Digital Technology Center University of Minnesota, Minneapolis, USA. http://www.cs.umn.edu/~karypis karypis@cs.umn.edu Overview Data-mining What
More informationDocument Type Classification in Online Digital Libraries
Proceedings of the Twenty-Eighth AAAI Conference on Innovative Applications (IAAI-16) Document Type Classification in Online Digital Libraries Cornelia Caragea, 1 Jian Wu, 2 Sujatha Das Gollapalli, 3 and
More informationHow to Use Google Scholar An Educator s Guide
http://scholar.google.com/ How to Use Google Scholar An Educator s Guide What is Google Scholar? Google Scholar provides a simple way to broadly search for scholarly literature. Google Scholar helps you
More informationEmpowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia
Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user
More informationMining Trusted Information in Medical Science: An Information Network Approach
Mining Trusted Information in Medical Science: An Information Network Approach Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign Collaborated with many, especially Yizhou
More informationBring Semantic Web to Social Communities
Bring Semantic Web to Social Communities Jie Tang Dept. of Computer Science, Tsinghua University, China jietang@tsinghua.edu.cn April 19, 2010 Abstract Recently, more and more researchers have recognized
More informationIntroduction to Text Mining. Hongning Wang
Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:
More informationEBP. Accessing the Biomedical Literature for the Best Evidence
Accessing the Biomedical Literature for the Best Evidence Structuring the search for information and evidence Basic search resources Starting the search EBP Lab / Practice: Simple searches Using PubMed
More informationOracle Big Data Discovery
Oracle Big Data Discovery Turning Data into Business Value Harald Erb Oracle Business Analytics & Big Data 1 Safe Harbor Statement The following is intended to outline our general product direction. It
More informationScopus Development Focus
0 Scopus Development Focus Superior support of the scientific literature research process - on finding relevant articles quickly and investigating current research relationships through citation information
More informationDATA MINING II - 1DL460
DATA MINING II - 1DL460 Spring 2012 A second course in data mining!! http://www.it.uu.se/edu/course/homepage/infoutv2/vt12 Kjell Orsborn! Uppsala Database Laboratory! Department of Information Technology,
More informationOpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond
Alessia Bardi and Paolo Manghi, Institute of Information Science and Technologies CNR Katerina Iatropoulou, ATHENA, Iryna Kuchma and Gwen Franck, EIFL Pedro Príncipe, University of Minho OpenAIRE Fostering
More informationCL Scholar: The ACL Anthology Knowledge Graph Miner
CL Scholar: The ACL Anthology Knowledge Graph Miner Mayank Singh, Pradeep Dogga, Sohan Patro, Dhiraj Barnwal, Ritam Dutt, Rajarshi Haldar, Pawan Goyal and Animesh Mukherjee Department of Computer Science
More informationBuilding Institutional Repositories: Emerging Challenges
University of Nebraska at Omaha From the SelectedWorks of Yumi Ohira 2014 Building Institutional Repositories: Emerging Challenges Yumi Ohira, University of Nebraska at Omaha Available at: https://works.bepress.com/yumi-ohira/3/
More informationSCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR
SCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR P.SHENBAGAVALLI M.E., Research Scholar, Assistant professor/cse MPNMJ Engineering college Sspshenba2@gmail.com J.SARAVANAKUMAR B.Tech(IT)., PG
More informationSearch Engines and Knowledge Graphs
Search Engines and Knowledge Graphs It s Complicated! Panos Alexopoulos Head of Ontology Who we are and what we do We develop Technology to bridge the language and meaning gap between People and Jobs...
More informationInge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license
Inge Van Nieuwerburgh OpenAIRE NOAD Belgium Tools&Services OpenAIRE EUDAT can be reused under the CC BY license Open Access Infrastructure for Research in Europe www.openaire.eu Research Data Services,
More informationMarkus Kaindl Senior Manager Semantic Data Business Owner SN SciGraph
Analytics Building business tools for the scholarly publishing domain using LOD and the ELK stack SEMANTiCS Vienna 2018 Markus Kaindl Senior Manager Semantic Data Business Owner SN SciGraph 1 Agenda (25
More informationData Management Glossary
Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative
More informationClustering using Topic Models
Clustering using Topic Models Compiled by Sujatha Das, Cornelia Caragea Credits for slides: Blei, Allan, Arms, Manning, Rai, Lund, Noble, Page. Clustering Partition unlabeled examples into disjoint subsets
More informationLink Mining & Entity Resolution. Lise Getoor University of Maryland, College Park
Link Mining & Entity Resolution Lise Getoor University of Maryland, College Park Learning in Structured Domains Traditional machine learning and data mining approaches assume: A random sample of homogeneous
More informationData science How to prepare engineers for this field
16th Workshop Software Engineering Education and Reverse Engineering, Jahorina 2016 Data science How to prepare engineers for this field Ivica Marković Department of Computer Science Faculty of Electronic
More informationErkki Tolonen
Erkki Tolonen 12.9.2017 Photograph: Eeva Rista 1974, Helsinki City Museum, https://finna.fi/record/hkm.hkms000005:km0000lqgq Finna in a nutshell The user interface of National Digital Library. One of the
More informationDigital repositories as research infrastructure: a UK perspective
Digital repositories as research infrastructure: a UK perspective Dr Liz Lyon Director This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0 UKOLN is supported by: Presentation
More informationUK Institutional Repository Search Project
UK Institutional Repository Search Project Vic Lyte 28th October 2009 Background - 2007 Growth in Institutional Repositories supported by JISC RPP; Highly variable quality of deposition re: content and
More informationConcise Summary: Detailed Summary: Comparison table: VIVO and other 19 websites. Semantic web. Service offered. Number of profiles
Page 1 of 11 comparison with VIVO Concise Summary: Among these 19 systems: - Harvard catalyst is the most complete one, providing many services, such as Medvane, SHRINE, In/out Patient Resources, Harvard
More informationAn Entity Name Systems (ENS) for the [Semantic] Web
An Entity Name Systems (ENS) for the [Semantic] Web Paolo Bouquet University of Trento (Italy) Coordinator of the FP7 OKKAM IP LDOW @ WWW2008 Beijing, 22 April 2008 An ordinary day on the [Semantic] Web
More informationState of the Art and Trends in Search Engine Technology. Gerhard Weikum
State of the Art and Trends in Search Engine Technology Gerhard Weikum (weikum@mpi-inf.mpg.de) Commercial Search Engines Web search Google, Yahoo, MSN simple queries, chaotic data, many results key is
More informationUser guide. ( Basic Search Tips
User guide Welcome to the new ProQuest search experience. ProQuest s all-new, powerful, comprehensive, and easyto-navigate search environment brings together resources from ProQuest, Cambridge Scientific
More informationThe 2018 (14th) International Conference on Data Science (ICDATA)
CALL FOR PAPERS LATE BREAKING PAPERS, POSITION PAPERS, ABSTRACTS, POSTERS Paper Submission Deadline: May 20, 2018 The 2018 (14th) International Conference on Data Science (ICDATA) (former International
More informationEfficient Name Disambiguation for Large-Scale Databases
Efficient Name Disambiguation for Large-Scale Databases Jian Huang 1,SeydaErtekin 2, and C. Lee Giles 1,2 1 College of Information Sciences and Technology The Pennsylvania State University, University
More informationHow to Guide. For Personal Users
How to Guide For Personal Users March 2016 Contents Introduction... 2 Features and Functions:... 2 Accessing UICollaboratory... 3 Home Page... 3 Homepage Key Features... 3 Collaboration Map... 4 Search
More informationCORE: Improving access and enabling re-use of open access content using aggregations
CORE: Improving access and enabling re-use of open access content using aggregations Petr Knoth CORE (Connecting REpositories) Knowledge Media institute The Open University @petrknoth 1/39 Outline 1. The
More informationOntology Based Search Engine
Ontology Based Search Engine K.Suriya Prakash / P.Saravana kumar Lecturer / HOD / Assistant Professor Hindustan Institute of Engineering Technology Polytechnic College, Padappai, Chennai, TamilNadu, India
More informationJAKUB KOPERWAS, HENRYK RYBINSKI, ŁUKASZ SKONIECZNY Institute of Computer Science, Warsaw University of Technology
JAKUB KOPERWAS, HENRYK RYBINSKI, ŁUKASZ SKONIECZNY Institute of Computer Science, Warsaw University of Technology Motivation, goals and key assumptions Main features and functionalities of Omega- Psir
More informationKnowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.
Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European
More informationOpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe
Natalia Manola University of Athens Department of Informatics and Telecommunications OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe Outline Open Access in Europe Brief history
More informationAccess Innovations, Inc.
2016. Access Innovations, Inc. All rights reserved. Welcome To DCMI Special Session: Applying Taxonomies in Publishing Leveraging Your Semantic Enrichment Investment 13 October 2016, 10:30 to 12:00 Access
More informationA Vision for Bigger Biomedical Data: Integration of REDCap with Other Data Sources
A Vision for Bigger Biomedical Data: Integration of REDCap with Other Data Sources Ram Gouripeddi Assistant Professor, Department of Biomedical Informatics, University of Utah Senior Biomedical Informatics
More informationPowering Knowledge Discovery. Insights from big data with Linguamatics I2E
Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural
More informationApplication of machine learning and big data technologies in OpenAIRE system
Application of machine learning and big data technologies in OpenAIRE system Warsztaty Orange z cyklu Centrum Badawczo Rozwojowe zaprasza Mateusz Kobos, ICM, Univeristy of Warsaw Warszawa, 2017-05-10 OpenAIRE
More informationNOW ON. Mike Takats Thomson Reuters April 30, 2013
NOW ON Mike Takats Thomson Reuters April 30, 2013 Thomson Reuters, ISI and the Web of Knowledge OVER 50 YEARS OF EXPERIENCE IN CITATION INDEXING, ANALYSIS AND METRICS In 1955, Dr. Eugene Garfield revolutionized
More informationContext Aware Computing
CPET 565/CPET 499 Mobile Computing Systems Context Aware Computing Lecture 7 Paul I-Hai Lin, Professor Electrical and Computer Engineering Technology Purdue University Fort Wayne Campus 1 Context-Aware
More informationSemantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications
Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications WWW2004 (New York, May 22, 2004) Semantic Web Track, Developers Day Boanerges
More informationA System for Searching, Extracting & Copying for Algorithm, Pseudocodes & Programs in Data
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 6.017 IJCSMC,
More informationSUMMON WEB-SCALE DISCOVERY. ADA University Baku 02/04/2014
SUMMON WEB-SCALE DISCOVERY ADA University Baku 02/04/2014 Why an Automated Management Solution is Important Academic Library Expenditures on Purchased and Licensed Content 90% 80% 70% 60% 50% 40% 30% 20%
More informationDevelopment of an Ontology-Based Portal for Digital Archive Services
Development of an Ontology-Based Portal for Digital Archive Services Ching-Long Yeh Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd. 3rd Sec. Taipei, 104, Taiwan chingyeh@cse.ttu.edu.tw
More informationA Scalable Architecture for Extracting, Aligning, Linking, and Visualizing Multi-Int Data
A Scalable Architecture for Extracting, Aligning, Linking, and Visualizing Multi-Int Data Craig Knoblock & Pedro Szekely University of Southern California Introduction Massive quantities of data available
More informationQuery Independent Scholarly Article Ranking
Query Independent Scholarly Article Ranking Shuai Ma, Chen Gong, Renjun Hu, Dongsheng Luo, Chunming Hu, Jinpeng Huai SKLSDE Lab, Beihang University, China Beijing Advanced Innovation Center for Big Data
More informationObjectives of the Webometrics Ranking of World's Universities (2016)
Objectives of the Webometrics Ranking of World's Universities (2016) The original aim of the Ranking was to promote Web publication. Supporting Open Access initiatives, electronic access to scientific
More informationSciVerse Scopus. 1. Scopus introduction and content coverage. 2. Scopus in comparison with Web of Science. 3. Basic functionalities of Scopus
Prepared by: Jawad Sayadi Account Manager, United Kingdom Elsevier BV Radarweg 29 1043 NX Amsterdam The Netherlands J.Sayadi@elsevier.com SciVerse Scopus SciVerse Scopus 1. Scopus introduction and content
More information