rpaf ktl Pen Apache Solr 3 Enterprise Search Server J community exp<= highlighting, relevancy ranked sorting, and more source publishing""
|
|
- Adela Norman
- 5 years ago
- Views:
Transcription
1 Apache Solr 3 Enterprise Search Server Enhance your search with faceted navigation, result highlighting, relevancy ranked sorting, and more David Smiley Eric Pugh rpaf ktl Pen I I riv IV I J community exp<= publishing"" - birmingham mumbai source experience distilled
2 Preface 1 Chapter 1: Quick Starting Solr 7 An introduction to Solr 7 Lucene, the underlying engine 8 Solr, a Lucene-based search server 9 Comparison to database technology 10 Getting started 11 Solr's installation directory structure 12 Solr's home directory and Solr cores 14 Running Solr 15 A quick tour of Solr 16 Loading sample data 18 A simple query 20 Some statistics 23 The sample browse interface 24 Configuration files 25 Resources outside this book 27 Summary 28 Chapter 2: Schema and Text Analysis 29 MusicBrainz.org 30 One combined index or separate indices 31 One combined index 32 Problems with using a single combined index 33 Separate indices 34 Schema design 35 Step 1: Determine which searches are going to be powered by Solr 36 Step 2: Determine the entities returned from each search 36 Step 3: Denormalize related data 37
3 Denormalizing 'one-to-one' associated data 37 Denormalizing 'one-to-many' associated data 38 Step 4: (Optional) Omit the inclusion of fields only used in search results 39 The schema.xml file 40 Defining field types 41 Built-in field type classes 42 Numbers and dates 42 Geospatial 43 Field options 43 Field definitions 44 Dynamic field definitions 45 Our MusicBrainz field definitions 46 Copying fields 48 The unique key 49 The default search field and query operator 49 Text analysis 50 Configuration 51 Experimenting with text analysis 54 Character filters 55 Tokenization 57 WordDelimiterFilter 59 Stemming 61 Correcting and augmenting stemming 62 Synonyms 63 Index-time versus query-time, and to expand or not 64 Stop words 65 Phonetic sounds-like analysis 66 Substring indexing and wildcards 67 ReversedWildcardFilter 68 N-grams 69 N-gram costs 70 Sorting Text 71 Miscellaneous token filters 72 Summary 73 Chapter 3: Indexing Data 75 Communicating with Solr 76 Direct HTTP or a convenient client API 76 Push data to Solr or have Solr pull it 76 Data formats 76 HTTP POSTing options to Solr 77 Remote streaming 79 Solr's Update-XML format 80
4 Deleting documents 81 Commit, optimize, and rollback 82 Sending CSV formatted data to Solr 84 Configuration options 86 The Data Import Handler Framework 87 Setup 88 The development console 89 Writing a DIH configuration file 90 Data Sources 90 Entity processors 91 Fields and transformers 92 Example DIH configurations 94 Importing from databases 94 Importing XML from a file with XSLT 96 Importing multiple rich document files (crawling) 97 Importing commands 98 Delta imports 99 Indexing documents with Solr Cell 100 Extracting text and metadata from files 100 Configuring Solr 101 Solr Cell parameters 102 Extracting karaoke lyrics 104 Indexing richer documents 106 Update req uest processors 109 Summary 110 Chapter 4: Searching 111 Your first search, a walk-through 112 Solr's generic XML structured data representation 114 Solr's XML response format 115 Parsing the URL 116 Request handlers 117 Query parameters 119 Search criteria related parameters 119 Result pagination related parameters 120 Output related parameters 121 Diagnostic related parameters 121 Query parsers and local-params 122 Query syntax (the lucene query parser) 123 Matching all the documents 125 Mandatory, prohibited, and optional clauses 125 Boolean operators 126 Sub-queries 127
5 Limitations of prohibited clauses in sub-queries 128 Field qualifier 128 Phrase queries and term proximity 129 Wildcard queries 129 Fuzzy queries 131 Range queries 131 Date math 132 Score boosting 133 Existence (and non-existence) queries 134 Escaping special characters 134 The Dismax query parser (parti) 135 Searching multiple fields 137 Limited query syntax 137 Min-should-match 138 Basic rules 138 Multiple rules 139 What to choose 140 A default search 140 Filtering 141 Sorting 142 Geospatial search 143 Indexing locations 143 Filtering by distance 144 Sorting by distance 145 Summary 146 Chapter 5: Search Relevancy 147 Scoring 148 Query-time and index-time boosting 149 Troubleshooting queries and scoring 149 Dismax query parser (part 2) 151 Lucene's DisjunctionMaxQuery 152 Boosting: Automatic phrase boosting 153 Configuring automatic phrase boosting 153 Phrase slop configuration 154 Partial phrase boosting 154 Boosting: Boost queries 155 Boosting: Boost functions 156 Add or multiply boosts? 157 Function queries 158 Field references 159 Function reference 160 Mathematical primitives 161 Other math 161
6 Table ofcontents ord and rord 162 Miscellaneous functions 162 Function query boosting 164 Formula: Logarithm 164 Formula: Inverse reciprocal 165 Formula: Reciprocal 167 Formula: Linear 168 How to boost based on an increasing numeric field 168 Step by step External field values 170 How to boost based on recent dates 170 Step by step Summary 171 Chapter 6: Faceting 173 A quick example: Faceting release types 174 MusicBrainz schema changes 176 Field requirements 178 Types of faceting 178 Faceting field values 179 Alphabetic range bucketing 181 Faceting numeric and date ranges 182 Range facet parameters 185 Facet queries 187 Building a filter query from a facet 188 Field value filter queries 189 Facet range filter queries 189 Excluding filters (multi-select faceting) 190 Hierarchical faceting 194 Summary 196 Chapter 7: Search Components 197 About components 198 The Highlight component 200 A highlighting example 200 Highlighting configuration 202 The regex fragmenter 205 The fast vector highlighter with multi-colored highlighting 205 The SpellCheck component 207 Schema configuration 208 Configuration in solrconfig.xml 209 Configuring spellcheckers (dictionaries) 211 Processing of the q parameter 213 Processing of the spellcheck.q parameter 213 Building the dictionary from its source 214
7 Issuing spellcheck requests 215 Example usage for a misspelled query 217 Query complete / suggest 219 Query term completion via facet.prefix 221 Query term completion via the Suggester 223 Query term completion via the Terms component 226 The QueryElevation component 227 Configuration 228 The MoreLikeThis component 230 Configuration parameters 231 Parameters specific to the MLT search component 231 Parameters specific to the MLT request handler 231 Common MLT parameters 232 MLT results example 234 The Stats component 236 Configuring the stats component 237 Statistics on track durations 237 The Clustering component 238 Result grouping/field collapsing 239 Configuring result grouping 241 The TermVector component 243 Summary 243 Chapter 8: Deployment 245 Deployment methodology for Solr 245 Questions to ask 246 Installing Solr into a Servlet container 247 Differences between Servlet containers 248 Defining solr.home property 248 ' Logging 249 HTTP server request access logs 250 Solr application logging 251 Configuring logging output 252 Logging using Log4j 253 Jetty startup integration 253 Managing log levels at runtime 254 A SearchHandler per search interface? 254 Leveraging Solr cores 256 Configuring solr.xml 256 Property substitution 258 Include fragments of XML with Xlnclude 259 Managing cores 259 Why use multicore? 261
8 Monitoring Soir performance 262 Stats.jsp 263 JMX 264 Starting Soir with JMX 265 Securing Soir from prying eyes 270 Limiting server access 270 Securing public searches 272 Controlling JMX access 273 Securing index data 273 Controlling document access 273 Other things to look at 274 Summary 275 Chapter 9: Integrating Soir 277 Working with included examples 278 Inventory of examples 278 Solritas, the integrated search Ul 279 Pros and Cons of Solritas 281 SolrJ: Simple Java interface 283 Using Heritrix to download artist pages 283 SolrJ-based client for Indexing HTML 285 SolrJ client API 287 Embedding Soir 288 Searching with SolrJ 289 Indexing 290 When should I use embedded Soir? 294 In-process indexing 294 Standalone desktop applications 295 Upgrading from legacy Lucene 295 Using JavaScript with Soir 296 Wait, what about security? 297 Building a Soir powered artists autocomplete widget with jquery and JSONP 298 AJAX Soir 303 Using XSLT to expose Soir via OpenSearch 305 OpenSearch based Browse plugin 306 Installing the Search MB Artists plugin 306 Accessing Soir from PHP applications 309 solr-php-client 310 Drupal options 311 Apache Soir Search integration module 312 Hosted Soir by Acquia 312 Ruby on Rails integrations 313 The Ruby query response writer 313
9 sunspot_rails gem 314 Setting up MyFaves project 315 Populating MyFaves relational database from Solr 316 Build Solr indexes from a relational database 318 Complete MyFaves website 320 Which Rails/Ruby library should I use? 322 Nutch for crawling web pages 323 Maintaining document security with ManifoldCF 324 Connectors 325 Putting ManifoldCF to use 325 Summary 328 Chapter 10: Scaling Solr 329 Tuning complex systems 330 Testing Solr performance with SolrMeter 332 Optimizing a single Solr server (Scale up) 334 Configuring JVM settings to improve memory usage 334 MMapDirectoryFactory to leverage additional virtual memory 335 Enabling downstream HTTP caching 335 Solr caching 338 Tuning caches 339 Indexing performance 340 Designing the schema 340 Sending data to Solr in bulk 341 Don't overlap commits 342 Disabling unique key checking 343 Index optimization factors 343 Enhancing faceting performance 345 Using term vectors 345 Improving phrase search performance 346 Moving to multiple Solr servers (Scale horizontally) 348 Replication 349 Starting multiple Solr servers 349 Configuring replication 351 Load balancing searches across slaves 352 Indexing into the master server 352 Configuring slaves 353 Configuring load balancing 354 Sharding indexes 356 Assigning documents to shards 357 Searching across shards (distributed search) 358 Combining replication and sharding (Scale deep) 360 Near real time search 362 Where next for scaling Solr? 363 Summary 364
10 Appendix: Search Quick Reference 365 Quick reference 366 Index 369 [ix]
Soir 1.4 Enterprise Search Server
Soir 1.4 Enterprise Search Server Enhance your search with faceted navigation, result highlighting, fuzzy queries, ranked scoring, and more David Smiley Eric Pugh *- PUBLISHING -J BIRMINGHAM - MUMBAI Preface
More informationEPL660: Information Retrieval and Search Engines Lab 3
EPL660: Information Retrieval and Search Engines Lab 3 Παύλος Αντωνίου Γραφείο: B109, ΘΕΕ01 University of Cyprus Department of Computer Science Apache Solr Popular, fast, open-source search platform built
More informationAn Application for Monitoring Solr
An Application for Monitoring Solr Yamin Alam Gauhati University Institute of Science and Technology, Guwahati Assam, India Nabamita Deb Gauhati University Institute of Science and Technology, Guwahati
More informationTechnical Deep Dive: Cassandra + Solr. Copyright 2012, Think Big Analy7cs, All Rights Reserved
Technical Deep Dive: Cassandra + Solr Confiden7al Business case 2 Super scalable realtime analytics Hadoop is fantastic at performing batch analytics Cassandra is an advanced column family oriented system
More informationRelevancy Workbench Module. 1.0 Documentation
Relevancy Workbench Module 1.0 Documentation Created: Table of Contents Installing the Relevancy Workbench Module 4 System Requirements 4 Standalone Relevancy Workbench 4 Deploy to a Web Container 4 Relevancy
More informationA short introduction to the development and evaluation of Indexing systems
A short introduction to the development and evaluation of Indexing systems Danilo Croce croce@info.uniroma2.it Master of Big Data in Business SMARS LAB 3 June 2016 Outline An introduction to Lucene Main
More informationEnterprise Search with ColdFusion Solr. Dan Sirucek cf.objective 2012 May 2012
Enterprise Search with ColdFusion Solr Dan Sirucek cf.objective 2012 May 2012 About Me Senior Learning Technologist at WellPoint, Inc Developer for 14 years Developing in ColdFusion for 8 years Started
More informationBattle of the Giants Apache Solr 4.0 vs ElasticSearch 0.20 Rafał Kuć sematext.com
Battle of the Giants Apache Solr 4.0 vs ElasticSearch 0.20 Rafał Kuć Sematext International @kucrafal @sematext sematext.com Who Am I Solr 3.1 Cookbook author (4.0 inc) Sematext consultant & engineer Solr.pl
More informationImproving Drupal search experience with Apache Solr and Elasticsearch
Improving Drupal search experience with Apache Solr and Elasticsearch Milos Pumpalovic Web Front-end Developer Gene Mohr Web Back-end Developer About Us Milos Pumpalovic Front End Developer Drupal theming
More informationApache Solr Cookbook. Apache Solr Cookbook
Apache Solr Cookbook i Apache Solr Cookbook Apache Solr Cookbook ii Contents 1 Apache Solr Tutorial for Beginners 1 1.1 Why Apache Solr................................................... 1 1.2 Installing
More informationMastering phpmyadmiri 3.4 for
Mastering phpmyadmiri 3.4 for Effective MySQL Management A complete guide to getting started with phpmyadmin 3.4 and mastering its features Marc Delisle [ t]open so 1 I community experience c PUBLISHING
More informationApache Lucene - Query Parser Syntax
Peter Carlson Table of contents 1 Overview...2 2 Terms... 2 3 Fields...3 4 Term Modifiers... 3 4.1 Wildcard Searches... 3 4.2 Fuzzy Searches... 4 4.3 Proximity Searches...4 4.4 Range Searches...4 4.5 Boosting
More informationSearch and Time Series Databases
Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Search and Time Series Databases Corso di Sistemi e Architetture per Big Data A.A. 2016/17 Valeria
More informationRoad to Auto Scaling
Road to Auto Scaling Varun Thacker Lucidworks Apache Lucene/Solr Committer, and PMC member Agenda APIs Metrics Recipes Auto-Scale Triggers SolrCloud Overview ZooKee per Lots Shard 1 Leader Shard 3 Replica
More informationSearch Engines and Time Series Databases
Università degli Studi di Roma Tor Vergata Dipartimento di Ingegneria Civile e Ingegneria Informatica Search Engines and Time Series Databases Corso di Sistemi e Architetture per Big Data A.A. 2017/18
More informationTaming Text. How to Find, Organize, and Manipulate It MANNING GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS. Shelter Island
Taming Text How to Find, Organize, and Manipulate It GRANT S. INGERSOLL THOMAS S. MORTON ANDREW L. KARRIS 11 MANNING Shelter Island contents foreword xiii preface xiv acknowledgments xvii about this book
More informationRealtime visitor analysis with Couchbase and Elasticsearch
Realtime visitor analysis with Couchbase and Elasticsearch Jeroen Reijn @jreijn #nosql13 About me Jeroen Reijn Software engineer Hippo @jreijn http://blog.jeroenreijn.com About Hippo Visitor Analysis OneHippo
More informationLAB 7: Search engine: Apache Nutch + Solr + Lucene
LAB 7: Search engine: Apache Nutch + Solr + Lucene Apache Nutch Apache Lucene Apache Solr Crawler + indexer (mainly crawler) indexer + searcher indexer + searcher Lucene vs. Solr? Lucene = library, more
More informationCollective Intelligence in Action
Collective Intelligence in Action SATNAM ALAG II MANNING Greenwich (74 w. long.) contents foreword xv preface xvii acknowledgments xix about this book xxi PART 1 GATHERING DATA FOR INTELLIGENCE 1 "1 Understanding
More informationAlfresco Developer Guide
Alfresco Developer Guide Customizing Alfresco with actions, web scripts, web forms, workflows, and more Jeff Potts - PUBLISHING - 1 BIRMINGHAM - MUMBAI Preface Chapter 1: The Alfresco Platform 7 Alfresco
More informationGoal of this document: A simple yet effective
INTRODUCTION TO ELK STACK Goal of this document: A simple yet effective document for folks who want to learn basics of ELK (Elasticsearch, Logstash and Kibana) without any prior knowledge. Introduction:
More informationClick to add text IBM Collaboration Solutions
IBM Connections Search: Troubleshooting and Best Practices 5/14/2014 Greg Presayzen Client Technical Professional Mark McCarville Advisory Software Engineer Click to add text IBM Collaboration Solutions
More informationApache Solr Reference Guide. Covering Apache Solr 4.5
Apache Solr Reference Guide Covering Apache Solr 4.5 Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for
More informationOracle Fusion Middleware 11g: Build Applications with ADF I
Oracle University Contact Us: +966 1 1 2739 894 Oracle Fusion Middleware 11g: Build Applications with ADF I Duration: 5 Days What you will learn This course is aimed at developers who want to build Java
More informationWeb scraping. Donato Summa. 3 WP1 face to face meeting September 2017 Thessaloniki (EL)
Web scraping Donato Summa Summary Web scraping : Specific vs Generic Web scraping phases Web scraping tools Istat Web scraping chain Summary Web scraping : Specific vs Generic Web scraping phases Web scraping
More informationFusion Registry 9 SDMX Data and Metadata Management System
Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance
More informationParallel SQL and Streaming Expressions in Apache Solr 6. Shalin Shekhar Lucidworks Inc.
Parallel SQL and Streaming Expressions in Apache Solr 6 Shalin Shekhar Mangar @shalinmangar Lucidworks Inc. Introduction Shalin Shekhar Mangar Lucene/Solr Committer PMC Member Senior Solr Consultant with
More informationQuery Parsing. Presented by Erik Hatcher 27 February 2013
Query Parsing Presented by Erik Hatcher 27 February 2013 1 Description Interpreting what the user meant and what they ideally would like to find is tricky business. This talk will cover useful tips and
More informationBuilding Search Applications
Building Search Applications Lucene, LingPipe, and Gate Manu Konchady Mustru Publishing, Oakton, Virginia. Contents Preface ix 1 Information Overload 1 1.1 Information Sources 3 1.2 Information Management
More informationOracle Fusion Middleware 11g: Build Applications with ADF Accel
Oracle University Contact Us: +352.4911.3329 Oracle Fusion Middleware 11g: Build Applications with ADF Accel Duration: 5 Days What you will learn This is a bundled course comprising of Oracle Fusion Middleware
More informationOracle APEX 18.1 New Features
Oracle APEX 18.1 New Features May, 2018 Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
More informationOracle Fusion Middleware 11g: Build Applications with ADF I
Oracle University Contact Us: Local: 1800 103 4775 Intl: +91 80 4108 4709 Oracle Fusion Middleware 11g: Build Applications with ADF I Duration: 5 Days What you will learn Java EE is a standard, robust,
More informationApache Lucene 4. Robert Muir
Apache Lucene 4 Robert Muir Agenda Overview of Lucene Conclusion Resources Q & A Download of Lucene: core/ analysis/ queryparser/ highlighter/ suggest/ expressions/ join/ memory/ codecs/... core/ Lucene
More informationCourse Content MongoDB
Course Content MongoDB 1. Course introduction and mongodb Essentials (basics) 2. Introduction to NoSQL databases What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL
More informationWorkbench User's Guide
IBM Initiate Workbench User's Guide Version9Release7 SC19-3167-06 IBM Initiate Workbench User's Guide Version9Release7 SC19-3167-06 Note Before using this information and the product that it supports,
More informationRed Hat JBoss Data Grid 7.1 Migration Guide
Red Hat JBoss Data Grid 7.1 Migration Guide For Use with JBoss Data Grid 7.1 Red Hat Customer Content Services Red Hat JBoss Data Grid 7.1 Migration Guide For Use with JBoss Data Grid 7.1 Legal Notice
More informationHibernate Search Googling your persistence domain model. Emmanuel Bernard Doer JBoss, a division of Red Hat
Hibernate Search Googling your persistence domain model Emmanuel Bernard Doer JBoss, a division of Red Hat Search: left over of today s applications Add search dimension to the domain model Frankly, search
More informationWeb Applications. Software Engineering 2017 Alessio Gambi - Saarland University
Web Applications Software Engineering 2017 Alessio Gambi - Saarland University Based on the work of Cesare Pautasso, Christoph Dorn, Andrea Arcuri, and others ReCap Software Architecture A software system
More informationOpen Source Search. Andreas Pesenhofer. max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria
Open Source Search Andreas Pesenhofer max.recall information systems GmbH Künstlergasse 11/1 A-1150 Wien Austria max.recall information systems max.recall is a software and consulting company enabling
More informationMEAP Edition Manning Early Access Program Solr in Action version 1
MEAP Edition Manning Early Access Program Solr in Action version 1 Copyright 2012 Manning Publications For more information on this and other Manning titles go to www.manning.com brief contents PART 1:
More informationElasticsearch Search made easy
Elasticsearch Search made easy Alexander Reelsen Agenda Why is search complex? Installation & initial setup Importing data Searching data Replication & Sharding Plugin-based
More informationChapter 2. Architecture of a Search Engine
Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them
More informationFinal Report CS 5604 Fall 2016
Final Report CS 5604 Fall 2016 Solr Team CS 5604: Information Storage and Retrieval Instructor: Dr. Edward A. Fox Liuqing Li, Anusha Pillai, Ke Tian, Ye Wang {liuqing, anusha89, ketian, yewang16} @vt.edu
More informationNo Schema Type For Mysql Type Date Drupal
No Schema Type For Mysql Type Date Drupal I made a custom entity with a date field stored as datetime in mysql. It is important that your data is represented, as documented for your data type, e.g. a date
More informationThinking Beyond Search with Solr Understanding How Solr Can Help Your Business Scale. Magento Expert Consulting Group Webinar July 31, 2013
Thinking Beyond Search with Solr Understanding How Solr Can Help Your Business Scale Magento Expert Consulting Group Webinar July 31, 2013 The presenters Magento Expert Consulting Group Udi Shamay Head,
More informationSharePoint 2013 Search Inside Out
SharePoint 2013 Search Inside Out 55037; 5 Days, Instructor-led Course Description This 5-day course will instruct on how to create simple and advanced search topologies. How to configure the various search
More informationX100 ARCHITECTURE REFERENCES:
UNION SYSTEMS GLOBAL This guide is designed to provide you with an highlevel overview of some of the key points of the Oracle Fusion Middleware Forms Services architecture, a component of the Oracle Fusion
More informationVK Multimedia Information Systems
VK Multimedia Information Systems Mathias Lux, mlux@itec.uni-klu.ac.at This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Results Exercise 01 Exercise 02 Retrieval
More informationfpackfl Drupal 6 JavaScript and jquery L PUBLISHING Putting jquery, AJAX, and JavaScript effects into your Drupal 6 modules and themes Matt Butcher
Drupal 6 JavaScript and jquery Putting jquery, AJAX, and JavaScript effects into your Drupal 6 modules and themes Matt Butcher fpackfl L PUBLISHING -I BIRMINGHAM - MUMBAI Preface 1 Chapter 1: Drupal and
More informationDigital Factory 7 Search and Query API under the hood
Digital Factory 7 Search and Query API under the hood #jahiaone Benjamin Papež, QA Architect Search and Query API under the hood Overview on used search engine frameworks and API Jahia's extensions to
More informationcominvent as Migrating FAST to Solr by Jan Høydahl cominvent as Enterprise Search Specialists
Enterprise Search Specialists Migrating FAST to Solr by Jan Høydahl Consulting Cominvent delivers independent search consulting Focus on Apache Lucene/Solr & Microsoft FAST ESP We know both the proprietary
More informationAdobe Experience Manager
Adobe Experience Manager Extend and Customize Adobe Experience Manager v6.x Student Guide: Volume 1 Contents CHAPTER ONE: BASICS OF THE ARCHITECTURAL STACK... 10 What is Adobe Experience Manager?... 10
More informationGoogle Search Appliance
Google Search Appliance Getting the Most from Your Google Search Appliance Google Search Appliance software version 7.4 Google, Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 www.google.com GSA-QS_200.03
More informationMicrosoft. Inside Microsoft. SharePoint Ted Pattison. Andrew Connell. Scot Hillier. David Mann
Microsoft Inside Microsoft SharePoint 2010 Ted Pattison Andrew Connell Scot Hillier David Mann ble of Contents Foreword Acknowledgments Introduction xv xvii xix 1 SharePoint 2010 Developer Roadmap 1 SharePoint
More informationFusing Corporate Thesaurus Management with Linked Data using PoolParty
Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl PoolParty at a glance Developed by punkt. netservices Current release: PoolParty 2.8 Main focus on three application
More informationmysolr Documentation Release Rubén Abad, Miguel Olivares
mysolr Documentation Release 0.8.2 Rubén Abad, Miguel Olivares June 05, 2014 Contents 1 Basic Usage 3 2 Contents 5 2.1 Installation................................................ 5 2.2 User Guide................................................
More informationEPL660: Information Retrieval and Search Engines Lab 8
EPL660: Information Retrieval and Search Engines Lab 8 Παύλος Αντωνίου Γραφείο: B109, ΘΕΕ01 University of Cyprus Department of Computer Science What is Apache Nutch? Production ready Web Crawler Operates
More informationFAST Enterprise Search Platform
FAST Enterprise Search Platform version:5.2 Product Overview Guide Document Number: ESP1000, Document Revision: A, April 3, 2008 Copyright Copyright 1997-2008 by Fast Search & Transfer ASA ( FAST ). Some
More informationNoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE. Nicolas Buchschacher - University of Geneva - ADASS 2018
NoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE DACE https://dace.unige.ch Data and Analysis Center for Exoplanets. Facility to store, exchange and analyse data
More informationNYC Apache Lucene/Solr Meetup
June 11, 2014 NYC Apache Lucene/Solr Meetup RAMP UP YOUR WEB EXPERIENCES USING DRUPAL AND APACHE SOLR peter.wolanin@acquia.com drupal.org/user/49851 (pwolanin) Peter Wolanin Momentum Specialist @ Acquia,
More informationBUILDING A WEBSITE FOR THE NUMBER ONE CHILDREN S HOSPITAL IN THE U.S. May 10, 2011
BUILDING A WEBSITE FOR THE NUMBER ONE CHILDREN S HOSPITAL IN THE U.S. May 10, 2011 0 Introduction About me and NorthPoint NorthPoint is a USA-based organization Specializing in Open Source technologies
More informationBEAWebLogic Server. Introduction to BEA WebLogic Server and BEA WebLogic Express
BEAWebLogic Server Introduction to BEA WebLogic Server and BEA WebLogic Express Version 10.0 Revised: March, 2007 Contents 1. Introduction to BEA WebLogic Server and BEA WebLogic Express The WebLogic
More informationMarkLogic Server. Administrator s Guide. MarkLogic 9 May, Copyright 2017 MarkLogic Corporation. All rights reserved.
Administrator s Guide 1 MarkLogic 9 May, 2017 Last Revised: 9.0-3, September, 2017 Copyright 2017 MarkLogic Corporation. All rights reserved. Table of Contents Table of Contents Administrator s Guide 1.0
More informationForeScout Open Integration Module: Data Exchange Plugin
ForeScout Open Integration Module: Data Exchange Plugin Version 3.2.0 Table of Contents About the Data Exchange Plugin... 4 Requirements... 4 CounterACT Software Requirements... 4 Connectivity Requirements...
More informationInformation Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More informationOracle WebLogic Server 11g: Administration Essentials
Oracle University Contact Us: +33 (0) 1 57 60 20 81 Oracle WebLogic Server 11g: Administration Essentials Duration: 5 Days What you will learn This Oracle WebLogic Server 11g: Administration Essentials
More informationApache Lucene - Overview
Table of contents 1 Apache Lucene...2 2 The Apache Software Foundation... 2 3 Lucene News...2 3.1 27 November 2011 - Lucene Core 3.5.0... 2 3.2 26 October 2011 - Java 7u1 fixes index corruption and crash
More informationIndexing HTML files in Solr 1
Indexing HTML files in Solr 1 This tutorial explains how to index html files in Solr using the built-in post tool, which leverages Apache Tika and auto extracts content from html files. You should have
More informationFAST& SCALABLE SYSTEMS WITH APACHESOLR. Arnon Yogev IBM Research
FAST& SCALABLE EMAIL SYSTEMS WITH APACHESOLR Arnon Yogev IBM Research Background IBM Verse is a cloud based business email system Background cont. Verse backend is based on Apache Solr Almost every user
More informationHow to Build a Digital Library
How to Build a Digital Library Ian H. Witten & David Bainbridge Contents Preface Acknowledgements i iv 1. Orientation: The world of digital libraries 1 One: Supporting human development 1 Two: Pushing
More informationApache Lucene Eurocon: Preview
Apache Lucene Eurocon: Preview www.lucene-eurocon.org Overview Introduction Near Real Time Search: Yonik Seeley A link to download these slides will be available after the webcast is complete. An on-demand
More informationDrupal 7 Sql Schema Api Datetime
Drupal 7 Sql Schema Api Datetime See the Entity API section on "Access checking on entities", and the Node and a datetime field type. dblog: Logs and records system events to the database. User warning:
More informationThe main differences with other open source reporting solutions such as JasperReports or mondrian are:
WYSIWYG Reporting Including Introduction: Content at a glance. Create A New Report: Steps to start the creation of a new report. Manage Data Blocks: Add, edit or remove data blocks in a report. General
More informationIstat s Pilot Use Case 1
Istat s Pilot Use Case 1 Pilot identification 1 IT 1 Reference Use case X 1) URL Inventory of enterprises 2) E-commerce from enterprises websites 3) Job advertisements on enterprises websites 4) Social
More informationLanguage Support, Linguistics, and Text Analytics in Solr
Boston Apache Lucene and Solr Meetup Language Support, Linguistics, and Text Analytics in Solr Carl Steve W. Kearns Hoffman Product Manager Basis Technology Founder & CEO www.basistech.com Agenda About
More informationThe Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources
FOSS4G 2017 Boston The Billion Object Platform (BOP): a system to lower barriers to support big, streaming, spatio-temporal data sources Devika Kakkar and Ben Lewis Harvard Center for Geographic Analysis
More informationAPI Gateway Version September Key Property Store User Guide
API Gateway Version 7.5.2 15 September 2017 Key Property Store User Guide Copyright 2017 Axway All rights reserved. This documentation describes the following Axway software: Axway API Gateway 7.5.2 No
More informationForeScout CounterACT. Configuration Guide. Version 3.4
ForeScout CounterACT Open Integration Module: Data Exchange Version 3.4 Table of Contents About the Data Exchange Module... 4 About Support for Dual Stack Environments... 4 Requirements... 4 CounterACT
More informationDeveloping Applications with Java EE 6 on WebLogic Server 12c
Developing Applications with Java EE 6 on WebLogic Server 12c Duration: 5 Days What you will learn The Developing Applications with Java EE 6 on WebLogic Server 12c course teaches you the skills you need
More informationAndré Angelantoni Thanks to France Telecom for allowing me to demo their project.
+ André Angelantoni aangel@mac.com Thanks to France Telecom for allowing me to demo their project. Why Should You Consider Solr? Great search results (plus more control) Sorting Faceted Search Similar
More informationAgent-Enabling Transformation of E-Commerce Portals with Web Services
Agent-Enabling Transformation of E-Commerce Portals with Web Services Dr. David B. Ulmer CTO Sotheby s New York, NY 10021, USA Dr. Lixin Tao Professor Pace University Pleasantville, NY 10570, USA Abstract:
More informationFull-Text Indexing For Heritrix
Full-Text Indexing For Heritrix Project Advisor: Dr. Chris Pollett Committee Members: Dr. Mark Stamp Dr. Jeffrey Smith Darshan Karia CS298 Master s Project Writing 1 2 Agenda Introduction Heritrix Design
More informationHigh Performance Solr. Shalin Shekhar Mangar
High Performance Solr Shalin Shekhar Mangar Performance constraints CPU Memory Disk Network 2 Tuning (CPU) Queries Phrase query Boolean query (AND) Boolean query (OR) Wildcard Fuzzy Soundex roughly in
More informationIn this brief tutorial, we will be explaining the basics of Elasticsearch and its features.
About the Tutorial is a real-time distributed and open source full-text search and analytics engine. It is used in Single Page Application (SPA) projects. is open source developed in Java and used by many
More informationopen source community experience distilled
Alfresco 3 Business Solutions Practical implementation techniques and guidance for delivering business solutions with Alfresco Martin Bergljung [ PUBLISHING I I open source community experience distilled
More informationCERA GUI Usage. Revision History. Contents
CERA GUI Usage Revision History Revision Author Scope February-2017 DKRZ Data management Public release Contents Introduction...2 Intended Audience...2 Revision History...2 Interface...2 Browse...4 Search...6
More informationYonik Seeley 29 June 2006 Dublin, Ireland
Apache Solr Yonik Seeley yonik@apache.org 29 June 2006 Dublin, Ireland History Search for a replacement search platform commercial: high license fees open-source: no full solutions CNET grants code to
More information8KMiles Software Services, Inc
8KMiles Software Services, Inc Comparison Report TABLE OF CONTENTS Smackdown... 3 Introduction... 5 Search features 1-1 comparison... 6 Feature 1: Getting Started... 7 Feature 2: Operations and Management...
More informationEnterprise Data Catalog for Microsoft Azure Tutorial
Enterprise Data Catalog for Microsoft Azure Tutorial VERSION 10.2 JANUARY 2018 Page 1 of 45 Contents Tutorial Objectives... 4 Enterprise Data Catalog Overview... 5 Overview... 5 Objectives... 5 Enterprise
More informationrun your own search engine. today: Cablecar
run your own search engine. today: Cablecar Robert Kowalski @robinson_k http://github.com/robertkowalski Search nobody uses that, right? Services on the Market Google Bing Yahoo ask Wolfram Alpha Baidu
More informationHibernate Search: A Successful Search, a Happy User Make it Happen!
Hibernate Search: A Successful Search, a Happy User Make it Happen! Emmanuel Bernard Lead Developer at JBoss by Red Hat September 2nd 2009 1 Emmanuel Bernard Hibernate Search in Action blog.emmanuelbernard.com
More informationGovt. of Karnataka, Department of Technical Education Diploma in Computer Science & Engineering. Fifth Semester. Subject: Web Programming
Govt. of Karnataka, Department of Technical Education Diploma in Computer Science & Engineering Fifth Semester Subject: Web Programming Contact Hrs / week: 4 Total hrs: 64 Table of Contents SN Content
More informationDelivery Options: Attend face-to-face in the classroom or via remote-live attendance.
XML Programming Duration: 5 Days US Price: $2795 UK Price: 1,995 *Prices are subject to VAT CA Price: CDN$3,275 *Prices are subject to GST/HST Delivery Options: Attend face-to-face in the classroom or
More informationProfessional SharePoint 2010 Development
Professional SharePoint 2010 Development Rizzo, T ISBN-13: 9781118131688 Table of Contents INTRODUCTION xxv CHAPTER 1: INTRODUCTION TO SHAREPOINT 2010 1 What s New in the SharePoint Platform and Tools
More informationHomework 4: Comparing Search Engine Ranking Algorithms
Homework 4: Comparing Search Engine Ranking Algorithms Objectives: o o Preparation Experience using Solr Investigating ranking strategies In a previous exercise you used crawler4j to crawl a news website.
More informationFAST InStream. version 4.3 Product Overview Guide
FAST InStream version 4.3 Product Overview Guide Document Number: INS1041, Document Revision: A, May 5, 2006 Copyright 1997-2006 Fast Search & Transfer ASA ( FAST ). Some portions may be copyrighted by
More informationBuilding the News Search Engine
Building the News Search Engine Ramkumar Aiyengar Team Leader, R&D News Search, Bloomberg L.P. andyetitmoves@apache.org A technology company Our strength and focus is data The Terminal, vertical portals
More informationApplication Services for Knowledge Organisation and System Integration
www.askosi.org Application Services for Knowledge Organisation and System Integration A Short Presentation May 2010 Christophe Dupriez dupriez@askosi.org Thesauri: Take a walk on the «Why?» slide! Search
More informationCONTEXT-BASED AUTOSUGGEST ON GRAPH DATA
San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Spring 5-21-2015 CONTEXT-BASED AUTOSUGGEST ON GRAPH DATA Hai Nguyen San Jose State University Follow
More informationPROCE55 Mobile: Web API App. Web API. https://www.rijksmuseum.nl/api/...
PROCE55 Mobile: Web API App PROCE55 Mobile with Test Web API App Web API App Example This example shows how to access a typical Web API using your mobile phone via Internet. The returned data is in JSON
More information