Apache Solr Beginner's Guide.

Bibliographic Details
Author / Creator: Serafini, Alfredo.
Imprint: Birmingham : Packt Publishing, 2013.
Description: 1 online resource (324 pages)
Language: English
Series: Baker & Taylor Books (Firm). Axis 360.
Format: E-Resource Book
URL for this record: http://pi.lib.uchicago.edu/1001/cat/bib/11215973
ISBN: 9781782162537
1782162534
9781782162520
1782162526
1306280141
9781306280143
Digital file characteristics: text file
Notes: English.
Print version record.
Summary: Written in a friendly, example-driven format, the book includes plenty of step-by-step instructions and examples that are designed to help you get started with Apache Solr. This book is an entry-level text into the wonderful world of Apache Solr. The book will center around a couple of simple projects such as setting up Solr and all the stuff that comes with customizing the Solr schema and configuration. This book is for developers looking to start using Apache Solr who are stuck or intimidated by the difficulty of setting it up and using it. For anyone wanting to embed a search engine in their.
Other form: Print version: Serafini, Alfredo. Apache Solr Beginner's Guide. Birmingham : Packt Publishing, ©2013 9781782162520
Table of Contents:
  • Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface
  • Chapter 1: Getting Ready with the Essentials; Understanding Solr; Learning the powerful aspects of Solr; Working with Java installation; Downloading and installing Java; Configuring CLASSPATH and PATH variables for Java; Installing and testing Solr; Time for action - starting Solr for the first time; Taking a glance at the Solr interface; Time for action - posting some example data; Time for action - testing Solr with cURL; Who uses Solr?; Resources on Solr; How will we use Solr?; Summary
  • Chapter 2: Indexing with Local PDF Files; Understanding and using an index; Posting example documents to the first Solr core; Analyzing the elements we need in Solr core; Time for action - configuring Solr Home and Solr core discovery; Knowing the legacy solr.xml format; Time for action - writing a simple solrconfig.xml file; Time for action - writing a simple schema.xml file; Time for action - starting the new core; Time for action - defining an example document; Time for action - indexing an example document with cURL; Executing the first search on the new core; Adding documents to the index from the web UI; Time for action - updating an existing document; Time for action - cleaning an index; Creating an index prototype from PDF files; Time for action - defining the schema.xml file with only dynamic fields and tokenization; Time for action - writing a simple solrconfig.xml file with an update handler; Testing the PDF file core with dummy data and an example query; Defining a new tokenized field for fulltext; Time for action - using Tika and cURL to extract text from PDFs; Using cURL to index some PDF data; Time for action - finding copies of the same files with deduplication; Time for action - looking inside an index with SimpleTextCodec; Understanding the structure of an inverted index; Understanding how optimization affects the segments of an index; Writing the full configuration for our PDF index example; Writing the solrconfig.xml file; Writing the schema.xml file; Summarizing some easy recipes for the maintenance of an index; Summary
  • Chapter 3: Indexing Example Data from DBPedia - Paintings; Harvesting paintings' data from DBPedia; Analyzing the entities that we want to index; Analyzing the first entity - Painting; Writing Solr core configurations for the first tests; Time for action - defining the basic solrconfig.xml file; Looking at the differences between commits and soft commits; Time for action - defining the simple schema.xml file; Introducing analyzers, tokenizers, and filters; Thinking fields for atomic updates; Indexing a test entity with JSON; Understanding the update chain; Using the atomic update; Understanding how optimistic concurrency works; Time for action - listing all the fields with the CSV output; Defining a new Solr core for our Painting entity; Time for action - refactoring the schema.xml file for the paintings core by introducing tokenization and stop words.