Synopses & Reviews
Summary
Solr in Action is a comprehensive guide to implementing scalable search using Apache Solr. This clearly written book walks you through well-documented examples ranging from basic keyword searching to scaling a system for billions of documents and queries. It will give you a deep understanding of how to implement core Solr capabilities.
About the Book
Whether you're handling big (or small) data, managing documents, or building a website, it is important to be able to quickly search through your content and discover meaning in it. Apache Solr is your tool: a ready-to-deploy, Lucene-based, open source, full-text search engine. Solr can scale across many servers to enable real-time queries and data analytics across billions of documents.
Solr in Action teaches you to implement scalable search using Apache Solr. This easy-to-read guide balances conceptual discussions with practical examples to show you how to implement all of Solr's core capabilities. You'll master topics like text analysis, faceted search, hit highlighting, result grouping, query suggestions, multilingual search, advanced geospatial and data operations, and relevancy tuning.
This book assumes basic knowledge of Java and standard database technology. No prior knowledge of Solr or Lucene is required.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
What's Inside
- How to scale Solr for big data
- Rich real-world examples
- Solr as a NoSQL data store
- Advanced multilingual, data, and relevancy tricks
- Coverage of versions through Solr 4.7
About the Authors
Trey Grainger is a director of engineering at CareerBuilder. Timothy Potter is a senior member of the engineering team at LucidWorks. The authors work on the scalability and reliability of Solr, as well as on recommendation engine and big data analytics technologies.
Table of Contents
PART 1 MEET SOLR
- Introduction to Solr
- Getting to know Solr
- Key Solr concepts
- Configuring Solr
- Indexing
- Text analysis
PART 2 CORE SOLR CAPABILITIES
- Performing queries and handling results
- Faceted search
- Hit highlighting
- Query suggestions
- Result grouping/field collapsing
- Taking Solr to production
PART 3 TAKING SOLR TO THE NEXT LEVEL
- SolrCloud
- Multilingual search
- Complex query operations
- Mastering relevancy
Synopsis
Whether handling big data, building cloud-based services, or developing multi-tenant web applications, it's vital to have a fast, reliable search solution. Apache Solr is a scalable and ready-to-deploy open-source full-text search engine powered by Lucene. It offers key features like multi-lingual keyword searching, faceted search, intelligent matching, and relevancy weighting right out of the box.
Solr in Action is the definitive guide to implementing fast and scalable search using Apache Solr 4. It uses well-documented examples ranging from basic keyword searching to scaling a system for billions of documents and queries. Readers will gain a deep understanding of how to implement core Solr capabilities such as faceted navigation through search results, matched snippet highlighting, field collapsing and search results grouping, spell checking, query auto-complete, querying by functions, and more.
Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.
About the Author
Trey Grainger manages the Search Technology Development group at CareerBuilder.com. His search experience includes handling multi-lingual content across dozens of markets/languages, machine learning, genetic algorithm and user group based relevancy tuning, geo-spatial search and validation, and work on customized payload scoring models, data mining, clustering, and recommendations. Trey is the Founder of Celiaccess.com, a gluten-free search engine, and is a frequent speaker at Lucene and Solr-related conferences.
Timothy Potter is an architect on the Big Data team at Dachis Group, where he focuses on large-scale machine learning, text mining, and social network analysis. Tim has worked extensively with Lucene and Solr technologies and has been a speaker at Lucene Revolution. He is a contributing author to Taming Text (Manning 2012) and holds several US Patents related to J2EE-based enterprise application integration. He blogs at thelabdude.blogspot.com.