
Lucene and Solr: The Definitive Guide
The Comprehensive Guide to Lucene and Solr for Realtime Big Data
Jason Rutherglen(Author)
O'Reilly (Publisher)
Book
Paperback/Softback
300 pages
978-1-4493-5995-9 (ISBN)
Description
With the intense interest in big data and the growing complexity of Apache Solr applications, application developers, business professionals, and end-users alike are clamoring for a more in-depth look at Apache Lucene and Solr. This comprehensive one-stop guide helps you gain a thorough understanding of Lucene's underlying architecture so you can design, implement, and tune successful Solr applications. High-speed inverted indexes are inherently difficult to develop. That's why more and more enterprises are implementing the Solr search server and Lucene Core search technology for complex text retrieval, as a NoSQL system for big data, or as a replacement for relational database systems that require horizontal scalability. With this guide's complete coverage of both Lucene and Solr, you'll get a unified view of their value and applicability to your big data projects. Learn how Lucene works from the inside out Get examples for using both Lucene and Solr APIs Configure Solr for optimal production use Learn how to use Solr with Hadoop
More details
Language
English
Place of publication
Sebastopol
United States
Target group
Professional and scholarly
Dimensions
Height: 233 mm
Width: 178 mm
ISBN-13
978-1-4493-5995-9 (9781449359959)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Person
Jason Rutherglen is a software architect at Think Big Analytics and specializes in Big Data, Hadoop, search, and security. Ryan Tabora is a Data Developer at Think Big Analytics, a big data consulting firm based out of Mountain View, CA. He acts as both a developer and as a consultant. As a consultant, he spends time working face to face with Fortune 500 clients to help uncover value from their big data. As a developer, he writes code for applications using cutting edge technology like Cassandra, Solr, HBase, and Hadoop. Applications he's written include a RESTful interface using Solr, HBase, and Hadoop to search log data and a few MapReduce applications that can be used to dynamically reindex Solr based off of an HBase table. He also spends time developing and delivering training material as part of the Think Big Academy. Previously, he worked in a big data group within the Accenture Technology labs which is a research and development group dedicated to building the vision of the firm. Aside from work, he plays bass guitar, reads comic books, and hangs out with his two dachshunds Brutus and Puck. Jack Krupansky is a freelance software developer with a Masters degree in Computer Science from Stevens Institute of Technology and over 35 years of experience developing and using a wide range of software technologies, including compilers and programming tools, graphics and graphical user interfaces, CAD/CAM, document image processing, and traditional databases, but for the past five years he has focused on the problems of Search, Discovery, Relevance, and Big Data, particularly using the Apache open source technologies of Lucene, Solr, Cassandra, and the rest of the Apache open source ecosystem. He has been a contract developer for the Northern Light search engine back in the dot-com days, Lucid Imagination (now LucidWorks) for their Enterprise Search product, and more recently at DataStax for their enterprise search product that uses Solr. He is the inventor of the Liana object-oriented programming language for GUI development and the Base Technology open source software agent server.