Found 111 results in 0.172 seconds. Displaying page 1 of 12, sorted by
LUCENE
http://www.lucidimagination.com/developer/documentation
encounter is where documents in rich formats such as PDF, MS Word/Excel/PowerPoint, etc., are stored as BLOBs in a SQL database. Your first reaction might be that this would be a lot of work, since Solr does not support such an import natively. But by using the DataImportHandler of Solr and a
http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Sear...
software foundation where Lucene is hosted and maintained. See http://www.apache.org.
D
Document
The Lucene/Solr abstract representation of one or more units of content. Typically represents a file or a database record, but is ultimately user-defined. A Document consists of one or more Fields. A Document may be boosted to indicate its importance relative to other Documents. See also Field.
http://www.lucidimagination.com/Documentation/Additional-Resources/Glossary
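The glossary entry above describes a Document as a collection of Fields with an optional boost. A minimal sketch of that data structure follows. It is an illustration only, not the Lucene or Solr API: the class names `Field` and `Document`, the `add` and `get` helpers, and the default boost of 1.0 are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Field:
    """One named piece of content inside a document (hypothetical class)."""
    name: str
    value: str


@dataclass
class Document:
    """A user-defined unit of content made of fields (hypothetical class)."""
    fields: list = field(default_factory=list)
    # Assumption for this sketch: 1.0 means "no extra weight".
    boost: float = 1.0

    def add(self, name, value):
        """Append a field and return the document so calls can be chained."""
        self.fields.append(Field(name, value))
        return self

    def get(self, name):
        """Return the values of every field with the given name."""
        return [f.value for f in self.fields if f.name == name]


# A database record modeled as a document that is weighted above the default.
record = Document(boost=2.0).add("title", "Getting Started").add("body", "How search works")
print(record.get("title"))
```

Keeping fields as a list rather than a dictionary lets the same field name appear more than once, which matches the idea that a document's contents are ultimately user-defined.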
detection and content extraction framework. Tika provides a general application programming interface that can be used to detect the content type of a document and also parse textual content and metadata from several document formats. Tika does not try to understand the full variety of different document
http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Cont...
WorksheetWorks.com is a document (PDF) generator for the K-12 education set. Its Lucene database is built from localization properties and sitemap when the system starts, so as the site grows in size, Lucene keeps up and stays current automatically. WorksheetWorks.com relies on the Extensible Document Framework (XDF), a powerful document generation framework. The document generation code runs inside a container developed for this purpose, as well as a specially configured server. It enables the creation of tiny logic plug-ins that serve specific purposes, such as math problems, illustrations, or text paragraphs. The framework lays them out on the page correctly and sends it over the Internet. This enables site developers to focus on developing new and interesting worksheet content, instead of worrying about the technical aspects of
http://www.lucidimagination.com/Community/Marketplace/Application-Showcase-Wi...
interface, and browse to the statistics page and see how many documents were just indexed.
Now we're going to do some searches with Solr. Using the URL to interface with Solr, we go to the Solr/select URL. We'll add a q parameter, which gives the text to the query
http://www.lucidimagination.com/Developers/Podcasts/Getting-Started-with-Solr...
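The snippet above describes searching by going to the Solr select URL and adding a q parameter that carries the query text. A minimal sketch of building such a URL follows. The host, port, and path in `SOLR_SELECT_URL` are assumptions about a local installation, and `build_select_url` is a hypothetical helper; adjust both to match the actual deployment.

```python
from urllib.parse import urlencode

# Assumption: a Solr instance reachable locally at this select endpoint.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_select_url(query_text, base_url=SOLR_SELECT_URL, **extra_params):
    """Return a select URL whose q parameter carries the query text.

    Any extra keyword arguments are appended as further URL parameters.
    """
    params = {"q": query_text}
    params.update(extra_params)
    # urlencode escapes spaces and reserved characters in the query text.
    return base_url + "?" + urlencode(params)


print(build_select_url("lucene search"))
```

The resulting string can be pasted into a browser or passed to any HTTP client. Encoding the parameters with `urlencode` rather than by string concatenation keeps queries that contain spaces or colons valid.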
How Search Works in Lucene Lucene Setup Getting Started with Lucene Documents and Fields
http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Gett...
Award-winning Central Desktop offers wiki-based collaboration for teams and workgroups, similar to SharePoint and Basecamp. As of January 2008, the company serves more than 200,000 users worldwide. That wikis are good places to submit and store documents is a given, but to be truly useful, users must be able to find what they are looking for. To help users sort through the noise, Lucene is used to manage a variety of document-level, full-text, and user-rights-based search technology. Central Desktop stores over 3 million documents and 1TB of raw data, with a 35GB index. Full-text search is available for Word, Excel, PowerPoint, and PDF files.
http://www.lucidimagination.com/Community/Marketplace/Application-Showcase-Wi...
The goal of DynaQ, which stands for dynamic queries for document-based, personal information spaces, is to develop an inquiry system to explore the personal information space of individual users. This desktop search engine offers enhanced usability for file, e-mail, and blog searches of all documents on a PC. The current release utilizes Java Web Start and Lucene version 2.4 to enhance memory and CPU performance. A new version of Aperture provides more stable IMAP and Web crawling. An auto-update facility generates thumbnails for nearly all document formats with the help of OpenOffice. Picture mode improves browsing in pictures and thumbnails.
http://www.lucidimagination.com/Community/Marketplace/Application-Showcase-Wi...
. You are not asked to pay unnecessary penalties for growing the breadth of your data or the count of documents to be searched. The search application should be able to grow with the requirements of the organization, without requiring big investments in hardware to match pace with that growth.
Lucene and Solr have been proven to scale in demanding environments, including billions of documents and multiple petabytes of storage.
Here are just a few examples demonstrating the scalability of Lucene and Solr:
Netflix
http://www.lucidimagination.com/solutions/search-scalability