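Egothor Full-Text Search Engine
Egothor is an efficient, open-source full-text search engine written in Java. Thanks to Java's cross-platform nature, Egothor can be used in applications in any environment: it can be configured as a standalone search engine or embedded in your own application to provide full-text search.
Key Features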
- Written in Java for cross-platform compatibility.
- New dynamization algorithm for fast index updating.
- Fully 64-bit kernel
- Transactions (ACID)
- Document revisions, Xdelta
- Incremental updates
- Queries can be evaluated in parallel.
- Recognizes the most common file formats: HTML, PDF, PS, and Microsoft's DOC and XLS.
- High-capacity robot that honors the robots.txt recommendation.
- Uses efficient index compression methods, including Golomb, Elias-gamma, and block coding.
- Based on the extended Boolean model, which can operate as either the vector model or the Boolean model.
- Universal stemmer that processes any language.
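
To make the compression item above more concrete, here is a minimal sketch of Elias-gamma coding applied to the gaps between document IDs in a postings list. It is an illustration only, not Egothor's own code: the class and method names (`EliasGammaSketch`, `encodePostings`, `decodePostings`) are hypothetical, the bit stream is kept as a `String` of '0' and '1' characters for readability, and document IDs are assumed to start at 1 and be strictly increasing.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative Elias-gamma coder (hypothetical names; not part of Egothor's API). */
final class EliasGammaSketch {

    /** Encodes n >= 1 as (bitLength - 1) zeros followed by n in binary. */
    static String encode(int n) {
        if (n < 1) {
            throw new IllegalArgumentException("Elias-gamma encodes positive integers only");
        }
        String binary = Integer.toBinaryString(n);
        StringBuilder code = new StringBuilder();
        for (int i = 1; i < binary.length(); i++) {
            code.append('0'); // unary length prefix
        }
        return code.append(binary).toString();
    }

    /** Encodes strictly increasing document IDs (starting at 1 or higher) as gamma-coded gaps. */
    static String encodePostings(int[] sortedDocIds) {
        StringBuilder bits = new StringBuilder();
        int previous = 0;
        for (int docId : sortedDocIds) {
            bits.append(encode(docId - previous)); // small gaps yield short codes
            previous = docId;
        }
        return bits.toString();
    }

    /** Decodes a well-formed bit string produced by encodePostings back into document IDs. */
    static List<Integer> decodePostings(String bits) {
        List<Integer> docIds = new ArrayList<>();
        int pos = 0;
        int previous = 0;
        while (pos < bits.length()) {
            int zeros = 0;
            while (bits.charAt(pos) == '0') { // count the unary prefix
                zeros++;
                pos++;
            }
            // The next (zeros + 1) bits hold the gap in plain binary.
            int gap = Integer.parseInt(bits.substring(pos, pos + zeros + 1), 2);
            pos += zeros + 1;
            previous += gap;
            docIds.add(previous);
        }
        return docIds;
    }

    public static void main(String[] args) {
        int[] postings = {3, 7, 8, 20};
        String bits = encodePostings(postings);
        System.out.println(bits);
        System.out.println(decodePostings(bits));
    }
}
```

Coding gaps rather than absolute IDs is what makes this kind of scheme pay off: frequent terms produce many small gaps, and Elias-gamma spends only 2⌊log2 n⌋ + 1 bits on a gap of size n. A real index would pack the bits into bytes or machine words rather than characters.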