Egothor全文本搜索引擎

联合创作 · 2023-09-29 05:00

Egothor是一个用Java编写的开源而高效的全文本搜索引擎。借助Java的跨平台特性,Egothor能应用于任何环境的应用,既可配置为单独的搜索引擎,又能用于你的应用作为全文检索之用。

关键特性

  • Written in JAVA for cross platform compatibility.
  • New dynamization algorithm for fast index updating.
  • Fully 64-bit kernel
  • Transactions (ACID)
  • Document revisions, Xdelta
  • Incremental updates
  • Queries can be solved in a parallel manner
  • Able to recognize the most familiar file formats: HTML, PDF, PS, and Microsoft's DOC, and XLS.
  • High capacity robot which supports robots.txt recommendation.
  • The best compression methods are used, i.e. Golomb, Elias-Gamma, Block coding.
  • Based on the extended Boolean model which can operate as the Vector or Boolean models.
  • Universal stemmer that processes any language.
浏览 2
点赞
评论
收藏
分享

手机扫一扫分享

编辑 分享
举报
评论
图片
表情
推荐
点赞
评论
收藏
分享

手机扫一扫分享

编辑 分享
举报