html5libHTML解析库
html5lib 是一个用来解析 HTML 文档的 Python 类库,支持HTML 5 以及最大程度兼容桌面浏览器。
主要特性包括:
- Parses valid and invalid HTML documents to a tree
- Support for minidom, ElementTree (including cElementTree and lxml.etree), BeautifulSoup and custom simpletree output formats
- DOM to SAX converter
- Reports parse errors
- Character encoding detection
- XML mode for working with illformed XML e.g. feeds
- Filtering and serializing of trees
- HTML+CSS sanitizer
- Many unit tests
- Faster than before :)
评论
DeviceDetectorUser-Agent 解析库
DeviceDetector 是一个用 Ruby 编写的用来解析各种设备 User-Agent 信息
DeviceDetectorUser-Agent 解析库
0
go-parseParsec 解析库
go-parse是一个Go语言的库实现类Parsec的解析。示例代码:func main() { in := new(StringVessel); in.SetInput(`< (&
go-parseParsec 解析库
0