内容简介:jsoup 1.10.3 发布,Java 的 HTML 解析器
jsoup 1.10.3 发布了,该版本带来了更好的 CSS 选择器性能,Jsoup.Connection 改进和其他 bug 修复。
详情包括:
Improvements
-
Added
Elements.eachText()andElements.eachAttr(), which return a list of anElement'stext or attribute values, respectively. This makes it simpler to for example get a list of each URL on a page:List<String> urls = doc.select("a").eachAttr("abs:href""); -
Improved selector validation for
:contains(...)with unbalanced quotes. -
Improved the speed of index based CSS selectors and other methods that use elementSiblingIndex, by a factor of 34x.
-
Added
Node.clearAttributes(), to simplify removing of all attributes of aNode/Element.
Fixes
-
Bugfix: if an attribute name started or ended with a control character, the parse would fail with a validation exception.
-
Bugfix:
Element.hasClass()and the.classnameselector would not find the class attribute case-insensitively. -
Bugfix: In
Jsoup.Connection, if a redirect contained a query string with%xxescapes, they would be double escaped before the redirect was followed, leading to fetching an incorrect location. -
Bugfix: In
Jsoup.Connection, if a request body was set and the connection was redirected, the body would incorrectly still be sent. -
Bugfix: In
DataUtilwhen detecting the character set from meta data, and there are two Content-Types defined, use the one that defines a character set. -
Bugfix: when parsing unknown tags in case-sensitive HTML mode, end tags would not close scope correctly.
-
In
Jsoup.Connection, ensure there is no Content-Type set when being redirected to a GET. -
Bugfix: in certain locales (Turkish specifically), lowercasing and case insensitivity could fail for specific items.
下载地址: https://jsoup.org/download
以上就是本文的全部内容,希望本文的内容对大家的学习或者工作能带来一定的帮助,也希望大家多多支持 码农网
猜你喜欢:- Java HTML 解析器 jsoup 发布 1.13.1,解析速度显著提升
- Expat 2.2.8 发布,XML 解析器
- MediaInfo 20.03 发布,多媒体文件解析软件
- JsoupXPath v2.0-Beta 发布,HTML 解析器
- Kubernetes 1.12全新发布!新功能亮点解析
- MediaInfo 19.07 发布,多媒体文件解析软件
本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们。
百度SEO一本通
潘坚、李迅 / 电子工业出版社 / 2015-6 / 59.00元
《百度SEO一本通》通过浅显易懂的叙述方式,以及大量的图示,详细介绍了SEO的关键技术要点,对于搜索引擎优化中重要的关键词优化、链接优化,以及百度推广中的推广技巧都进行了详细的介绍。 《百度SEO一本通》共分为11章,首先让大家了解SEO存在的原因,然后对网页、网站、空间和程序与SEO的关系展开了细节上的讨论,最后几章深入介绍了百度推广的相关概念、设置、技巧和实操,让读者可以轻松上手操作,易......一起来看看 《百度SEO一本通》 这本书的介绍吧!