Download distribution zip (or tar.gz)
Full Changelog | Javadoc | Maven Central
New features
- ConfigurableExtractorJS: Regex rules to skip extracting
<script>tags when their attributes match. #672
Bug fixes
- Docs: Switch bean docs generation to an annotation processor, fixing the bean reference broken by Java language changes. #683
- StatisticsTracker: Don’t restore
crawlEndTimewhen resuming from a checkpoint. #669 - ExtractorJS: Fix overriding the
strictsetting in sheets. #670 - Berkeley DB: Handle more shutdown interrupts gracefully. #671
Dependency upgrades
- amqp-client: 5.26.0 → 5.27.0
- groovy: 4.0.28 → 5.0.2
- jaxb-runtime: 4.0.5 → 4.0.6
- jetty: 12.0.27 → 12.0.29
- jsch: 2.27.3 → 2.27.4
- junit-jupiter: 5.13.4 → 6.0.0
- kafka-clients: 3.9.1 → 4.1.0
- pdfbox: 3.0.5 → 3.0.6
- rethinkdb-driver: 2.3.3 → 2.4.4
- spring: 6.2.11 → 6.2.12
- webarchive-commons: 3.0.0 → 3.0.1
- webjars-locator-lite: 1.1.0 → 1.1.2