Download distribution zip (or tar.gz)
Full Changelog | Javadoc | Maven Central
End of interim releases
This release drops the term "interim release" which distinguished releases made temporarily by the community in the absence of releases made by Internet Archive. The community releases have effectively become the official releases.
In conjunction with this, the version numbers which were paused at 3.4.0 for the interim releases, have now resumed incrementing following the scheme major.minor.patch
with the minor release number incremented when features are added or removed.
Java compatibility notice
This will likely be the last release of Heritrix compatible with Java 8. The next release is expected to require Java 17 or later.
Changes in this release
Removals
- Removed HBase modules from contrib. #621
Fixes
- ConfigurableExtractorJS: Set default value (false) for strict property. #612
- ExtractorHTML: Treat
cite
attribute as a navlink instead of embed. #608 - Building no longer requires the builds.archive.org or Cloudera repositories. #614
- Updated to new URL of the restlet repository.
Dependency Upgrades
- Removed hbase, joda-time, log4j
- commons-io 2.14.0
- kafka-clients 3.8.0
- ftpserver-core 1.2.0
- jetty 9.4.56.v20240826
- webarchive-commons 1.1.10