v0.9.59.rc0
Pre-release
Pre-release
What's Changed
- Client side validation for non fp8 kv cache and fp8 context fmha by @joostinyi in #1302
- Add encoder model support in trt-llm by @michaelfeil in #1294
- Add
kv_cache_host_memory_bytes
as a configurable runtime setting by @joostinyi in #1303
Full Changelog: v0.9.58...v0.9.59.rc0