Use read advice consistently in the knn vector formats #14076

jimczi · 2024-12-17T13:46:48Z

This change reverts #13985 and makes sure each knn format sticks to a single read advice consistently.
Switching read advice during merges might help some use cases, but it can also hurt others—e.g. when search and merges are running at the same time. To balance this, the approach here picks one read advice per format, focusing on what’s most resource-intensive for that format.

For formats using HNSW, the read advice is set to RANDOM and doesn’t change during merges. Copying bytes from old segments to new ones is much faster than re-building the graph, so keeping RANDOM read advice makes the most sense.

For flat formats, the read advice is set to SEQUENTIAL, as brute-force is the only way to retrieve nearest neighbors.

This is a deliberate decision to keep things simple and predictable. While it might seem like a step back compared to #13985, using multiple read advices on the same file can lead to unpredictable behavior—it might seem fine until you test it in a constrained setup.

That said, we could still improve merge performance with a RANDOM read advice in the future, for instance, by adding eager prefetching.

…ce while merging vectors (apache#13985)" This reverts commit 46204f6.

… access pattern

ChrisHegarty

I agree with the direction here. The anti-delta looks correct to me. I left a few comments.

ChrisHegarty · 2024-12-17T14:58:07Z

lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99FlatVectorsFormat.java


  /** Constructs a format */
-  public Lucene99FlatVectorsFormat(FlatVectorsScorer vectorsScorer) {
+  public Lucene99FlatVectorsFormat(FlatVectorsScorer vectorsScorer, ReadAdvice readAdvice) {


++ allowing to pass the read advice here is good, since the higher-level usage of this format really should dictate the intended usage.

I like the idea of passing the read advice. If the top level format can dictate the read advice then that makes code more better. +1 on this idea.

lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99FlatVectorsWriter.java

shatejas · 2024-12-23T01:51:50Z

I understand this and I think overall keeping a constant read advise works, just want to see if there is a path forward for optimizing merge.

e.g. when search and merges are running at the same time

@jimczi, #13985 was put out after a discussion since there was the this tradeoff being made. You can find the discussion here #13920 (comment).

using multiple read advices on the same file can lead to unpredictable behavior—it might seem fine until you test it in a constrained setup.

The experiments seemed to suggest it picked the last advice. I understand the setup wasn't constrained, would you be able to share the details of constrained setup where this caused unpredictable behavior. Just curious

That said, we could still improve merge performance with a RANDOM read advice in the future, for instance, by adding eager prefetching.

Are we sure that introducing prefetch during merge won't impact search?

A thought here would be to see the behavior if the read advice is updated (and revert after merge is over) to normal since sequential read advice can be aggressive as per documentation. Reading ahead of pages will then be decided by kernel heuristics for normal read advice as per my understanding.

@uschindler @jpountz any thoughts or suggestions?

jimczi · 2024-12-27T09:28:23Z

Thanks for looking @shatejas

The opensearch-project/k-NN#2134 (comment) seemed to suggest it picked the last advice. I understand the setup wasn't constrained, would you be able to share the details of constrained setup where this caused unpredictable behavior. Just curious

I may have overstated my point, I didn’t test it directly. My main concern is that the advice applies for the entire duration of the merge. Since the vector copy occurs at the start and represents only a small portion of the total merge time, I believe using prefetch instead of modifying the advice would be more suitable. Prefetch would allow us to fully control the readahead behavior.

Are we sure that introducing prefetch during merge won't impact search?

It would, especially when merging big segments, but only for the time of the copy, which should be relatively fast, rather than for the entire duration of the merge.

shatejas · 2024-12-31T21:33:30Z

I may have overstated my point, I didn’t test it directly. My main concern is that the advice applies for the entire duration of the merge. Since the vector copy occurs at the start and represents only a small portion of the total merge time, I believe using prefetch instead of modifying the advice would be more suitable. Prefetch would allow us to fully control the readahead behavior.

On a high level this makes a lot of sense to me. Since we can control the offset and the length in prefetch, it wouldn't be as aggressive as sequential advice and the impact of other parts like search would not be as much in theory. Thanks @jimczi for the explanation. I may take a stab at prefetch solution

github-actions · 2025-01-15T00:22:34Z

This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the [email protected] list. Thank you for your contribution!

jimczi added 2 commits December 17, 2024 09:39

Revert "Introduces IndexInput#updateReadAdvice to change the ReadAdvi…

6867430

…ce while merging vectors (apache#13985)" This reverts commit 46204f6.

Seal random or sequential access for the knn codec depending on their…

9649461

… access pattern

jimczi requested a review from ChrisHegarty December 17, 2024 13:46

jimczi added 2 commits December 17, 2024 14:06

tidy

0d4a38a

remove unused code

f4391c9

ChrisHegarty reviewed Dec 17, 2024

View reviewed changes

github-actions bot added the Stale label Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use read advice consistently in the knn vector formats #14076

Use read advice consistently in the knn vector formats #14076

jimczi commented Dec 17, 2024

ChrisHegarty left a comment

ChrisHegarty Dec 17, 2024

navneet1v Dec 18, 2024

shatejas commented Dec 23, 2024 •

edited

Loading

jimczi commented Dec 27, 2024

shatejas commented Dec 31, 2024

github-actions bot commented Jan 15, 2025

Use read advice consistently in the knn vector formats #14076

Are you sure you want to change the base?

Use read advice consistently in the knn vector formats #14076

Conversation

jimczi commented Dec 17, 2024

ChrisHegarty left a comment

Choose a reason for hiding this comment

ChrisHegarty Dec 17, 2024

Choose a reason for hiding this comment

navneet1v Dec 18, 2024

Choose a reason for hiding this comment

shatejas commented Dec 23, 2024 • edited Loading

jimczi commented Dec 27, 2024

shatejas commented Dec 31, 2024

github-actions bot commented Jan 15, 2025

shatejas commented Dec 23, 2024 •

edited

Loading