Implement support for COSINE in fused ADC #329

jkni · 2024-05-24T19:18:41Z

This PR also renamed QuickADCPQDecoder to FusedADCPQDecoder for better consistency with naming elsewhere, flattens the type hierarchy in FusedADCPQDecoder, and reduces duplicated C code with some force-inlined functions.

jbellis · 2024-05-24T19:33:19Z

jvector-examples/src/main/java/io/github/jbellis/jvector/example/Grid.java

@@ -132,7 +132,8 @@ static void runOneGraph(List<? extends Set<FeatureId>> featureSets,
                }

                indexes.forEach((features, index) -> {
-                    try (var cs = new ConfiguredSystem(ds, index, cv)) {
+                    try (var cs = new ConfiguredSystem(ds, index instanceof OnDiskGraphIndex ? new CachingGraphIndex((OnDiskGraphIndex) index) : index, cv,


should we move this logic to where indexes are created?

I thought about this, but I hesitated because it pushes the compression grid into the build methods, which otherwise don't care about these compression configurations. I don't really have strong feelings either way given that change, so happy to go with whatever you'd prefer here.

Makes sense, this WFM

jbellis · 2024-05-24T19:34:22Z

jvector-native/src/main/c/jvector_simd.c

 */

+


I am impressed that this worked first try!

jbellis · 2024-05-24T19:36:47Z

jvector-base/src/main/java/io/github/jbellis/jvector/vector/VectorUtilSupport.java

@@ -131,11 +131,71 @@ default void bulkShuffleQuantizedSimilarity(ByteSequence<?> shuffles, int codebo
    }
  }

+  // default implementation used here because Panama SIMD can't express necessary SIMD operations and degrades to scalar


unfortunate!

jbellis

LGTM.

How does performance compare to DP?

jkni · 2024-05-24T20:16:05Z

LGTM.

How does performance compare to DP?

There's some overhead, comparable to the overhead of regular PQ cosine to regular PQ dot product. On openai-v3-large-1536-100k, PQ(192,256), LVQ/Fused ADC, as a fairly representative run based on what I've seen locally:

COSINE

 Query top 100/200 recall 0.9798 in 3.90s after 42,044,504 nodes visited

DOT_PRODUCT

 Query top 100/200 recall 0.9770 in 3.75s after 42,250,616 nodes visited

jbellis · 2024-05-24T21:09:50Z

Ship it!

Implement support for COSINE in fused ADC

da1719a

jkni requested a review from jbellis May 24, 2024 19:20

jbellis reviewed May 24, 2024

View reviewed changes

jvector-native/src/main/c/jvector_simd.c

*/

Copy link

Owner

jbellis May 24, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am impressed that this worked first try!

jbellis reviewed May 24, 2024

View reviewed changes

jbellis approved these changes May 24, 2024

View reviewed changes

jkni merged commit d8a2b49 into main May 24, 2024
6 checks passed

jkni deleted the cosine-fused-adc branch May 24, 2024 21:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement support for COSINE in fused ADC #329

Implement support for COSINE in fused ADC #329

jkni commented May 24, 2024

jbellis May 24, 2024

jkni May 24, 2024

jbellis May 24, 2024

jbellis May 24, 2024

jbellis May 24, 2024

jbellis left a comment

jkni commented May 24, 2024

jbellis commented May 24, 2024

Implement support for COSINE in fused ADC #329

Implement support for COSINE in fused ADC #329

Conversation

jkni commented May 24, 2024

jbellis May 24, 2024

Choose a reason for hiding this comment

jkni May 24, 2024

Choose a reason for hiding this comment

jbellis May 24, 2024

Choose a reason for hiding this comment

jbellis May 24, 2024

Choose a reason for hiding this comment

jbellis May 24, 2024

Choose a reason for hiding this comment

jbellis left a comment

Choose a reason for hiding this comment

jkni commented May 24, 2024

jbellis commented May 24, 2024