ChromaDB Support for Python SDK #110

mehrinkiani · 2024-02-02T20:11:00Z

This PR adds Chroma DB support for Python SDK.

ristomcgehee · 2024-02-15T04:59:57Z

Just wanted to let you know that I've been busy lately with my day job and probably won't be able to get to this for at least a week.

mehrinkiani · 2024-02-15T16:40:37Z

Sounds good, and thank you for letting me know @ristomcgehee!

ristomcgehee

Okay, I finally found some time to review this PR. Here are my thoughts!

ristomcgehee · 2024-02-25T04:03:36Z

python-sdk/README.md

@@ -45,14 +45,18 @@ pip install rebuff

 ### Detect prompt injection on user input

+For vector database, Rebuff supports Pinecone (default) and Chroma. To use Chroma, install Rebuff with extras: `pip install rebuff[chromadb]`


Suggested change

-For vector database, Rebuff supports Pinecone (default) and Chroma. To use Chroma, install Rebuff with extras: `pip install rebuff[chromadb]`
+For vector database, Rebuff supports Pinecone (default) and Chroma.

That same information is repeated a few lines later, so I don't think we need it here.

ristomcgehee · 2024-02-25T04:18:28Z

python-sdk/rebuff/detect_pi_vectorbase.py

-    input: str, similarity_threshold: float, vector_store: Pinecone
+    input: str,
+    similarity_threshold: float,
+    vector_store: Union[Pinecone, Optional[Chroma]],


Suggested change

-    vector_store: Union[Pinecone, Optional[Chroma]],
+    vector_store: VectorStore,

And then you'd need to import VectorStore from langchain at the top of the file.
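One reason the broader annotation may be preferable: `Optional[Chroma]` is shorthand for `Union[Chroma, None]`, so the nested union flattens and also admits `None`, which is probably not intended. A minimal sketch with stand-in classes (the `VectorStore`, `Pinecone`, and `Chroma` classes below are placeholders for illustration, not the real langchain types):

```python
from typing import Optional, Union, get_args


class VectorStore:
    """Placeholder for a shared vector-store base class (hypothetical)."""


class Pinecone(VectorStore):
    """Placeholder standing in for the Pinecone store."""


class Chroma(VectorStore):
    """Placeholder standing in for the Chroma store."""


# Optional[Chroma] is Union[Chroma, None], so the nested union flattens
# and silently allows None as a valid vector_store argument.
original_hint = Union[Pinecone, Optional[Chroma]]
print(get_args(original_hint))


def describe(vector_store: VectorStore) -> str:
    # Annotating with the base class covers every subclass without a Union.
    return type(vector_store).__name__
```

With the base-class annotation, adding another backend later would not require touching the signature.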

ristomcgehee · 2024-02-25T04:27:16Z

python-sdk/rebuff/detect_pi_vectorbase.py

+try:
+    import chromadb
+
+    chromadb_installed = True
+except ImportError:
+    print(
+        "To use Chromadb, please install rebuff with rebuff extras. 'pip install \"rebuff[chromadb]\"'"
+    )
+    chromadb_installed = False


I'd like to suggest a different approach for handling the fact that chromadb might not be installed. If I understand the code correctly, even users who only use Pinecone will always see the warning "To use Chromadb, please install...". Also, it's a bit unusual to conditionally define classes and methods.

I would move ChromaCosineSimilarity to a different file but keep init_chroma in this file. At the beginning of init_chroma, you import chromadb and ChromaCosineSimilarity within a try-except. If the import fails, then you display "To use Chromadb, please install...". Then you'd no longer need the chromadb_installed variable.
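The deferred-import idea could look roughly like the sketch below. The helper name `import_optional` and the argument-free `init_chroma` signature are assumptions made for illustration, and raising an `ImportError` (rather than printing the hint) is one possible choice, not necessarily what the PR should do:

```python
import importlib
from types import ModuleType


def import_optional(module_name: str, extra: str) -> ModuleType:
    """Import an optional dependency on demand, with an actionable error."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"To use {module_name}, please install rebuff with extras: "
            f'pip install "rebuff[{extra}]"'
        ) from exc


def init_chroma() -> ModuleType:
    # Hypothetical, simplified signature. The import runs only when a
    # caller actually asks for Chroma, so Pinecone users never see the hint.
    chromadb = import_optional("chromadb", "chromadb")
    return chromadb
```

Because nothing Chroma-related executes at module import time, the module-level `chromadb_installed` flag is no longer needed.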

ristomcgehee · 2024-02-25T04:37:30Z

python-sdk/rebuff/sdk.py

+            )
+
+        elif self.vector_db.name == "CHROMA":
+            from rebuff.detect_pi_vectorbase import init_chroma


If you take my suggestion for refactoring detect_pi_vectorbase.py, you'll be able to move this import to the top of the file.

ristomcgehee · 2024-02-25T04:39:26Z

python-sdk/rebuff/sdk.py

@@ -83,7 +118,7 @@ def detect_injection(
            rebuff_heuristic_score = 0

        if check_vector:
-            self.initialize_pinecone()
+            self.initialize_vector_store()


Suggested change

-            self.initialize_vector_store()
+            if self.vector_store is None:
+                self.initialize_vector_store()

ristomcgehee · 2024-02-25T05:04:46Z

python-sdk/tests/test_sdk.py

+def rebuff(request) -> RebuffSdk:
    rb = RebuffSdk(
        get_environment_variable("OPENAI_API_KEY"),
+        request.param,
        get_environment_variable("PINECONE_API_KEY"),
        get_environment_variable("PINECONE_INDEX_NAME"),
    )


For most test methods, it doesn't do any good testing on both pinecone and chroma; only test_detect_injection_vectorbase needs to test with both. You could do this in the fixture:

Suggested change

-def rebuff(request) -> RebuffSdk:
-    rb = RebuffSdk(
-        get_environment_variable("OPENAI_API_KEY"),
-        request.param,
-        get_environment_variable("PINECONE_API_KEY"),
-        get_environment_variable("PINECONE_INDEX_NAME"),
-    )
+def rebuff(request) -> RebuffSdk:
+    vector_db = request.param if hasattr(request, "param") else VectorDB.PINECONE
+    rb = RebuffSdk(
+        get_environment_variable("OPENAI_API_KEY"),
+        vector_db,
+        get_environment_variable("PINECONE_API_KEY"),
+        get_environment_variable("PINECONE_INDEX_NAME"),
+    )

Which would allow you to delete:

@pytest.mark.parametrize(
    "rebuff",
    [VectorDB.PINECONE, VectorDB.CHROMA],
    ids=["pinecone", "chroma"],
    indirect=True,
)

from most of the methods.
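The fallback works because pytest sets `request.param` only when the fixture is parametrized. The sketch below simulates that with plain objects so it runs without pytest; the `VectorDB` values and the `resolve_vector_db` helper are assumptions for illustration:

```python
from enum import Enum
from types import SimpleNamespace


class VectorDB(Enum):
    # Stand-in for the SDK's enum; the values are assumptions.
    PINECONE = "pinecone"
    CHROMA = "chroma"


def resolve_vector_db(request) -> VectorDB:
    # request.param exists only for parametrized fixtures (for example
    # via indirect=True), so fall back to Pinecone otherwise.
    return request.param if hasattr(request, "param") else VectorDB.PINECONE


# Simulated fixture requests: one plain, one parametrized with Chroma.
plain_request = SimpleNamespace()
chroma_request = SimpleNamespace(param=VectorDB.CHROMA)

print(resolve_vector_db(plain_request))
print(resolve_vector_db(chroma_request))
```

Tests that do not care about the backend can then request the fixture without any decorator and get Pinecone by default.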

ristomcgehee · 2024-02-25T05:07:17Z

python-sdk/tests/test_sdk.py

 def test_detect_injection_vectorbase(
    rebuff: RebuffSdk,
+    add_documents_to_chroma,


This parameter isn't used in this method, so it probably shouldn't be a parameter. What I would probably do is call add_documents_to_chroma at the end of the rebuff fixture function.

ristomcgehee · 2024-02-25T05:14:25Z

python-sdk/rebuff/detect_pi_vectorbase.py

+        )
+
+        chroma_collection = ChromaCosineSimilarity(
+            client=chromadb.Client(),


When chromadb.Client() is invoked, is that creating an in-memory database that only persists as long as the process does? If that's the case, it wouldn't really work for a production use case. I think what we'd want to do is use chromadb.HttpClient to connect to a remote server. If we go that route, it looks like we might be able to use chromadb-client instead which is a more lightweight version of chromadb (https://docs.trychroma.com/usage-guide?lang=py#using-the-python-http-only-client).

Thank you for the suggestion! I have updated the code to use chromadb.HttpClient. I will check if we can also update the dependency to chromadb-client

I have updated the dependency to chromadb-client.

mehrinkiani · 2024-03-04T21:16:02Z

Thank you @ristomcgehee for the review and suggestions! I have tried to incorporate most of them. Using chromadb.HttpClient also makes sense to me, and I am going to work on setting up a remote server. I thought I would share the update on the PR for now.

ristomcgehee · 2024-03-06T04:56:31Z

python-sdk/tests/test_sdk.py

        get_environment_variable("PINECONE_API_KEY"),
        get_environment_variable("PINECONE_INDEX_NAME"),
    )
+    if hasattr(request, "param") and request.param == VectorDB.CHROMA:


Suggested change

-    if hasattr(request, "param") and request.param == VectorDB.CHROMA:
+    if vector_db == VectorDB.CHROMA:

ristomcgehee · 2024-03-06T05:02:57Z

python-sdk/rebuff/sdk.py

+        if self.vector_db.name == "PINECONE":
+            self.pinecone_apikey = pinecone_apikey
+            self.pinecone_index = pinecone_index
+
+        elif self.vector_db.name == "CHROMA":


Suggested change

-        if self.vector_db.name == "PINECONE":
-            self.pinecone_apikey = pinecone_apikey
-            self.pinecone_index = pinecone_index
-
-        elif self.vector_db.name == "CHROMA":
+        if self.vector_db == VectorDB.PINECONE:
+            self.pinecone_apikey = pinecone_apikey
+            self.pinecone_index = pinecone_index
+
+        elif self.vector_db == VectorDB.CHROMA:

Similar recommendation for within initialize_vector_store().
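A short sketch of why comparing enum members tends to be safer than comparing `.name` strings. The `VectorDB` values and both helper functions are hypothetical and only illustrate the failure mode:

```python
from enum import Enum


class VectorDB(Enum):
    # Stand-in for the SDK's enum; the values are assumptions.
    PINECONE = "pinecone"
    CHROMA = "chroma"


def pick_backend_by_name(vector_db: VectorDB) -> str:
    # A typo in the string literal is never flagged; the branch simply
    # never runs.
    if vector_db.name == "PINCONE":  # deliberate typo
        return "pinecone"
    return "unhandled"


def pick_backend_by_member(vector_db: VectorDB) -> str:
    # The same typo here (VectorDB.PINCONE) would raise AttributeError
    # at runtime and can be caught by linters and type checkers.
    if vector_db == VectorDB.PINECONE:
        return "pinecone"
    elif vector_db == VectorDB.CHROMA:
        return "chroma"
    return "unhandled"
```

Calling `pick_backend_by_name(VectorDB.PINECONE)` falls through to the default branch, while the member-based version selects the Pinecone branch.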

ristomcgehee · 2024-03-06T05:04:47Z

python-sdk/rebuff/sdk.py

@@ -83,7 +114,8 @@ def detect_injection(
            rebuff_heuristic_score = 0

        if check_vector:
-            self.initialize_pinecone()
+            if self.initialize_vector_store() is None:


Suggested change

-            if self.initialize_vector_store() is None:
+            if self.vector_store is None:
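The difference between the two conditions can be sketched as follows, assuming `initialize_vector_store()` sets `self.vector_store` and returns nothing. `SdkSketch` is a hypothetical stand-in for `RebuffSdk`, reduced to the lazy-initialization logic:

```python
class SdkSketch:
    """Hypothetical stand-in for RebuffSdk, reduced to lazy initialization."""

    def __init__(self) -> None:
        self.vector_store = None
        self.init_calls = 0

    def initialize_vector_store(self) -> None:
        # Sets the attribute and implicitly returns None.
        self.init_calls += 1
        self.vector_store = object()  # placeholder for a real client

    def detect_with_return_check(self) -> None:
        # Checking the return value calls the initializer every time,
        # and under the assumption above the result is always None.
        if self.initialize_vector_store() is None:
            pass  # body omitted in this sketch

    def detect_with_attribute_check(self) -> None:
        # Checking the attribute initializes only on the first call.
        if self.vector_store is None:
            self.initialize_vector_store()
```

Calling each method three times on a fresh instance leaves `init_calls` at 3 for the return-value check and at 1 for the attribute check.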

mehrinkiani · 2024-03-07T18:17:19Z

Thank you @ristomcgehee, I have now added Docker files for the Chroma server. I'm not sure, though, why the JS and Python tests (integration tests) are failing. They are detecting prompt injection when there is none.

ristomcgehee

Though not sure why the JS and Python tests (integration tests) are failing. They are detecting prompt injection when there is none

I believe this occurs sometimes because LLMs are non-deterministic. Sometimes, you'll give a benign input and it will give a score of 0.6 or 0.8. A couple ways we could address that:

Set the temperature to 0 when calling OpenAI
Retry the tests multiple times when they fail

For now, you could also just re-run the tests and they'll likely pass.
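The retry option could be sketched as a small decorator. This is a generic illustration, not code from the repository; the `retry` helper and the simulated `flaky_check` are assumptions:

```python
import functools


def retry(times: int):
    """Re-run a flaky callable up to `times` times before giving up."""
    if times < 1:
        raise ValueError("times must be at least 1")

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            last_error = None
            for _ in range(times):
                try:
                    return func(*args, **kwargs)
                except AssertionError as exc:
                    last_error = exc
            # Every attempt failed: surface the last assertion error.
            raise last_error

        return wrapper

    return decorator


attempts = {"count": 0}


@retry(times=3)
def flaky_check() -> str:
    # Simulates a non-deterministic score: fails twice, then passes.
    attempts["count"] += 1
    assert attempts["count"] >= 3, "score above threshold"
    return "passed"
```

Retrying hides the flakiness rather than removing it, so it is probably best kept to the tests that depend on model output.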

ristomcgehee · 2024-03-10T04:40:37Z

docs/quickstart.md

@@ -32,17 +34,48 @@ if result.injection_detected:
    print("Possible injection detected. Take corrective action.")
 ```

+#### Chroma vector database


For a quickstart page, the simpler you can make it, the better. I'd recommend taking out the Pinecone section and just showing how to use the SDK with Chroma DB (since it requires less setup than Pinecone).

ristomcgehee · 2024-03-10T04:59:13Z

python-sdk/docker-compose.yaml

+
+  application:
+    env_file:
+      - .env


It would be good to mention in the documentation that an .env file is required, and to describe what it needs to contain. Something that projects often do is keep an example.env file in the repo that people can copy and fill in with their own values.

mehrinkiani · 2024-03-12T16:28:08Z

A couple ways we could address that:

Set the temperature to 0 when calling OpenAI

Retry the tests multiple times when they fail

Thank you for the suggestions. I have tried rerunning the tests multiple times and have also set the temperature to 0 when calling OpenAI, though I don't think it is helping much.

The Python SDK tests are also failing because of a connection error with the Chroma server, even though they pass locally. I will continue to debug this, but if you have any suggestions, please do share.

ristomcgehee · 2024-03-14T03:42:38Z

python-sdk/tests/test_sdk.py

+    if vector_db == VectorDB.CHROMA:
+        rb = RebuffSdk(get_environment_variable("OPENAI_API_KEY"), vector_db)


Aren't these two lines unnecessary since we already initialized rb on line 10?

ristomcgehee · 2024-03-14T03:49:39Z