-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature]: Stop Milvus being a joke #36703
Comments
Hi eaxis,
there might be many other reason to cause this storage expansion, for example:
Also, we believe that disk is the cheapest resource so I doesn't make too much sense to take too much care about it. Memory and CPU is definitely more expensive resources. V2.2.8 is a very old version(released 2 years ago) and we didn't see any report recently
I think you probably need a little patience to learn details about milvus. Is is designed for scale and perfomance(You will know it you try to do some profiling and when you have more than 10m data). The largest deploymend we have is more than 10B. But I admit its API is not as straight forward as chroma or other databases due to it's complicated functionality and architecture. |
please also check https://zilliz.com/, which is our cloud offerings. It's purely managed by us and 80k entities is covered by our free tier. BTW, I didn't notice you said you only have 80k entities. There is no way to achive 80GB storage no matter what you do. 80k is a too small amount of data. So I guess you need to check your code:
|
/assign @eaxis |
Yeah, that's correct, thank you for your patience. The thing is that Attu, which is the Milvus' UI has a warning that the number of records in a collection may not be accurate because of the way Milvus works, so I haven't looked into that in detail. However, I opted for pgvector which seemed more stable and predictable to me, wish you luck! |
Sure,for 80k vectors, pg vector is a great option! enjoy it For anyone who searched to this issue, attu used meta info to show count so it is not accurate .Count operation is accurate but usually takes more resource to execute |
Is there an existing issue for this?
Is your feature request related to a problem? Please describe.
Hey guys, I have recently switched from Chroma to Milvus thinking that Milvus is a more mature, serious and production ready solution... and I'm absolutely disappointed - just wasted my time.
For the context, I have ~80k records in 2 collections defined by the following schema:
and Milvus takes ~77GB on the SSD to store that content. Meanwhile Chroma takes ~3 GB and pgvector takes ~2 GB.
With that in mind, could you please clarify the following points:
Do you really think your users have infinite disk space / money to store everything you want? Some links:
The next thing is collection management (e.g. load / unload), this seems like a pretty internal operation - would you consider making it optional with the ability to assign a default strategy? Like 'load if not loaded' or something like that. I'm building a multi-threaded application, and controlling internal 3rd party statement is crazy.
Last thing: Do you really think your users don't need to count stored documents? The fact that Milvus can't even provide an exact number of managed documents is amusing and absurd at the same time.
Please note, I in no way mean to offend maintainers, but bringing a small(/medium?) sized project with Milvus under the hood to a production level seems to be a joke
Describe the solution you'd like.
No response
Describe an alternate solution.
No response
Anything else? (Additional Context)
No response
The text was updated successfully, but these errors were encountered: