-
Notifications
You must be signed in to change notification settings - Fork 68
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reader Group is not deleted #325
Comments
It's not clear to me that the suggested change is the best course of action. A stream job can be restarted, and if we use the same group name, then it should resume from the same revisioned stream. It is not necessary to delete. But, it is also correct that starting a group from scratch and reseting it to a checkpoint should work, so in principle, deleting it on close should be fine, except on ungraceful shutdowns. If the application has to delete the reader group out of band in at least one case (ungraceful shutdowns), then would it work that we do not delete on close, require deleting out of band, and make sure that we are using consistent reader group names? Let's talk about it before we jump to conclusions. |
I support the first approach. It follows the rule "who create who delete" and is enough for restarting with checkpoint-enabled cases. It's not ideal to delete the reader group out of band. There are several reasons.
|
If we only have a single reader group through the lifetime of a Flink application, then we only really ever have one reader group to delete, so there is no risk of having orphaned reader groups accumulating. The main question is whether we can fix the naming:
Could we not rely on random strings and use a well-defined and known name?
How does the cronjob know which reader groups to delete?
I need to be convinced that we can really avoid that burden. |
As reader group(or say
We can tell it's very likely to have the same name. These names can also be overwritten by the Flink application developers, of course we can say, hey, please train the developers that they should use the different names for readers in each Flink app, but it's too demanding and not a good solution to me. We can have another idea that maybe we can name it from the Pravega side, i.e. putting the context of what stream/streamcut it is being read, but it will also get duplicates when two Flink applications reads the same stream (multiple apps reading from one stream is a typical use case I think).
Sorry for the misunderstanding, but this problem is what I'm concerned as an extra burden aforementioned. Deleting the RG in |
Problem description
This connector creates a reader group but never deletes it.
Problem location
FlinkPravegaReader.java
Suggestions for an improvement
FlinkPravegaReader.close()
should callReaderGroupManager.deleteReaderGroup()
.The text was updated successfully, but these errors were encountered: