Introducing an additional parameter for launcher run_client closures #2684

riesentoaster · 2024-11-12T15:49:30Z

Follow-up to #2676.

With Launcher::overcommit, CoreId is no longer necessarily unique. ClientId is generated in LLMP, therefore unique, and an immutable borrow of it is passed to the closure.


tokatoka · 2024-11-12T16:06:43Z

I don't think this is good.
ClientId will not stay the same if the fuzzer restarts, which makes this number useless.
(For example, if it is just a varying number, you can't do if (id == xxx) {// do this} else {// do that} inside the run_client harness.)

tokatoka · 2024-11-12T16:07:57Z

Also, in general, we should not assume that every fuzzer has a client id. It's an identifier specific to LLMP communication, not something to distinguish fuzzer instances.

riesentoaster · 2024-11-12T16:11:18Z

That's fair, I haven't thought of that. I still feel like having an id there that uniquely identifies the client is a good idea. I guess in that case we're back to creating a second ClientId based on CoreId and the overcommit index.

@domenukk Opinions?

tokatoka · 2024-11-12T16:13:51Z

> we're back to creating a second ClientId based on CoreId and the overcommit index.

Yeah, any number is fine, such as timestamp + pid or anything else. It's just that this is not an appropriate case for using ClientId.

riesentoaster · 2024-11-12T16:19:55Z

I think having something predictable is better, again because of restarts. If we use CoreId * overcommit + overcommit_i (the overcommit index), we're guaranteed to get the same numbers whenever the fuzzer is started with the same config, so a client can resume based on files it created before shutdown.
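
A minimal sketch of such a predictable id, assuming `overcommit` is the number of clients launched per core and `overcommit_i` is a client's index among them. The function names and signatures are hypothetical, not an existing LibAFL API. The multiplier has to be the per-core client count for the ids of neighbouring cores to stay distinct.

```rust
/// Hypothetical helper (not part of LibAFL): derives a stable id from the
/// core a client is pinned to and its index among the clients on that core.
///
/// `overcommit` is assumed to be the number of clients launched per core.
fn predictable_client_id(core_id: usize, overcommit: usize, overcommit_i: usize) -> usize {
    // The index must be smaller than the per-core client count, otherwise
    // the ids of neighbouring cores would overlap.
    assert!(overcommit_i < overcommit, "overcommit index out of range");
    core_id * overcommit + overcommit_i
}

/// Lists the ids for `cores` cores with `overcommit` clients each.
fn all_ids(cores: usize, overcommit: usize) -> Vec<usize> {
    (0..cores)
        .flat_map(|core_id| {
            (0..overcommit).map(move |i| predictable_client_id(core_id, overcommit, i))
        })
        .collect()
}
```

With the same core list and overcommit setting, a restarted fuzzer computes the same ids again, which is what would let a client find the files it wrote before shutdown.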

tokatoka · 2024-11-12T16:20:36Z

yeah it's good

riesentoaster · 2024-11-12T16:27:25Z

And another downside is that this id is not going to match whatever the monitors display. That might lead to confusion.

tokatoka · 2024-11-12T16:29:48Z

> And another downside is that this id is not going to match whatever the monitors display. That might lead to confusion.

IIRC that monitor isn't even displaying the ClientId.

riesentoaster · 2024-11-12T16:42:04Z

The basic implementation does (the number after the # is the sender id of the llmp message):

[UserStats   #1]  (GLOBAL) run time: 0h-0m-3s, clients: 1, corpus: 1, objectives: 0, executions: 0, exec/sec: 0.000, stability: 100.000%, edges_objective: 0.006%, edges: 0.011%
                 (CLIENT) corpus: 1, objectives: 0, executions: 0, exec/sec: 0.000, stability: 7/7 (100%), edges_objective: 4/65536 (0%), edges: 7/65536 (0%)
[Objective   #1]  (GLOBAL) run time: 0h-0m-3s, clients: 1, corpus: 1, objectives: 1, executions: 0, exec/sec: 0.000, stability: 100.000%, edges_objective: 0.006%, edges: 0.011%

tokatoka · 2024-11-12T16:52:38Z

Uh, ok.
But you should not trust that ClientId number. It has lots of problems, and I don't think that number is unique to each fuzzer process: #1850 (comment)

tokatoka · 2024-11-12T17:50:16Z

I think the problem of the client id varying after a restart is unique to the centralized broker.
But anyway, a client id is unique to a connection, so it should not be used for distinguishing fuzzers.

riesentoaster · 2024-11-12T18:25:19Z

How about we generate a new ClientId based on the core/overcommit, but leave the getter functions for the LLMP ClientId on the managers? This way they are still accessible if anyone ever needs them, but the more obvious option is the "correct" one.

tokatoka · 2024-11-12T18:37:40Z

No... things are much more complex than that.

First, what you call the client id is not just one id.
There are 4 client ids per broker-to-client connection.
Each direction has one Sender and one Receiver, and the broker -> client and client -> broker directions are on different channels,
thus 2 x 2 = 4.
Each of these senders and receivers has its own client id. So you can see that the client id is really the id of an llmp_{sender, receiver}; it's not something unique to a fuzzer client.

If we do as you said, there are a lot of problems.
For example, you cannot set the client id of the broker: the broker is not bound to any core, so there is no core-derived number to assign to it.

So the client id should only be used inside LLMP, not for anything else.
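
To make the 2 x 2 count concrete, here is a small model of one fuzzer-to-broker link as described above. All types are hypothetical stand-ins written for this illustration; they are not LLMP's actual structs, and the counter-based id allocation is an assumption.

```rust
use std::collections::HashSet;

/// Stand-in for LLMP's `ClientId`; here it only identifies one endpoint.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
struct EndpointId(u32);

/// One direction of traffic, with its own sender and receiver.
#[derive(Debug)]
struct Channel {
    sender: EndpointId,
    receiver: EndpointId,
}

/// A fuzzer client's link to the broker: two channels, one per direction.
#[derive(Debug)]
struct Connection {
    client_to_broker: Channel,
    broker_to_client: Channel,
}

impl Connection {
    /// Allocates four fresh endpoint ids from a simple counter
    /// (hypothetical allocation scheme, for illustration only).
    fn new(next_id: &mut u32) -> Self {
        let mut fresh = || {
            let id = EndpointId(*next_id);
            *next_id += 1;
            id
        };
        Connection {
            client_to_broker: Channel { sender: fresh(), receiver: fresh() },
            broker_to_client: Channel { sender: fresh(), receiver: fresh() },
        }
    }

    /// All ids that belong to this single fuzzer-to-broker link.
    fn endpoint_ids(&self) -> [EndpointId; 4] {
        [
            self.client_to_broker.sender,
            self.client_to_broker.receiver,
            self.broker_to_client.sender,
            self.broker_to_client.receiver,
        ]
    }
}

/// Counts how many different ids one connection carries.
fn distinct_ids(conn: &Connection) -> usize {
    conn.endpoint_ids().iter().copied().collect::<HashSet<_>>().len()
}
```

In this model none of the four ids is "the" id of the fuzzer; each one names an endpoint of a channel, which is the point being made in the comment.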

riesentoaster · 2024-11-12T21:22:09Z

The only interesting one of those 4 is the sender id in the client->broker direction, since this is what is printed in the monitors. I don't really need this information for my current projects, but I figured it'd be worth a suggestion since it's already built and it may be helpful for someone down the line.

I'm still curious about @domenukk's opinion though, since he's the one who initially suggested using the LLMP ClientId.

tokatoka · 2024-11-12T22:42:42Z

> The only interesting one of those 4 is the sender id in the client->broker direction

Yeah, but I'm saying that even if the others are not interesting, you have to put something there for LLMP to work, and you can't put core/overcommit there.

Also, LLMP is independent of Launcher. That means it'd be a problem if you don't use Launcher / core pinning.

domenukk · 2024-11-13T01:11:01Z

ClientId was supposed to be unique and survive reboots. That's literally what it's for :D
TIL about issues with centralized.
Maybe we should fix these?

domenukk · 2024-11-13T01:12:30Z

IMHO
Launcher -> CoreId + Overcommit -> pick what features to use (provided on the commandline)
Anything else should use ClientId, and we should make sure that's unique.
Initially I also had a BrokerId for broker2broker, but maybe that didn't survive the C->Rust migration

domenukk · 2024-11-13T01:13:55Z

That being said, we shouldn't add more parameters to the closure in any case. I suggested either adding it into state or extending CoreId to CoreDescription
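
The second option could look roughly like the sketch below: the closure keeps a single parameter, but that parameter is a small description struct instead of a bare core id. `CoreDescription` is shown here with made-up field names, `launch_client` is a stand-in for the launcher, and the feature names are placeholders; none of this is LibAFL's actual API.

```rust
/// Hypothetical replacement for a bare core id: everything the launcher
/// knows about where a client runs.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct CoreDescription {
    /// Core the client is pinned to.
    core_id: usize,
    /// Index of this client among the clients sharing that core.
    overcommit_i: usize,
}

/// Stand-in for the launcher: calls the client closure with one argument,
/// so no extra closure parameter is needed.
fn launch_client<F, R>(desc: CoreDescription, run_client: F) -> R
where
    F: FnOnce(&CoreDescription) -> R,
{
    run_client(&desc)
}

/// Example client logic: pick a feature set from the stable description.
/// The feature names are placeholders.
fn pick_features(desc: &CoreDescription) -> &'static str {
    if desc.core_id == 0 && desc.overcommit_i == 0 {
        "cmplog"
    } else {
        "default"
    }
}
```

Because both fields are fixed by the launch configuration, a branch like the one in `pick_features` behaves the same way after a restart.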

tokatoka · 2024-11-13T10:38:34Z

> ClientId was supposed to be unique and survive reboots.

It's not unique for fuzzers. It's only unique for communication. As I said, each fuzzer-to-broker connection has 4 client ids, one assigned to each Sender and Receiver.

domenukk · 2024-11-13T19:27:50Z

Yes, and we should pick one as the fuzzer node id IMHO.

riesentoaster · 2024-11-15T12:29:51Z

So I guess we're basically back to #2676? Or do we want to integrate an LLMP id somewhere right now?

Personally, I think that in the long run, there should be one identifier per client/node that is globally unique and used throughout any user-facing interface, including launchers, monitors, and everything else. It may make sense to couple those with LLMP ids, since LLMP also needs unique ids per client, but if there are good reasons to do something else, that's fine too. In that case, however, it would likely need to be synced along with the messages. I don't know LLMP and the entire architecture nearly well enough to make a judgement call about this.

tokatoka · 2024-11-15T13:03:13Z

> there should be one identifier per client/node that is globally unique and used throughout any user-facing interface, including launchers, monitors, and everything else.

Yeah, if you really mean "globally unique", then you can't even use the core id as the identifier. You can run fuzzers on multiple machines and aggregate their logs, and in that case the core id is not a valid globally unique identifier.

> It may make sense to couple those with LLMP ids, since LLMP also needs unique ids per client

No, IMO you shouldn't. "LLMP also needs unique ids per client": yeah, but it's not globally unique. It's only unique within its sequence of connections. And this is what makes the logs from monitors "look" unique: the monitor is simply printing the broker's receiver's client id (which is unique). But if this assumption breaks, for example if you have two brokers emitting logs to stdout, nothing is unique anymore.

tokatoka · 2024-11-15T13:07:16Z

If you "just" want a globally unique id, then you should use a UUID:
https://docs.rs/uuid/latest/uuid/index.html
(and IMO it is good to have this for the client)

riesentoaster added 5 commits November 12, 2024 12:46

moving to getter/setter for LLMPs ClientId

141c9cf

introducing an additional client_id parameter to launcher's run_client parameter

9999080

Merge branch 'main' into client-id

a4e0f45

removing temp script file

40355ac

moving additional code to use getters

4353054

riesentoaster added 2 commits November 12, 2024 21:55

moving additional code to use getters round 2

a81fa8e

fixing initialization of const

f229220

riesentoaster mentioned this pull request Nov 20, 2024

Make Launcher use ClientDescription instead of CoreId #2676

Open
