Update worker images to optimize IO performance using local data #675
base: master
Commits on Oct 30, 2024
Commit 997ca3b
Setting needs_data_local when fulfilled_by is set.
Update usages of DataRequirement so that whenever an instance's fulfilled_by attribute is set, at creation time or otherwise, the new needs_data_local attribute is also set.
Commit 4e3e03f
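As a rough illustration of keeping fulfilled_by and needs_data_local in step, here is a minimal sketch. `RequirementSketch`, its `category` field, and the `fulfill` helper are hypothetical stand-ins, not DMOD's actual DataRequirement API; the dataset name is made up.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RequirementSketch:
    """Hypothetical stand-in for DataRequirement; not the real DMOD class."""

    category: str
    fulfilled_by: Optional[str] = None
    needs_data_local: Optional[bool] = None

    def fulfill(self, dataset_name: str, needs_data_local: bool) -> None:
        # Assign both attributes in one place so neither is set without the other.
        self.fulfilled_by = dataset_name
        self.needs_data_local = needs_data_local


req = RequirementSketch(category="FORCING")
req.fulfill("forcing-dataset-01", needs_data_local=True)
```

Routing every assignment through one helper is only one way to enforce the pairing; the commit itself updates the existing call sites.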
Update, document ngen image custom dir structure.
Add two new directories, /dmod/local_volumes and /dmod/cluster_volumes, to the ngen image directory structure. They serve as mount points for the different types of volumes that contain data needed by the job. Also add a README with initial documentation on this directory structure.
Commit 3625385
Update Launcher for using local data volumes.
Update Launcher to prepare services with local volume mounts when some data requirements must be fulfilled by local data on the physical node. Also update the other relevant args for starting worker services so that, as part of job startup, one worker on each node makes sure data gets prepared in local volumes as needed.
Commit 152e8d3
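The "one worker on each node" idea can be sketched as below. The function name, the host list input, and the rule of picking the first worker seen on each node are all assumptions for illustration; the Launcher's real selection logic is not shown in this commit message.

```python
from typing import List


def pick_data_preparers(worker_hosts: List[str]) -> List[bool]:
    """For each worker (by index), report whether it is the first worker on its node.

    worker_hosts[i] is the hypothetical hostname of the node running worker i.
    """
    seen_hosts = set()
    flags = []
    for host in worker_hosts:
        # Only the first worker encountered on a node is asked to prepare local data.
        flags.append(host not in seen_hosts)
        seen_hosts.add(host)
    return flags
```

With hosts `["node-a", "node-a", "node-b"]`, workers 0 and 2 would be flagged to prepare data and worker 1 would not.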
Update ngen-related images to have mc client.
Make the MinIO CLI client available within the ngen worker image and its derivatives (e.g., the calibration worker), though without a pre-configured alias for connecting to the object store service.
Commit 1aabb62
Update worker Python functions to make data local.
Add functionality to py_funcs.py to support making DMOD dataset data local (not merely accessible locally from remote storage).
Commit 8aff1c2
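A minimal sketch of what "making data local" could look like when the dataset is already mounted from a cluster volume. The function name and the layout of one subdirectory per dataset are assumptions; this is not the code in py_funcs.py.

```python
import shutil
from pathlib import Path


def copy_dataset_to_local(dataset_name: str, cluster_root: Path, local_root: Path) -> Path:
    """Copy one dataset directory from a cluster volume mount to a local volume mount.

    Assumes (hypothetically) that each dataset lives in a subdirectory named
    after it under both roots, e.g. /dmod/cluster_volumes and /dmod/local_volumes.
    """
    src = cluster_root / dataset_name
    dest = local_root / dataset_name
    if not src.is_dir():
        raise FileNotFoundError(f"No dataset directory at {src}")
    # dirs_exist_ok (Python 3.8+) lets a re-run refresh an existing local copy.
    shutil.copytree(src, dest, dirs_exist_ok=True)
    return dest
```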
Update worker entrypoints for local data.
Updating main entrypoint scripts for ngen and calibration worker images for local data handling.
Commit 13b61a5
Fix fast dev update script GUI handling.
Fixing script so that GUI services do not get stopped and updated unless that is actually asked for with the available CLI option.
Commit e163b61
Move call to make_data_local in entrypoints.
Moving call to this Python function so that it happens before sanity checks (at the entrypoint level) ensuring dataset directories exist, as they won't exist until any data is made local.
Commit 1f4fa7b
Fix issues w/ use of separate cluster/local data.
- Order minio client args properly (config dir must come first)
- Clean up output handling during minio client subprocess
- Correct a few logical mistakes with how conditionals should behave
- Fix issue with path object creation when copying from cluster volume
- Add some helpful logging messages
- Make sure we actually create symlinks
Commit 72067da
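To illustrate the argument-ordering fix, the sketch below builds an `mc` command line with the global `--config-dir` flag placed before the `cp` subcommand. The helper name, alias, bucket, and paths are hypothetical; only the flag placement reflects what the commit describes.

```python
from pathlib import Path
from typing import List


def build_mc_copy_args(config_dir: Path, alias: str, bucket: str, dest: Path) -> List[str]:
    """Build an argument list for a recursive MinIO client copy.

    The global --config-dir option goes before the subcommand, as the commit notes.
    """
    return [
        "mc",
        "--config-dir", str(config_dir),
        "cp", "--recursive",
        f"{alias}/{bucket}/",
        str(dest),
    ]


# Hypothetical values; the list could be passed to subprocess.run(args, check=True).
args = build_mc_copy_args(Path("/tmp/mc-config"), "store", "my-dataset", Path("/dmod/local_volumes/my-dataset"))
```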
Fix more issues with py_funcs functions.
- Fixing handling of symlink for output dataset so it points to cluster volume as needed (i.e., so output can actually make it out of the worker)
- Fixing some issues with keyword args coming in from CLI that certain functions weren't set up to disregard properly
- Adding a bit more helpful logging in places
Commit 2919533
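The output-dataset symlink handling might look roughly like the following. The function name and both path arguments are hypothetical; the sketch only shows a dataset path being pointed at a directory on a cluster volume so that writes land on shared storage.

```python
from pathlib import Path


def link_dataset_dir(link_path: Path, target_dir: Path) -> None:
    """Make link_path a symlink to target_dir, replacing any previous symlink.

    If link_path already exists as a real directory, symlink_to raises
    FileExistsError rather than silently replacing it.
    """
    if link_path.is_symlink():
        link_path.unlink()
    link_path.parent.mkdir(parents=True, exist_ok=True)
    link_path.symlink_to(target_dir, target_is_directory=True)
```

Anything written through `link_path` then ends up in `target_dir`, which is the property an output dataset needs in order to leave the worker.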
Update worker entrypoints for permissions issues.
Add logic and reorder certain steps so that, given that job outputs are initially written locally, the subsequent process of moving the results to backing dataset storage works properly and does not run into permissions issues.
Commit 088b35c
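The commit message does not say how the permissions problems are avoided. As one possible approach, and purely as an assumption, the sketch below moves outputs by copying file contents only, without trying to replicate mode bits or ownership on the backing storage. The function name and directory arguments are hypothetical.

```python
import shutil
from pathlib import Path


def move_outputs(local_dir: Path, backing_dir: Path) -> int:
    """Copy file contents from local_dir into backing_dir, then delete the local copies.

    Returns the number of files moved.
    """
    moved = 0
    # sorted() materializes the listing before any files are removed.
    for src in sorted(local_dir.rglob("*")):
        if not src.is_file():
            continue
        dest = backing_dir / src.relative_to(local_dir)
        dest.parent.mkdir(parents=True, exist_ok=True)
        # copyfile copies data only; it does not copy permission bits or ownership.
        shutil.copyfile(src, dest)
        src.unlink()
        moved += 1
    return moved
```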
Commit d71abfc
Commit 15486d8
Update dataservice internal deps to latest.
Update dependencies on core and scheduler to 0.21.0 and 0.14.0 respectively.
Commit 42a09fa
Commit 2164ca0
Update schedulerservice internal deps to latest.
Update dependencies on core and scheduler to 0.21.0 and 0.14.0 respectively.
Commit 0c4ef3c
Update requestsservice to latest core dep.
Updating dependency on core to 0.21.0.
Commit b59cb7e
Update partitionerservice internal deps to latest.
Updating dependencies on core and scheduler to 0.21.0 and 0.14.0 respectively.
Commit 84710c4
Commit 46e442e
Account for platform in image mc client download.
Account for building in environments other than Linux X86_64 when downloading the MinIO client for the ngen worker images.
Commit 25dbf8e
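Selecting the MinIO client download by platform could be sketched as follows. The image build most likely does this in shell inside the Dockerfile; this Python version, its function name, and its mapping table are assumptions that only illustrate the idea, using the public dl.min.io release layout for Linux builds.

```python
import platform


def mc_download_url(machine: str) -> str:
    """Return a MinIO client download URL for a Linux image on the given CPU architecture."""
    arch_by_machine = {
        "x86_64": "amd64",
        "amd64": "amd64",
        "aarch64": "arm64",
        "arm64": "arm64",
    }
    key = machine.lower()
    if key not in arch_by_machine:
        raise ValueError(f"Unsupported architecture: {machine}")
    return f"https://dl.min.io/client/mc/release/linux-{arch_by_machine[key]}/mc"


# platform.machine() reports the architecture of the environment running the build.
# An empty or unrecognized value falls back to None here instead of raising.
try:
    url_for_this_host = mc_download_url(platform.machine())
except ValueError:
    url_for_this_host = None
```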