-
Notifications
You must be signed in to change notification settings - Fork 118
[SPARK-18278][NOSUBMIT] Ongoing diff for Spark on Kubernetes (branch-2.2) #450
base: branch-2.2
Are you sure you want to change the base?
Commits on Jul 24, 2017
-
Configuration menu - View commit details
-
Copy full SHA for a91f660 - Browse repository at this point
Copy the full SHA a91f660View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9d71348 - Browse repository at this point
Copy the full SHA 9d71348View commit details -
Configuration menu - View commit details
-
Copy full SHA for f1baed2 - Browse repository at this point
Copy the full SHA f1baed2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5a45654 - Browse repository at this point
Copy the full SHA 5a45654View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5694a8a - Browse repository at this point
Copy the full SHA 5694a8aView commit details -
- Don't hold the raw secret bytes - Add CPU limits and requests
Configuration menu - View commit details
-
Copy full SHA for dbfb87d - Browse repository at this point
Copy the full SHA dbfb87dView commit details -
Configuration menu - View commit details
-
Copy full SHA for acb8b14 - Browse repository at this point
Copy the full SHA acb8b14View commit details -
Reindent comment to conforn to JavaDoc style
The build process fails ScalaStyle checks otherwise.
Configuration menu - View commit details
-
Copy full SHA for f9ae918 - Browse repository at this point
Copy the full SHA f9ae918View commit details -
Configuration menu - View commit details
-
Copy full SHA for f20397b - Browse repository at this point
Copy the full SHA f20397bView commit details -
Use tar and gzip to compress+archive shipped jars (#2)
* Use tar and gzip to archive shipped jars. * Address comments * Move files to resolve merge
Configuration menu - View commit details
-
Copy full SHA for 728be0e - Browse repository at this point
Copy the full SHA 728be0eView commit details -
Use alpine and java 8 for docker images. (#10)
* Use alpine and java 8 for docker images. * Remove installation of vim and redundant comment
Configuration menu - View commit details
-
Copy full SHA for 793143d - Browse repository at this point
Copy the full SHA 793143dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b1a99d - Browse repository at this point
Copy the full SHA 2b1a99dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 457ebd8 - Browse repository at this point
Copy the full SHA 457ebd8View commit details -
Error messages when the driver container fails to start. (#11)
* Error messages when the driver container fails to start. * Fix messages a bit * Use timeout constant * Delete the pod if it fails for any reason (not just timeout) * Actually set submit succeeded * Fix typo
Configuration menu - View commit details
-
Copy full SHA for 94ab8dd - Browse repository at this point
Copy the full SHA 94ab8ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7afadb3 - Browse repository at this point
Copy the full SHA 7afadb3View commit details -
Documentation for the current state of the world (#16)
* Documentation for the current state of the world. * Adding navigation links from other pages * Address comments, add TODO for things that should be fixed * Address comments, mostly making images section clearer * Virtual runtime -> container runtime
Configuration menu - View commit details
-
Copy full SHA for 909b281 - Browse repository at this point
Copy the full SHA 909b281View commit details -
Development workflow documentation for the current state of the world. (
#20) * Development workflow documentation for the current state of the world. * Address comments. * Clarified code change and added ticket link
Configuration menu - View commit details
-
Copy full SHA for 77b287e - Browse repository at this point
Copy the full SHA 77b287eView commit details -
Added service name as prefix to executor pods (#14)
* Added service name as prefix to executor pods to be able to tell them apart from kubectl output * Addressed comments
Configuration menu - View commit details
-
Copy full SHA for 0bcc391 - Browse repository at this point
Copy the full SHA 0bcc391View commit details -
Add kubernetes profile to travis CI yml file (#21)
* Add kubernetes profile to travis yml file * Fix long lines in CompressionUtils.scala
Configuration menu - View commit details
-
Copy full SHA for 979fa92 - Browse repository at this point
Copy the full SHA 979fa92View commit details -
Improved the example commands in running-on-k8s document. (#25)
* Improved the example commands in running-on-k8s document. * Fixed more example commands. * Fixed typo.
Configuration menu - View commit details
-
Copy full SHA for 087555a - Browse repository at this point
Copy the full SHA 087555aView commit details -
Configuration menu - View commit details
-
Copy full SHA for a89b4b0 - Browse repository at this point
Copy the full SHA a89b4b0View commit details -
Support custom labels on the driver pod. (#27)
* Support custom labels on the driver pod. * Add integration test and fix logic. * Fix tests * Fix minor formatting mistake * Reduce unnecessary diff
Configuration menu - View commit details
-
Copy full SHA for 85f02bf - Browse repository at this point
Copy the full SHA 85f02bfView commit details -
Configuration menu - View commit details
-
Copy full SHA for f71abc1 - Browse repository at this point
Copy the full SHA f71abc1View commit details -
A number of small tweaks to the MVP. (#23)
* A number of small tweaks to the MVP. - Master protocol defaults to https if not specified - Removed upload driver extra classpath functionality - Added ability to specify main app resource with container:// URI - Updated docs to reflect all of the above - Add examples to Docker images, mostly for integration testing but could be useful for easily getting started without shipping anything * Add example to documentation.
Configuration menu - View commit details
-
Copy full SHA for 95747bc - Browse repository at this point
Copy the full SHA 95747bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2f02444 - Browse repository at this point
Copy the full SHA 2f02444View commit details -
Support setting the driver pod launching timeout. (#36)
* Support setting the driver pod launching timeout. And increase the default value from 30s to 60s. The current value of 30s is kind of short for pulling the image from public docker registry plus the container/JVM start time. * Use a better name for the default timeout.
Configuration menu - View commit details
-
Copy full SHA for 92b2b52 - Browse repository at this point
Copy the full SHA 92b2b52View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3c6eed8 - Browse repository at this point
Copy the full SHA 3c6eed8View commit details -
Configuration menu - View commit details
-
Copy full SHA for c47ce5c - Browse repository at this point
Copy the full SHA c47ce5cView commit details -
Use "extraScalaTestArgs" to pass extra options to scalatest. (#52)
* Use "extraTestArgLine" to pass extra options to scalatest. Because the "argLine" option of scalatest is set in pom.xml and we can't overwrite it from the command line. Ref #37 * Added a default value for extraTestArgLine * Use a better name. * Added a tip for this in the dev docs.
Configuration menu - View commit details
-
Copy full SHA for 6bd7240 - Browse repository at this point
Copy the full SHA 6bd7240View commit details -
Configuration menu - View commit details
-
Copy full SHA for 25abc4d - Browse repository at this point
Copy the full SHA 25abc4dView commit details -
Configuration menu - View commit details
-
Copy full SHA for ee01986 - Browse repository at this point
Copy the full SHA ee01986View commit details -
Fix k8s integration tests (#44)
* Fixed k8s integration test - Enable spark ui explicitly for in-process submit - Fixed some broken assertions in integration tests - Fixed a scalastyle error in SparkDockerImageBuilder.scala - Log into target/integration-tests.log like other modules * Fixed line length. * CR
Configuration menu - View commit details
-
Copy full SHA for b986484 - Browse repository at this point
Copy the full SHA b986484View commit details -
Configuration menu - View commit details
-
Copy full SHA for f2b7346 - Browse repository at this point
Copy the full SHA f2b7346View commit details -
Create README to better describe project purpose (#50)
* Create README to better describe project purpose * Add links to usage guide and dev docs * Minor changes
Configuration menu - View commit details
-
Copy full SHA for 9124aac - Browse repository at this point
Copy the full SHA 9124aacView commit details -
Access the Driver Launcher Server over NodePort for app launch + subm…
…it jars (#30) * Revamp ports and service setup for the driver. - Expose the driver-submission service on NodePort and contact that as opposed to going through the API server proxy - Restrict the ports that are exposed on the service to only the driver submission service when uploading content and then only the Spark UI after the job has started * Move service creation down and more thorough error handling * Fix missed merge conflict * Add braces * Fix bad merge * Address comments and refactor run() more. Method nesting was getting confusing so pulled out the inner class and removed the extra method indirection from createDriverPod() * Remove unused method * Support SSL configuration for the driver application submission (#49) * Support SSL when setting up the driver. The user can provide a keyStore to load onto the driver pod and the driver pod will use that keyStore to set up SSL on its server. * Clean up SSL secrets after finishing submission. We don't need to persist these after the pod has them mounted and is running already. * Fix compilation error * Revert image change * Address comments * Programmatically generate certificates for integration tests. * Address comments * Resolve merge conflicts * Fix bad merge * Remove unnecessary braces * Fix compiler error
Configuration menu - View commit details
-
Copy full SHA for 4ff44d3 - Browse repository at this point
Copy the full SHA 4ff44d3View commit details -
Extract constants and config into separate file. Launch => Submit. (#65)
* Extract constants and config into separate file. Launch => Submit. * Address comments * A small shorthand * Refactor more ThreadUtils * Fix scalastyle, use cached thread pool * Tiny Scala style change
Configuration menu - View commit details
-
Copy full SHA for c57ccdc - Browse repository at this point
Copy the full SHA c57ccdcView commit details -
Retry the submit-application request to multiple nodes (#69)
* Retry the submit-application request to multiple nodes. * Fix doc style comment * Check node unschedulable, log retry failures
Configuration menu - View commit details
-
Copy full SHA for 261a624 - Browse repository at this point
Copy the full SHA 261a624View commit details -
Allow adding arbitrary files (#71)
* Allow adding arbitrary files * Address comments and add documentation
Configuration menu - View commit details
-
Copy full SHA for ab731f1 - Browse repository at this point
Copy the full SHA ab731f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0cf0d02 - Browse repository at this point
Copy the full SHA 0cf0d02View commit details -
Introduce blocking submit to kubernetes by default (#53)
* Introduce blocking submit to kubernetes by default Two new configuration settings: - spark.kubernetes.submit.waitAppCompletion - spark.kubernetes.report.interval * Minor touchups * More succinct logging for pod state * Fix import order * Switch to watch-based logging * Spaces in comma-joined volumes, labels, and containers * Use CountDownLatch instead of SettableFuture * Match parallel ConfigBuilder style * Disable logging in fire-and-forget mode Which is enabled with spark.kubernetes.submit.waitAppCompletion=false (default: true) * Additional log line for when application is launched * Minor wording changes * More logging * Drop log to DEBUG
Configuration menu - View commit details
-
Copy full SHA for efd803d - Browse repository at this point
Copy the full SHA efd803dView commit details -
Do not wait for pod finishing in integration tests. (#84)
Since the example job are patched to never finish.
Configuration menu - View commit details
-
Copy full SHA for 381b69a - Browse repository at this point
Copy the full SHA 381b69aView commit details -
Check for user jars/files existence before creating the driver pod. (#86
) * Check for user jars/files existence before creating the driver pod. Close #85 * CR
Configuration menu - View commit details
-
Copy full SHA for 15a8292 - Browse repository at this point
Copy the full SHA 15a8292View commit details -
Use readiness probe instead of client-side ping. (#75)
* Use readiness probe instead of client-side ping. Keep one ping() just as a sanity check, but otherwise set up the readiness probe to report the container as ready only when the ping endpoint can be reached. Also add a liveliness probe for convenience and symmetry. * Extract common HTTP get action * Remove some code * Add delay to liveliness check * Fix merge conflicts. * Fix more merge conflicts * Fix more merge conflicts * Revamp readiness check logic * Add addresses ready condition to endpoints watch * Rearrange the logic some more. * Remove liveness probe, retry against servers * Fix compiler error * Fix another compiler error * Delay between retries. Remove unintended test modification * FIx another compiler error * Extract method * Address comments * Deduplicate node addresses, use lower initial connect timeout * Drop maxRetriesPerServer from 10 to 3
Configuration menu - View commit details
-
Copy full SHA for a62c20f - Browse repository at this point
Copy the full SHA a62c20fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1a43957 - Browse repository at this point
Copy the full SHA 1a43957View commit details -
Bumping up kubernetes-client version to fix GKE and local proxy (#105)
* Bumping up kubernetes-client version to add fixes * Modify wording * Addressed comments
Configuration menu - View commit details
-
Copy full SHA for 3aba68a - Browse repository at this point
Copy the full SHA 3aba68aView commit details -
Truncate k8s hostnames to be no longer than 63 characters (#102)
* Truncate k8s hostnames to be no longer than 63 characters * Use only executorId not executorKubernetesId
Configuration menu - View commit details
-
Copy full SHA for 1f2fd80 - Browse repository at this point
Copy the full SHA 1f2fd80View commit details -
Configuration menu - View commit details
-
Copy full SHA for e239ac7 - Browse repository at this point
Copy the full SHA e239ac7View commit details -
Filter nodes to only try and send files to external IPs (#106)
* Filter node addresses * Added comment
Configuration menu - View commit details
-
Copy full SHA for 3a51dbe - Browse repository at this point
Copy the full SHA 3a51dbeView commit details -
Parse results of minikube status more rigorously (#97)
* Parse results of minikube status more rigorously Prior code assumes the minikubeVM status line is always the first row output from minikube status, and it is not when the version upgrade notifier prints an upgrade suggestion message. * Also filter ip response to expected rows
Configuration menu - View commit details
-
Copy full SHA for ba6a9e5 - Browse repository at this point
Copy the full SHA ba6a9e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for bab88e0 - Browse repository at this point
Copy the full SHA bab88e0View commit details -
Add -DskipTests to dev docs (#115)
* Add -DskipTests to dev docs * Remove extraneous skipTests
Configuration menu - View commit details
-
Copy full SHA for be4330f - Browse repository at this point
Copy the full SHA be4330fView commit details -
Shutdown the thread scheduler in LoggingPodStatusWatcher on receiving…
… job finish event notifications (#121)
Configuration menu - View commit details
-
Copy full SHA for b1d7706 - Browse repository at this point
Copy the full SHA b1d7706View commit details -
Trigger scalatest plugin in the integration-test phase (#93)
* Trigger scalatest plugin in the integration-test phase * Clean up unnecessary config section
Configuration menu - View commit details
-
Copy full SHA for 6ea3047 - Browse repository at this point
Copy the full SHA 6ea3047View commit details -
Fix issue with DNS resolution (#118)
* Fix issue with DNS resolution * Address comments
Configuration menu - View commit details
-
Copy full SHA for de5a105 - Browse repository at this point
Copy the full SHA de5a105View commit details -
Change the API contract for uploading local files (#107)
* Change the API contract for uploading local jars. This mirrors similarly to what YARN and Mesos expects. * Address comments * Fix test
Configuration menu - View commit details
-
Copy full SHA for 81c6968 - Browse repository at this point
Copy the full SHA 81c6968View commit details -
Optionally expose the driver UI port as NodePort (#131)
* Optionally expose driver UI as a NodePort * Update the usage doc * Rename serviceType -> uiServiceType
Configuration menu - View commit details
-
Copy full SHA for e7b3569 - Browse repository at this point
Copy the full SHA e7b3569View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6f27fb3 - Browse repository at this point
Copy the full SHA 6f27fb3View commit details -
Pass the actual iterable from the option to get files (#139)
* Pass the actual iterable from the option to get files * Split the original instance variables * Explicitly set the type of the array
Configuration menu - View commit details
-
Copy full SHA for 6d179a6 - Browse repository at this point
Copy the full SHA 6d179a6View commit details -
Use a separate class to track components that need to be cleaned up (#…
…122) * Refactor the cleaning up of Kubernetes components. Create a KubernetesComponentsCleaner which can register arbitrary pods, services, secrets, and ingresses. When an exception is thrown or the JVM shuts down, the cleaner automatically purges any of its registered components from Kubernetes. The components can be unregistered when the driver successfully begins running, so that the application persists beyond the lifetime of the spark-submit process. * Fix spacing * Address comments * Fix compiler error * Pull KubernetesComponentCleaner into instance variable * Remove a parameter * Remove redundant registerOrUpdateSecret for SSL * Remove Ingresses from component cleaner * Clear resources generically as opposed to specifying each type * Remove incorrect test assertion * Rename variable
Configuration menu - View commit details
-
Copy full SHA for e8359ca - Browse repository at this point
Copy the full SHA e8359caView commit details -
Enable unit tests in Travis CI build (#132)
* Configure unit test build while banning flaky tests * Clean up comment
Configuration menu - View commit details
-
Copy full SHA for a9dced2 - Browse repository at this point
Copy the full SHA a9dced2View commit details -
Configuration menu - View commit details
-
Copy full SHA for a124814 - Browse repository at this point
Copy the full SHA a124814View commit details -
Extract SSL configuration handling to a separate class (#123)
* Extract SSL configuration to a separate class * KubernetesSsl -> Ssl, container -> local
Configuration menu - View commit details
-
Copy full SHA for 06f78b4 - Browse repository at this point
Copy the full SHA 06f78b4View commit details -
Exclude known flaky tests (#156)
* Exclude SparkSubmitSuite from Travis unit test build * Remove SortShuffleSuite * Exclude Java tests
Configuration menu - View commit details
-
Copy full SHA for eb25262 - Browse repository at this point
Copy the full SHA eb25262View commit details -
Richer logging and better error handling in driver pod watch (#154)
* pod-watch progress around watch events * Simplify return * comments
Configuration menu - View commit details
-
Copy full SHA for 5587588 - Browse repository at this point
Copy the full SHA 5587588View commit details -
Document blocking submit calls (#152)
* Document blocking submit calls #53 added these config but didn't document them * Update running-on-kubernetes.md
Configuration menu - View commit details
-
Copy full SHA for 4029718 - Browse repository at this point
Copy the full SHA 4029718View commit details -
Configuration menu - View commit details
-
Copy full SHA for d2c181b - Browse repository at this point
Copy the full SHA d2c181bView commit details -
Update client version & minikube version (#142)
* Update client version * Upgrade minikube * Update pom.xml
Configuration menu - View commit details
-
Copy full SHA for 5bbd6bb - Browse repository at this point
Copy the full SHA 5bbd6bbView commit details -
Allow customizing external URI provision + External URI can be set vi…
…a annotations (#147) * Listen for annotations that provide external URIs. * FIx scalstyle * Address comments * Fix doc style * Docs updates * Clearly explain path rewrites
Configuration menu - View commit details
-
Copy full SHA for a4092cd - Browse repository at this point
Copy the full SHA a4092cdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6c42d4b - Browse repository at this point
Copy the full SHA 6c42d4bView commit details -
Allow setting memory on the driver submission server. (#161)
* Allow setting memory on the driver submission server. * Address comments * Address comments
Configuration menu - View commit details
-
Copy full SHA for bd3deca - Browse repository at this point
Copy the full SHA bd3decaView commit details -
Add a section for prerequisites (#171)
* Adding prerequisites * address comments
Configuration menu - View commit details
-
Copy full SHA for 53bf7a1 - Browse repository at this point
Copy the full SHA 53bf7a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for b2a5d3d - Browse repository at this point
Copy the full SHA b2a5d3dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 387aefb - Browse repository at this point
Copy the full SHA 387aefbView commit details -
Logging for resource deletion (#170)
* Logging for resource deletion Remove dangling colon and replace with an ellipses and a second log statement * Update KubernetesResourceCleaner.scala
Configuration menu - View commit details
-
Copy full SHA for ca76fbe - Browse repository at this point
Copy the full SHA ca76fbeView commit details -
* Fix pom versioning * fix k8s versions in pom * Change pom string to 2.1.0-k8s-0.1.0-SNAPSHOT
Configuration menu - View commit details
-
Copy full SHA for a9f1d6e - Browse repository at this point
Copy the full SHA a9f1d6eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0a5c4d5 - Browse repository at this point
Copy the full SHA 0a5c4d5View commit details -
* Adding official alpha docker image to docs * Reorder sections and create a specific one for "advanced" * Provide limitations and instructions about running on GKE * Fix title of advanced section: submission * Improved section on running in the cloud * Update versioning * Address comments * Address comments (cherry picked from commit e5da90d)
Configuration menu - View commit details
-
Copy full SHA for 73a0de3 - Browse repository at this point
Copy the full SHA 73a0de3View commit details -
Add Apache license to a few files (#175)
* Add Apache license to a few files * Ignore license check on META-INF service (cherry picked from commit 2a61438)
Configuration menu - View commit details
-
Copy full SHA for face1f4 - Browse repository at this point
Copy the full SHA face1f4View commit details -
Adding clarification pre-alpha (#181)
(cherry picked from commit be109ab)
Configuration menu - View commit details
-
Copy full SHA for 804d0f8 - Browse repository at this point
Copy the full SHA 804d0f8View commit details -
Allow providing an OAuth token for authenticating against k8s (#180)
* Allow providing an OAuth token for authenticating against k8s * Organize imports * Fix style * Remove extra newline * Use OAuth token data instead of a file. (cherry picked from commit 1aba361)
Configuration menu - View commit details
-
Copy full SHA for c5ab210 - Browse repository at this point
Copy the full SHA c5ab210View commit details -
Allow the driver pod's credentials to be shipped from the submission …
…client (squashed) (#192) * Allow the driver pod's credentials to be shipped through secrets. * Fix scalastyle * Change apiserver -> authentication * Address comments. Also some quality of life fixes, most notably formatting all of the documentation strings in config.scala to no longer use triple quotes. Triple quoted strings are difficult to format consistently. * Fix scalastyle * Fix comment * Remove unnecessary constants * Remove unnecessary whitespace * Authentication -> Authenticate
Configuration menu - View commit details
-
Copy full SHA for ffacd1f - Browse repository at this point
Copy the full SHA ffacd1fView commit details -
Support using PEM files to configure SSL for driver submission (#173)
* Support configuring SSL using PEM files. * Address some missed comments * Fix import ordering * Slight rewording of comments * Fix scalastyle
Configuration menu - View commit details
-
Copy full SHA for 64f3a69 - Browse repository at this point
Copy the full SHA 64f3a69View commit details -
Configuration menu - View commit details
-
Copy full SHA for d6b3234 - Browse repository at this point
Copy the full SHA d6b3234View commit details -
Configuration menu - View commit details
-
Copy full SHA for 368664f - Browse repository at this point
Copy the full SHA 368664fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 37880e2 - Browse repository at this point
Copy the full SHA 37880e2View commit details -
Register executors using pod IPs instead of pod host names (#215)
* Register executors using pod IPs * Fix block manager port typo * Fix import * Keep requiredEnv to be a val * Clean up indentation
Configuration menu - View commit details
-
Copy full SHA for 3a0b770 - Browse repository at this point
Copy the full SHA 3a0b770View commit details -
Configuration menu - View commit details
-
Copy full SHA for 02ab18e - Browse repository at this point
Copy the full SHA 02ab18eView commit details -
Configuration menu - View commit details
-
Copy full SHA for d0e27b1 - Browse repository at this point
Copy the full SHA d0e27b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9a895a8 - Browse repository at this point
Copy the full SHA 9a895a8View commit details -
Respect JVM http proxy settings when using Feign. (#228)
* Respect JVM http proxy settings when using Feign. * Address comments * Address more comments`
Configuration menu - View commit details
-
Copy full SHA for 88ec1c5 - Browse repository at this point
Copy the full SHA 88ec1c5View commit details -
Staging server for receiving application dependencies. (#212)
* Staging server for receiving application dependencies. * Add unit test for file writing * Minor fixes * Remove getting credentials from the API We still want to post them because in the future we can use these credentials to monitor the API server and handle cleaning up the data accordingly. * Generalize to resource staging server outside of Spark * Update code documentation * Val instead of var * Fix naming, remove unused import * Move suites from integration test package to core * Use TrieMap instead of locks * Address comments * Fix imports * Change paths, use POST instead of PUT * Use a resource identifier as well as a resource secret
Configuration menu - View commit details
-
Copy full SHA for 275510a - Browse repository at this point
Copy the full SHA 275510aView commit details -
Reorganize packages between v1 work and v2 work (#220)
* Staging server for receiving application dependencies. * Move packages around to split between v1 work and v2 work * Add unit test for file writing * Remove unnecessary main * Add back license header * Minor fixes * Fix integration test with renamed package for client. Fix scalastyle. * Force json serialization to consider the different package. * Revert extraneous log * Fix scalastyle * Remove getting credentials from the API We still want to post them because in the future we can use these credentials to monitor the API server and handle cleaning up the data accordingly. * Generalize to resource staging server outside of Spark * Update code documentation * Val instead of var * Fix build * Fix naming, remove unused import * Move suites from integration test package to core * Use TrieMap instead of locks * Address comments * Fix imports * Change paths, use POST instead of PUT * Use a resource identifier as well as a resource secret
Configuration menu - View commit details
-
Copy full SHA for b196426 - Browse repository at this point
Copy the full SHA b196426View commit details -
Support SSL on the file staging server (#221)
* Staging server for receiving application dependencies. * Move packages around to split between v1 work and v2 work * Add unit test for file writing * Remove unnecessary main * Allow the file staging server to be secured with TLS. * Add back license header * Minor fixes * Fix integration test with renamed package for client. Fix scalastyle. * Remove unused import * Force json serialization to consider the different package. * Revert extraneous log * Fix scalastyle * Remove getting credentials from the API We still want to post them because in the future we can use these credentials to monitor the API server and handle cleaning up the data accordingly. * Fix build * Randomize name and namespace in test to prevent collisions * Generalize to resource staging server outside of Spark * Update code documentation * Val instead of var * Fix unit tests. * Fix build * Fix naming, remove unused import * Move suites from integration test package to core * Fix unit test * Use TrieMap instead of locks * Address comments * Fix imports * Address comments * Change main object name * Change config variable names * Change paths, use POST instead of PUT * Use a resource identifier as well as a resource secret
Configuration menu - View commit details
-
Copy full SHA for d432dba - Browse repository at this point
Copy the full SHA d432dbaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7c29732 - Browse repository at this point
Copy the full SHA 7c29732View commit details -
Enable testing against GCE clusters (#243)
* Part 1: making test code cluster-agnostic * Final checked * Move all test code into KubernetesTestComponents * Addressed comments * Fixed doc * Restructure the test backends (#248) * Restructured the test backends * Address comments * var -> val * Comments * removed deadcode
Configuration menu - View commit details
-
Copy full SHA for 2c753de - Browse repository at this point
Copy the full SHA 2c753deView commit details -
Configuration menu - View commit details
-
Copy full SHA for c902d69 - Browse repository at this point
Copy the full SHA c902d69View commit details -
Configuration menu - View commit details
-
Copy full SHA for f09bf4a - Browse repository at this point
Copy the full SHA f09bf4aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 68ddcd5 - Browse repository at this point
Copy the full SHA 68ddcd5View commit details -
Configuration menu - View commit details
-
Copy full SHA for c4f17b7 - Browse repository at this point
Copy the full SHA c4f17b7View commit details -
Configuration menu - View commit details
-
Copy full SHA for da94d91 - Browse repository at this point
Copy the full SHA da94d91View commit details -
Configuration menu - View commit details
-
Copy full SHA for ecf248c - Browse repository at this point
Copy the full SHA ecf248cView commit details -
Dispatch tasks to right executors that have tasks' input HDFS data (#216
) * Dispatch tasks to right executors that have tasks' input HDFS data on local disks * Fix style issues * Clean up unnecessary fields * Clean up a misleading method name * Address review comments * Fix import ordering * Delete executor pods in watcher * Fix the driver hang by unblocking the main thread * Fix import order * Clear runningExecutorPods * Fix incorrect merge * Address review comments * Clean up imports
Configuration menu - View commit details
-
Copy full SHA for 085fcd1 - Browse repository at this point
Copy the full SHA 085fcd1View commit details -
Add parameter for driver pod name (#258)
* Add parameter for driver pod name * Mark KUBERNETES_DRIVER_POD_NAME not being internal. Update docment. * Add test case for driver pod name * Diff driver pod name with appid * replace 'spark.kubernetes.driver.pod.name` with KUBERNETES_DRIVER_POD_NAME * Update readme to complete item
Configuration menu - View commit details
-
Copy full SHA for 2af7f05 - Browse repository at this point
Copy the full SHA 2af7f05View commit details -
* dynamic allocation: shuffle service docker, yaml and test fixture * dynamic allocation: changes to spark-core * dynamic allocation: tests * dynamic allocation: docs * dynamic allocation: kubernetes allocator and executor accounting * dynamic allocation: shuffle service, node caching
Configuration menu - View commit details
-
Copy full SHA for 20956e7 - Browse repository at this point
Copy the full SHA 20956e7View commit details -
Download remotely-located resources on driver and executor startup vi…
…a init-container (#251) * Download remotely-located resources on driver startup. Use init-container in executors. * FIx owner reference slightly * Clean up config * Don't rely too heavily on conventions that can change * Fix flaky test * Tidy up file resolver * Whitespace arrangement * Indentation change * Fix more indentation * Consolidate init container component providers * Minor method signature and comment changes * Rename class for consistency * Resolve conflicts * Fix flaky test * Add some tests and some refactoring. * Make naming consistent for Staged -> Submitted * Add unit test for the submission client. * Refine expectations * Rename variables and fix typos * Address more comments. Remove redundant SingleKeyConfigMap. * Minor test adjustments. * add another test * Fix conflicts.
Configuration menu - View commit details
-
Copy full SHA for 30597f6 - Browse repository at this point
Copy the full SHA 30597f6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 636dbda - Browse repository at this point
Copy the full SHA 636dbdaView commit details -
Exit properly when the k8s cluster is not available. (#256)
* Exit properly when the k8s cluster is not available. * add jetty to k8s module dependency so we can use only rebuild the k8s module. * CR * Fixed single thread scheduler. * Fixed scalastyle check. * CR
Configuration menu - View commit details
-
Copy full SHA for 76c865d - Browse repository at this point
Copy the full SHA 76c865dView commit details -
Configuration menu - View commit details
-
Copy full SHA for a6cebcb - Browse repository at this point
Copy the full SHA a6cebcbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2458b81 - Browse repository at this point
Copy the full SHA 2458b81View commit details -
Configuration menu - View commit details
-
Copy full SHA for 910865f - Browse repository at this point
Copy the full SHA 910865fView commit details -
Monitor pod status in submission v2. (#283)
* Monitor pod status in submission v2. * Address comments
Configuration menu - View commit details
-
Copy full SHA for 2e5f2cd - Browse repository at this point
Copy the full SHA 2e5f2cdView commit details -
Replace submission v1 with submission v2. (#286)
* Replace submission v1 with submission v2. * Address documentation changes. * Fix documentation
Configuration menu - View commit details
-
Copy full SHA for cc5eb85 - Browse repository at this point
Copy the full SHA cc5eb85View commit details -
Added files should be in the working directories. (#294)
* Added files should be in the working directories. * Revert unintentional changes * Fix test
Configuration menu - View commit details
-
Copy full SHA for 27b79a2 - Browse repository at this point
Copy the full SHA 27b79a2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d4819c - Browse repository at this point
Copy the full SHA 4d4819cView commit details -
Remove some leftover code and fix a constant. (#297)
* Remove some leftover code and fix a constant. * Fix build
Configuration menu - View commit details
-
Copy full SHA for 1311de1 - Browse repository at this point
Copy the full SHA 1311de1View commit details -
Configuration menu - View commit details
-
Copy full SHA for e9f0a37 - Browse repository at this point
Copy the full SHA e9f0a37View commit details -
Configuration menu - View commit details
-
Copy full SHA for bd8f6da - Browse repository at this point
Copy the full SHA bd8f6daView commit details -
Add proxy configuration to retrofit clients. (#301)
* Add proxy configuration to retrofit clients. * Add logging
Configuration menu - View commit details
-
Copy full SHA for fc5d9c5 - Browse repository at this point
Copy the full SHA fc5d9c5View commit details -
Fix an HDFS data locality bug in case cluster node names are short ho…
…st names (#291) * Fix an HDFS data locality bug in case cluster node names are not full host names * Add a NOTE about InetAddress caching
Configuration menu - View commit details
-
Copy full SHA for 51a325c - Browse repository at this point
Copy the full SHA 51a325cView commit details -
Configuration menu - View commit details
-
Copy full SHA for b8dc23d - Browse repository at this point
Copy the full SHA b8dc23dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1c8bf38 - Browse repository at this point
Copy the full SHA 1c8bf38View commit details -
Allow custom executor labels and annotations (#321)
* Allow custom executor labels and annotations * Address comments. * Fix scalastyle.
Configuration menu - View commit details
-
Copy full SHA for 2cbd6fc - Browse repository at this point
Copy the full SHA 2cbd6fcView commit details -
Dynamic allocation, cleanup in case of driver death (#319)
* Adding cleanup for shuffle service for driver death * Address comments + fix tests * Cleanly open and close resources. * Added unit test, reusing RegisterDriver * lint + fix mesos
Configuration menu - View commit details
-
Copy full SHA for 5be5938 - Browse repository at this point
Copy the full SHA 5be5938View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6610cd3 - Browse repository at this point
Copy the full SHA 6610cd3View commit details -
Clean up resources that are not used by pods. (#305)
* Clean up resources that are not used by pods. * Make client side send correct credentials. * Simplify cleanup logic. Cancellation is no longer instantaneous and we might clean up a little later than the given TTL. However, the tradeoff is a simpler implementation with clearer contracts about when things will and will not be cleaned up. * Remove class * Fix imports and line length. * Remove import. * Add a unit test for StagingResourcesStore. * Revamp cleanup process. - Delete resources immediately when owners do not exist - Delete resources if after they are first uploaded, they are not accessed for a certain period of time. - Resource owners are more specifically defined and can have a type (currently only uses pods) * Clarify log messages * Use a single set of credentials in resource staging server. Also refactors construction of Kubernetes Clients to unify the code paths. * Fix unit test. * Safe close if creating shuffle block handler fails * Use implicit class. * Address comments. * Fix broken test.
Configuration menu - View commit details
-
Copy full SHA for 04ff1d8 - Browse repository at this point
Copy the full SHA 04ff1d8View commit details -
Configuration menu - View commit details
-
Copy full SHA for c312567 - Browse repository at this point
Copy the full SHA c312567View commit details -
Allow docker image pull policy to be configurable (#328)
* Allow docker image pull policy to be configurable * Add flag documentation * Update running-on-kubernetes.md
Configuration menu - View commit details
-
Copy full SHA for 4f6a4d7 - Browse repository at this point
Copy the full SHA 4f6a4d7View commit details -
Configuration menu - View commit details
-
Copy full SHA for f208d68 - Browse repository at this point
Copy the full SHA f208d68View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9cdccbe - Browse repository at this point
Copy the full SHA 9cdccbeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5a41e1e - Browse repository at this point
Copy the full SHA 5a41e1eView commit details -
Support specify CPU cores and Memory restricts for driver (#340)
Signed-off-by: duyanghao <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 069bd04 - Browse repository at this point
Copy the full SHA 069bd04View commit details -
Generate the application ID label irrespective of app name. (#331)
* Generate the application ID label irrespective of app name. * Add an integration test. * Fix scalastyle
Configuration menu - View commit details
-
Copy full SHA for 4a01baf - Browse repository at this point
Copy the full SHA 4a01bafView commit details -
Create base-image and minimize layer count (#324)
* Create base-image and minimize layer count * Create running-on-kubernetes.md
Configuration menu - View commit details
-
Copy full SHA for e763252 - Browse repository at this point
Copy the full SHA e763252View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9f2ce8e - Browse repository at this point
Copy the full SHA 9f2ce8eView commit details -
Use node affinity to launch executors on preferred nodes benefitting …
…from data locality (#316) * Use node affinity to launch executors on data local nodes * Fix comment style * Use JSON object mapper * Address review comments * Fix a style issue * Clean up and add a TODO * Fix style issue * Address review comments
Configuration menu - View commit details
-
Copy full SHA for 0010a57 - Browse repository at this point
Copy the full SHA 0010a57View commit details -
* Fix sbt build. - Remove extraneous Feign dependency that we no longer use in submission v2. - Exclude Jackson from various modules to ensure every Jackson module is forced to 2.6.5. - Fix a linter error only caught by sbt. - Add Kubernetes modules to various parts of the SBT infrastructure * Actually remove feign * Actually exclude Jackson from kubernetes client.
Configuration menu - View commit details
-
Copy full SHA for efb5081 - Browse repository at this point
Copy the full SHA efb5081View commit details -
New API for custom labels and annotations. (#346)
* New API for custom labels and annotations. This APi allows for these labels and annotations to have = and , characters, which is hard to accomplish in the old scheme. * Compare correct values in requirements * Use helper method * Address comments. * Fix scalastyle * Use variable * Remove unused import
Configuration menu - View commit details
-
Copy full SHA for af7297e - Browse repository at this point
Copy the full SHA af7297eView commit details -
Allow spark driver find shuffle pods in specified namespace (#357)
The conf property spark.kubernetes.shuffle.namespace is used to specify the namesapce of shuffle pods. In normal cases, only one "shuffle daemonset" is deployed and shared by all spark pods. The spark driver should be able to list and watch shuffle pods in the namespace specified by user. Note: by default, spark driver pod doesn't have authority to list and watch shuffle pods in another namespace. Some action is needed to grant it the authority. For example, below ABAC policy works. ``` {"apiVersion": "abac.authorization.kubernetes.io/v1beta1", "kind": "Policy", "spec": {"group": "system:serviceaccounts", "namespace": "SHUFFLE_NAMESPACE", "resource": "pods", "readonly": true}} ```
Configuration menu - View commit details
-
Copy full SHA for 38287f6 - Browse repository at this point
Copy the full SHA 38287f6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 168ef0a - Browse repository at this point
Copy the full SHA 168ef0aView commit details -
Configuration menu - View commit details
-
Copy full SHA for cdf6c36 - Browse repository at this point
Copy the full SHA cdf6c36View commit details -
Allow number of executor cores to have fractional values (#361)
This commit tries to solve issue #359 by allowing the `spark.executor.cores` configuration key to take fractional values, e.g., 0.5 or 1.5. The value is used to specify the cpu request when creating the executor pods, which is allowed to be fractional by Kubernetes. When the value is passed to the executor process through the environment variable `SPARK_EXECUTOR_CORES`, the value is rounded up to the closest integer as required by the `CoarseGrainedExecutorBackend`. Signed-off-by: Yinan Li <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9dc5eed - Browse repository at this point
Copy the full SHA 9dc5eedView commit details -
Python Bindings for launching PySpark Jobs from the JVM (#364)
* Adding PySpark Submit functionality. Launching Python from JVM * Addressing scala idioms related to PR351 * Removing extends Logging which was necessary for LogInfo * Refactored code to leverage the ContainerLocalizedFileResolver * Modified Unit tests so that they would pass * Modified Unit Test input to pass Unit Tests * Setup working environent for integration tests for PySpark * Comment out Python thread logic until Jenkins has python in Python * Modifying PythonExec to pass on Jenkins * Modifying python exec * Added unit tests to ClientV2 and refactored to include pyspark submission resources * Modified unit test check * Scalastyle * PR 348 file conflicts * Refactored unit tests and styles * further scala stylzing and logic * Modified unit tests to be more specific towards Class in question * Removed space delimiting for methods * Submission client redesign to use a step-based builder pattern. This change overhauls the underlying architecture of the submission client, but it is intended to entirely preserve existing behavior of Spark applications. Therefore users will find this to be an invisible change. The philosophy behind this design is to reconsider the breakdown of the submission process. It operates off the abstraction of "submission steps", which are transformation functions that take the previous state of the driver and return the new state of the driver. The driver's state includes its Spark configurations and the Kubernetes resources that will be used to deploy it. Such a refactor moves away from a features-first API design, which considers different containers to serve a set of features. The previous design, for example, had a container files resolver API object that returned different resolutions of the dependencies added by the user. However, it was up to the main Client to know how to intelligently invoke all of those APIs. Therefore the API surface area of the file resolver became untenably large and it was not intuitive of how it was to be used or extended. This design changes the encapsulation layout; every module is now responsible for changing the driver specification directly. An orchestrator builds the correct chain of steps and hands it to the client, which then calls it verbatim. The main client then makes any final modifications that put the different pieces of the driver together, particularly to attach the driver container itself to the pod and to apply the Spark configuration as command-line arguments. * Don't add the init-container step if all URIs are local. * Python arguments patch + tests + docs * Revert "Python arguments patch + tests + docs" This reverts commit 4533df2. * Revert "Don't add the init-container step if all URIs are local." This reverts commit e103225. * Revert "Submission client redesign to use a step-based builder pattern." This reverts commit 5499f6d. * style changes * space for styling
Configuration menu - View commit details
-
Copy full SHA for 442490a - Browse repository at this point
Copy the full SHA 442490aView commit details -
Submission client redesign to use a step-based builder pattern (#365)
* Submission client redesign to use a step-based builder pattern. This change overhauls the underlying architecture of the submission client, but it is intended to entirely preserve existing behavior of Spark applications. Therefore users will find this to be an invisible change. The philosophy behind this design is to reconsider the breakdown of the submission process. It operates off the abstraction of "submission steps", which are transformation functions that take the previous state of the driver and return the new state of the driver. The driver's state includes its Spark configurations and the Kubernetes resources that will be used to deploy it. Such a refactor moves away from a features-first API design, which considers different containers to serve a set of features. The previous design, for example, had a container files resolver API object that returned different resolutions of the dependencies added by the user. However, it was up to the main Client to know how to intelligently invoke all of those APIs. Therefore the API surface area of the file resolver became untenably large and it was not intuitive of how it was to be used or extended. This design changes the encapsulation layout; every module is now responsible for changing the driver specification directly. An orchestrator builds the correct chain of steps and hands it to the client, which then calls it verbatim. The main client then makes any final modifications that put the different pieces of the driver together, particularly to attach the driver container itself to the pod and to apply the Spark configuration as command-line arguments. * Add a unit test for BaseSubmissionStep. * Add unit test for kubernetes credentials mounting. * Add unit test for InitContainerBootstrapStep. * unit tests for initContainer * Add a unit test for DependencyResolutionStep. * further modifications to InitContainer unit tests * Use of resolver in PythonStep and unit tests for PythonStep * refactoring of init unit tests and pythonstep resolver logic * Add unit test for KubernetesSubmissionStepsOrchestrator. * refactoring and addition of secret trustStore+Cert checks in a SubmissionStepSuite * added SparkPodInitContainerBootstrapSuite * Added InitContainerResourceStagingServerSecretPluginSuite * style in Unit tests * extremely minor style fix in variable naming * Address comments. * Rename class for consistency. * Attempt to make spacing consistent. Multi-line methods should have four-space indentation for arguments that aren't on the same line as the method call itself... but this is difficult to do consistently given how IDEs handle Scala multi-line indentation in most cases.
Configuration menu - View commit details
-
Copy full SHA for fd30c5d - Browse repository at this point
Copy the full SHA fd30c5dView commit details -
Add implicit conversions to imports. (#374)
Otherwise we can get a Scalastyle error when building from SBT.
Configuration menu - View commit details
-
Copy full SHA for f46443e - Browse repository at this point
Copy the full SHA f46443eView commit details -
Fix import order and scalastyle (#375)
Test with ./dev/scalastyle
Configuration menu - View commit details
-
Copy full SHA for 42f578f - Browse repository at this point
Copy the full SHA 42f578fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2c00103 - Browse repository at this point
Copy the full SHA 2c00103View commit details -
Configuration menu - View commit details
-
Copy full SHA for e086f4d - Browse repository at this point
Copy the full SHA e086f4dView commit details -
Retry binding server to random port in the resource staging server te…
…st. (#378) * Retry binding server to random port in the resource staging server test. * Break if successful start * Start server in try block. * FIx scalastyle * More rigorous cleanup logic. Increment port numbers. * Move around more exception logic. * More exception refactoring. * Remove whitespace * Fix test * Rename variable
Configuration menu - View commit details
-
Copy full SHA for 7d0fa56 - Browse repository at this point
Copy the full SHA 7d0fa56View commit details -
set RestartPolicy=Never for executor (#367)
* set RestartPolicy=Never for executor As for current implementation the RestartPolicy of executor pod is not set, so the default value "OnFailure" is in effect. But this causes problem. If an executor is terminated unexpectedly, for example, exit by java.lang.OutOfMemoryError, it'll be restarted by k8s with the same executor ID. When the new executor tries to fetch a block hold by the last executor, ShuffleBlockFetcherIterator.splitLocalRemoteBlocks() think it's a **local** block and tries to read it from it's local dir. But the executor's local dir is changed because random generated ID is part of local dir. FetchFailedException will raise and the stage will fail. The rolling Error message: 17/06/29 01:54:56 WARN KubernetesTaskSetManager: Lost task 0.1 in stage 2.0 (TID 7, 172.16.75.92, executor 1): FetchFailed(BlockManagerId(1, 172.16.75.92, 40539, None), shuffleId=2, mapId=0, reduceId=0, message= org.apache.spark.shuffle.FetchFailedException: /data2/spark/blockmgr-0e228d3c-8727-422e-aa97-2841a877c42a/32/shuffle_2_0_0.index (No such file or directory) at org.apache.spark.storage.ShuffleBlockFetcherIterator.throwFetchFailedException(ShuffleBlockFetcherIterator.scala:357) at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:332) at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:54) at scala.collection.Iterator$$anon$11.next(Iterator.scala:409) * Update KubernetesClusterSchedulerBackend.scala
Configuration menu - View commit details
-
Copy full SHA for 4ffb4d6 - Browse repository at this point
Copy the full SHA 4ffb4d6View commit details -
Read classpath entries from SPARK_EXTRA_CLASSPATH on executors. (#383)
This makes executors consistent with the driver. Note that SPARK_EXTRA_CLASSPATH isn't set anywhere by Spark itself, but it's primarily meant to be set by images that inherit from the base driver/executor images.
Configuration menu - View commit details
-
Copy full SHA for e3b2360 - Browse repository at this point
Copy the full SHA e3b2360View commit details -
Changes to support executor recovery behavior during static allocatio…
…n. (#244) * Changes to support executor recovery behavior during static allocation. * addressed review comments * Style changes and removed inocrrectly merged code * addressed latest review comments * changed import order * Minor changes to avoid exceptions when exit code is missing * fixed style check * Addressed review comments from Yinan LiAddressed review comments from Yinan Li.. * Addressed comments and got rid of an explicit lock object. * Fixed imports order. * Addressed review comments from Matt * Couple of style fixes
Configuration menu - View commit details
-
Copy full SHA for 15e13f4 - Browse repository at this point
Copy the full SHA 15e13f4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 823bf0e - Browse repository at this point
Copy the full SHA 823bf0eView commit details -
Fix: changed signature of ExternalShuffleClient
Please enter the commit message for your changes. Lines starting
Configuration menu - View commit details
-
Copy full SHA for 436482e - Browse repository at this point
Copy the full SHA 436482eView commit details -
Configuration menu - View commit details
-
Copy full SHA for beb1361 - Browse repository at this point
Copy the full SHA beb1361View commit details
Commits on Jul 25, 2017
-
Merge pull request #388 from apache-spark-on-k8s/branch-2.2-kubernetes-g
Branch 2.2 kubernetes
Configuration menu - View commit details
-
Copy full SHA for a8330eb - Browse repository at this point
Copy the full SHA a8330ebView commit details
Commits on Jul 28, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 64f3ddd - Browse repository at this point
Copy the full SHA 64f3dddView commit details -
Add an entrypoint.sh script to add a passwd entry if one does not exi…
…st for the container UID
Configuration menu - View commit details
-
Copy full SHA for bce9b77 - Browse repository at this point
Copy the full SHA bce9b77View commit details
Commits on Jul 29, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 8ecff61 - Browse repository at this point
Copy the full SHA 8ecff61View commit details
Commits on Aug 1, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 702a8f6 - Browse repository at this point
Copy the full SHA 702a8f6View commit details
Commits on Aug 3, 2017
-
Merge pull request #407 from bloomberg/python-testing
PySpark Submission Failing on --py-files
Configuration menu - View commit details
-
Copy full SHA for 2c5d784 - Browse repository at this point
Copy the full SHA 2c5d784View commit details -
Merge pull request #404 from erikerlandson/anonymous-uids
An entrypoint.sh script to add a passwd entry if one does not exist
Configuration menu - View commit details
-
Copy full SHA for fa67455 - Browse repository at this point
Copy the full SHA fa67455View commit details
Commits on Aug 8, 2017
-
Exclude com.sun.jersey from docker-minimal-bundle. (#420)
This probably is not the correct fix long-term as we should find the specific module that is pulling in the bad jersey-1 version. But until we can track down what the specific offending module is, this will have to do.
Configuration menu - View commit details
-
Copy full SHA for 5fdaa7f - Browse repository at this point
Copy the full SHA 5fdaa7fView commit details -
Flag-guard expensive DNS lookup of cluster node full names, part of H…
…DFS locality support (#412) * Flag-guard expensive DNS lookup of cluster node full names, part of HDFS locality support * Clean up a bit * Improve unit tests
Configuration menu - View commit details
-
Copy full SHA for e3cfaa4 - Browse repository at this point
Copy the full SHA e3cfaa4View commit details -
Configuration menu - View commit details
-
Copy full SHA for bd50627 - Browse repository at this point
Copy the full SHA bd50627View commit details -
Initial architecture documentation. (#401)
* Initial architecture documentation. Initial full documentation for the submission client. Templates for the external shuffle service and the scheduler backend. * Add title to scheduler backend doc. * edits for PR review feedback
Configuration menu - View commit details
-
Copy full SHA for 24cd9ee - Browse repository at this point
Copy the full SHA 24cd9eeView commit details
Commits on Aug 9, 2017
-
Allow configuration to set environment variables on driver and execut…
…or (#424) * allow configuration to set environment variables on driver and executor as below: --conf spark.executorEnv.[EnvironmentVariableName] --conf spark.driverEnv.[EnvironmentVariableName] * change the driver environment key prefix to spark.kubernetes.driverEnv.
Configuration menu - View commit details
-
Copy full SHA for 372ae41 - Browse repository at this point
Copy the full SHA 372ae41View commit details -
Configuration menu - View commit details
-
Copy full SHA for 410dc9c - Browse repository at this point
Copy the full SHA 410dc9cView commit details -
4
Configuration menu - View commit details
-
Copy full SHA for 737abdc - Browse repository at this point
Copy the full SHA 737abdcView commit details
Commits on Aug 10, 2017
-
Revert "bump to 2.2.0-k8s-0.4.0-SNAPSHOT"
This reverts commit 737abdc.
Configuration menu - View commit details
-
Copy full SHA for a46b4a3 - Browse repository at this point
Copy the full SHA a46b4a3View commit details -
Configuration menu - View commit details
-
Copy full SHA for ff601a3 - Browse repository at this point
Copy the full SHA ff601a3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 19f49d0 - Browse repository at this point
Copy the full SHA 19f49d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 982760c - Browse repository at this point
Copy the full SHA 982760cView commit details
Commits on Aug 14, 2017
-
Configuration menu - View commit details
-
Copy full SHA for cb645ca - Browse repository at this point
Copy the full SHA cb645caView commit details -
Updated with documentation (#430)
Direct copy of revised design doc
Configuration menu - View commit details
-
Copy full SHA for 437eb89 - Browse repository at this point
Copy the full SHA 437eb89View commit details -
Merge pull request #431 from apache-spark-on-k8s/foxish-patch-2
Update external shuffle service docs
Configuration menu - View commit details
-
Copy full SHA for 6ab02e2 - Browse repository at this point
Copy the full SHA 6ab02e2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b3aeb7 - Browse repository at this point
Copy the full SHA 3b3aeb7View commit details -
Removed deprecated option from pom (#433)
This fixes local integration testing
Configuration menu - View commit details
-
Copy full SHA for 6e1d69e - Browse repository at this point
Copy the full SHA 6e1d69eView commit details
Commits on Aug 17, 2017
-
Support HDFS rack locality (#350)
* Support HDFS rack locality * Fix unit tests * Address review comments * Address some review comments * Use traits for InetAddress and RackResolver util classes * Disables expensive DNS lookup by default
Configuration menu - View commit details
-
Copy full SHA for c457f10 - Browse repository at this point
Copy the full SHA c457f10View commit details
Commits on Aug 18, 2017
-
Required for ./dev/check-license to pass
Configuration menu - View commit details
-
Copy full SHA for 4a322ad - Browse repository at this point
Copy the full SHA 4a322adView commit details
Commits on Aug 21, 2017
-
Configuration menu - View commit details
-
Copy full SHA for f8cf9db - Browse repository at this point
Copy the full SHA f8cf9dbView commit details -
Use a secret to mount small files in driver and executors. (#437)
* Use a secret to mount small files in driver and executors. Allows bypassing the resource staging server in a few scenarios. * Fix scalstyle * Address comments and add tests. * Lightly brush up formatting. * Make the working directory empty so that added files don't clobber existing binaries. * Address comments. * Drop testing file size to N+1 of the limit
Configuration menu - View commit details
-
Copy full SHA for 455317d - Browse repository at this point
Copy the full SHA 455317dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 58cebd1 - Browse repository at this point
Copy the full SHA 58cebd1View commit details
Commits on Aug 22, 2017
-
Configuration menu - View commit details
-
Copy full SHA for e44d81a - Browse repository at this point
Copy the full SHA e44d81aView commit details -
Use a list of environment variables for JVM options. (#444)
* Use a list of environment variables for JVM options. * Fix merge conflicts.
Configuration menu - View commit details
-
Copy full SHA for f7b5820 - Browse repository at this point
Copy the full SHA f7b5820View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7959fc5 - Browse repository at this point
Copy the full SHA 7959fc5View commit details
Commits on Aug 23, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 2cb2074 - Browse repository at this point
Copy the full SHA 2cb2074View commit details
Commits on Aug 24, 2017
-
Bumping versions to v2.2.0-kubernetes-0.3.0
Sahil Prasad committedAug 24, 2017 Configuration menu - View commit details
-
Copy full SHA for 0c160f5 - Browse repository at this point
Copy the full SHA 0c160f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for d6e922d - Browse repository at this point
Copy the full SHA d6e922dView commit details -
Merge pull request #460 from sahilprasad/bump-shuffle-version
Bumping version to v2.2.0-kubernetes-0.3.0
Configuration menu - View commit details
-
Copy full SHA for dca9b04 - Browse repository at this point
Copy the full SHA dca9b04View commit details -
Merge pull request #451 from kimoonkim/override-service-account
Support service account override
Configuration menu - View commit details
-
Copy full SHA for e600a07 - Browse repository at this point
Copy the full SHA e600a07View commit details
Commits on Aug 25, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 6177bf8 - Browse repository at this point
Copy the full SHA 6177bf8View commit details
Commits on Aug 30, 2017
-
Fix conversion from GB to MiB (#470)
* Fix conversion from GB to MiB Previously we applied a value which was in units MiB to Kubernetes _as if_ it were in MB units. Now, place MiB units in variable names and apply them to Kubernetes as the correct MiB units. * Fix test * Update tests
Configuration menu - View commit details
-
Copy full SHA for c6bc19d - Browse repository at this point
Copy the full SHA c6bc19dView commit details
Commits on Aug 31, 2017
-
spark-examples jar filename misses k8s-0.3.0 (#476)
`spark_examples_2.11-2.2.0.jar` should be `spark-examples_2.11-2.2.0-k8s-0.3.0.jar`
Configuration menu - View commit details
-
Copy full SHA for 1e63a60 - Browse repository at this point
Copy the full SHA 1e63a60View commit details -
Configuration menu - View commit details
-
Copy full SHA for d710563 - Browse repository at this point
Copy the full SHA d710563View commit details -
Set ENV_DRIVER_MEMORY to memory instead of memory+overhead (#475)
* Set ENV_DRIVER_MEMORY to memory instead of memory+overhead Signed-off-by: duyanghao <[email protected]> * Restore test
Configuration menu - View commit details
-
Copy full SHA for 728ba0a - Browse repository at this point
Copy the full SHA 728ba0aView commit details
Commits on Sep 4, 2017
-
Configuration menu - View commit details
-
Copy full SHA for bc845c3 - Browse repository at this point
Copy the full SHA bc845c3View commit details
Commits on Sep 6, 2017
-
Move executor pod construction to a separate class. (#452)
* Move executor pod construction to a separate class. This is the first of several measures to make KubernetesClusterSchedulerBackend feasible to test. * Revert change to README * Address comments. * Resolve merge conflicts. Move MiB change to ExecutorPodFactory.
Configuration menu - View commit details
-
Copy full SHA for fa02fb1 - Browse repository at this point
Copy the full SHA fa02fb1View commit details
Commits on Sep 7, 2017
-
Added configuration properties to inject arbitrary secrets into the d…
…river/executors (#479) * Added configuration properties to inject arbitrary secrets into the driver/executors * Addressed comments
Configuration menu - View commit details
-
Copy full SHA for 8b63aad - Browse repository at this point
Copy the full SHA 8b63aadView commit details -
Extract more of the shuffle management to a different class. (#454)
* Extract more of the shuffle management to a different class. More efforts to reduce the complexity of the KubernetesClusterSchedulerBackend. The scheduler backend should not be concerned about anything other than the coordination of the executor lifecycle. * Fix scalastyle * Add override annotation * Fix Java style * Remove unused imports. * Move volume index to the beginning to satisfy index * Address PR comments.
Configuration menu - View commit details
-
Copy full SHA for 6d7d798 - Browse repository at this point
Copy the full SHA 6d7d798View commit details
Commits on Sep 8, 2017
-
Unit Tests for KubernetesClusterSchedulerBackend (#459)
* Start unit tests for the scheduler backend. * More tests for the scheduler backend. * Unit tests and possible preemptive corrections to failover logic. * Address PR comments. * Resolve merge conflicts. Move MiB change to ExecutorPodFactory. * Revert accidental thread pool name change
Configuration menu - View commit details
-
Copy full SHA for 6053455 - Browse repository at this point
Copy the full SHA 6053455View commit details -
Use a headless service to give a hostname to the driver. (#483)
* Use a headless service to give a hostname to the driver. Required since SPARK-21642 was added upstream. * Fix scalastyle. * Add back import * Fix conflict properly. * Fix orchestrator test.
Configuration menu - View commit details
-
Copy full SHA for 6cebfb6 - Browse repository at this point
Copy the full SHA 6cebfb6View commit details
Commits on Sep 15, 2017
-
Code enhancement: Replaced explicit synchronized access to a hashmap …
…with a concurrent map. (#392) * Replaced explicit synchronized access to hashmap with a concurrent map * Removed usages of scala.collection.concurrent.Map
Configuration menu - View commit details
-
Copy full SHA for e5838c1 - Browse repository at this point
Copy the full SHA e5838c1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2cb0b04 - Browse repository at this point
Copy the full SHA 2cb0b04View commit details
Commits on Sep 16, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 52fe7f5 - Browse repository at this point
Copy the full SHA 52fe7f5View commit details -
Fail submission if submitter-local files are provided without resourc… (
#447) * Fail submission if submitter-local files are provided without resource staging server URI * Modified logic to validate only submitted jars; added orchestrator tests * Incorporated feedback * Fix failing test case
Configuration menu - View commit details
-
Copy full SHA for 8a0f485 - Browse repository at this point
Copy the full SHA 8a0f485View commit details
Commits on Sep 21, 2017
-
* Rename package to k8s * Rename string constants
Configuration menu - View commit details
-
Copy full SHA for 7477cbe - Browse repository at this point
Copy the full SHA 7477cbeView commit details
Commits on Sep 22, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 3eb04bb - Browse repository at this point
Copy the full SHA 3eb04bbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3c1a16a - Browse repository at this point
Copy the full SHA 3c1a16aView commit details
Commits on Sep 25, 2017
-
Update poms for 2.2 release 0.4.0 (#508)
* Update POMs * Update extensions/v1beta1.Deployment to apps * Modified defaults on rss and ss
Configuration menu - View commit details
-
Copy full SHA for 562f301 - Browse repository at this point
Copy the full SHA 562f301View commit details
Commits on Sep 26, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 3c7dec5 - Browse repository at this point
Copy the full SHA 3c7dec5View commit details
Commits on Oct 10, 2017
-
Add unit-testing for executorpodfactory (#491)
* Unit test for executorpodfactory * Fix test * Indentation fix * Fix isEmpty and split between lines * Address issues with multi-line code fragments * Replace == with === * mock shuffleManager * .kubernetes. => .k8s. * move to k8s subdir * fix package clause to k8s * mock nodeAffinityExecutorPodModifier * remove commented code * move when clause to before{} block * mock initContainerBootstrap, smallFiles * insert actual logic into smallFiles mock * verify application of nodeAffinityExecutorPodModifier * avoid cumulative invocation * Fixed env-var check to include values, removed mock for small files
Configuration menu - View commit details
-
Copy full SHA for 887fdce - Browse repository at this point
Copy the full SHA 887fdceView commit details
Commits on Oct 16, 2017
-
Mount emptyDir volumes for temporary directories on executors in stat…
…ic allocation mode (rebased) (#522) * Use emptyDir volume mounts for executor local directories. * Mount local dirs in the driver. Remove shuffle dir configuration. * Arrange imports * Fix style and integration tests. * Add TODO note for volume types to change. * Add unit test and extra documentation. * Fix existing unit tests and add tests for empty dir volumes * Remove extraneous constant
Configuration menu - View commit details
-
Copy full SHA for 49932d6 - Browse repository at this point
Copy the full SHA 49932d6View commit details
Commits on Oct 18, 2017
-
* initial R support without integration tests * finished sparkR integration * case sensitive file names in unix * revert back to previous lower case in dockerfile * addition into the build-push-docker-images
Configuration menu - View commit details
-
Copy full SHA for f94499b - Browse repository at this point
Copy the full SHA f94499bView commit details
Commits on Oct 20, 2017
-
Use the new initContainers field instead of the deprecated annotation (…
…#528) * Use the new initContainers field in Kubernetes 1.8 * Fixed the integration tests
Configuration menu - View commit details
-
Copy full SHA for 0abf0b9 - Browse repository at this point
Copy the full SHA 0abf0b9View commit details
Commits on Oct 26, 2017
-
Use the driver pod IP address for spark.driver.bindAddress (#533)
* Use the driver pod IP address for spark.driver.bindAddress * Addressed comments * Addressed more comments * Fixed broken DriverServiceBootstrapStepSuite
Configuration menu - View commit details
-
Copy full SHA for 6b1caca - Browse repository at this point
Copy the full SHA 6b1cacaView commit details
Commits on Oct 27, 2017
-
Configuration menu - View commit details
-
Copy full SHA for b008be3 - Browse repository at this point
Copy the full SHA b008be3View commit details
Commits on Nov 7, 2017
-
Add quotes around $SPARK_CLASSPATH in Dockerfile java commands (#541)
The quotes around $SPARK_CLASSPATH in the Dockerfiles prevents the shell from expanding wildcard paths in cases where the classpath is a single value like /opt/spark/jars/*
Configuration menu - View commit details
-
Copy full SHA for 8f73508 - Browse repository at this point
Copy the full SHA 8f73508View commit details
Commits on Nov 8, 2017
-
Spark Submit changes and test (#542)
* Spark Submit Unit tests * Improvements * Add missing options * Added check for jar
Configuration menu - View commit details
-
Copy full SHA for 3ff2cbb - Browse repository at this point
Copy the full SHA 3ff2cbbView commit details
Commits on Nov 17, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 5fd1304 - Browse repository at this point
Copy the full SHA 5fd1304View commit details
Commits on Nov 28, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 15a333c - Browse repository at this point
Copy the full SHA 15a333cView commit details
Commits on Dec 4, 2017
-
Allow user-specified environment variables and secrets in the init-co…
…ntainer (#564) * Allow setting user-specified environments in the init-container * Use driver/executor env keys for the init-container * Mount user-specified driver/executor secrets * Addressed comments
Configuration menu - View commit details
-
Copy full SHA for 0612195 - Browse repository at this point
Copy the full SHA 0612195View commit details
Commits on Dec 12, 2017
-
Basic Secure HDFS Support [514] (#540)
* first stage of PR #514 of just logic * fixing build and unit test issues * fixed integration tests * fixed issue with executorPodFactory unit tests * first series of PR comments * handle most PR comments * third round of PR comments * initial round of comments and initial unit tests for deploy * handled most of the comments and added test cases for pods * resolve conflicts * merge conflicts * adding thread sleeping for RSS issues as a test * resolving comments and unit testing * regarding comments on PR
Configuration menu - View commit details
-
Copy full SHA for 246b885 - Browse repository at this point
Copy the full SHA 246b885View commit details
Commits on Dec 20, 2017
-
Configuration menu - View commit details
-
Copy full SHA for 6428bb9 - Browse repository at this point
Copy the full SHA 6428bb9View commit details
Commits on Jan 2, 2018
-
Configuration menu - View commit details
-
Copy full SHA for 6d724a9 - Browse repository at this point
Copy the full SHA 6d724a9View commit details
Commits on Jan 9, 2018
-
Configuration menu - View commit details
-
Copy full SHA for d7dd259 - Browse repository at this point
Copy the full SHA d7dd259View commit details
Commits on Jan 12, 2018
-
Avoids adding duplicated secret volumes when init-container is used (#…
…597) * Avoids adding duplicated secret volumes when init-container is used Cherry-picked from apache#20148. * Added the missing commit from upstream
Configuration menu - View commit details
-
Copy full SHA for 12d590c - Browse repository at this point
Copy the full SHA 12d590cView commit details
Commits on Feb 3, 2018
-
Create ISSUE_TEMPLATE.md (#608)
* Create ISSUE_TEMPLATE.md * add dev mailing list and jira links
Configuration menu - View commit details
-
Copy full SHA for 90a204c - Browse repository at this point
Copy the full SHA 90a204cView commit details
Commits on Feb 4, 2018
-
Add message to redirect PRs upstream if possible (#607)
* Add message to redirect PRs upstream if possible We want to re-direct community dev upstream as much as possible. However, some contributions impact components (e.g. shuffle server) that do not yet exist upstream. To handle this, we decided to add this message and leave it up to developers, but encourage them to submit upstream unless it isn't feasible. * Add dev mailling list and jira links
Configuration menu - View commit details
-
Copy full SHA for a0117ea - Browse repository at this point
Copy the full SHA a0117eaView commit details
Commits on Feb 9, 2018
-
remove camel case naming in kerberos secret names (#612)
The names are currently used when HadoopKerberosKeytabResolverStep tries to safe the kerberos delegation token into a kubernete secret. However, the current camel case values will cause a io.fabric8.kubernetes.client.KubernetesClientException stating the following: a DNS-1123 subdomain must consist of lower case alphanumeric characters, '-' or '.', and must start and end with an alphanumeric character (e.g. 'example.com', regex used for validation is '[a-z0-9]([-a-z0-9]*[a-z0-9])?(\.[a-z0-9]([-a-z0-9]*[a-z0-9])?)*')
Configuration menu - View commit details
-
Copy full SHA for 7b8c9f5 - Browse repository at this point
Copy the full SHA 7b8c9f5View commit details
Commits on Jun 8, 2018
-
Configuration menu - View commit details
-
Copy full SHA for a218f3d - Browse repository at this point
Copy the full SHA a218f3dView commit details
Commits on Jan 8, 2020
-
Configuration menu - View commit details
-
Copy full SHA for 72cf35d - Browse repository at this point
Copy the full SHA 72cf35dView commit details