RubyGems - google-cloud-dataproc - Versions diffs - 0.3.0 → 0.3.1 - Mend

google-cloud-dataproc 0.3.0 → 0.3.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (31) hide show

checksums.yaml CHANGED

@@ -1,7 +1,7 @@
 ---
 SHA256:
-  metadata.gz: 4f6730b23eacab9d232b6a9dbdc22f5fa73f9326a98ddbef2b881b6e5e61bedf
-  data.tar.gz: 06ab96ebf91d023a2d5d23fc4aa74a0b0271be19accf86dbd0ea4899b5529bad
+  metadata.gz: 433b67960fde33d581be68d4316747f4ee70009aabb1c6e89029410a627a947a
+  data.tar.gz: e10c766af7e68ea6e1ab8d5471c0153bcbb32487f8240ff5bbe7bbc34e4b1849
 SHA512:
-  metadata.gz: ecdef56467fb0bf3d650ceb63cc9e7cb2e45268c7b5018742830d322a6d115a402f74dae9e84b62e1c71a4be32fe1407562cdbebf36606af6dd3e8cd97e0aa25
-  data.tar.gz: '0782a97fa3291a926790bafd142b9ea9164a4fe7b99ad007090aca801f703a8e9ced0c63c9c6e87df1df9033698cd24110c58147da22d3d931ba54e57e86f81e'
+  metadata.gz: 4688350b720a26778e6751f65408f08a76848504eb5c7c2f729841736cb8004c104b274fbd4e31d47b34ee073a771a38500416da149e558cccc03d3d42018b4a
+  data.tar.gz: 9edfe8721650d1dab30ab294f5174be7d37037198bc1b37de3e56d7b2114a317e38ae03c4d7a03cf716b5659f8ab45e88075616f8fe06a55fea14b2e22b478bd

data/.yardopts CHANGED

@@ -7,3 +7,5 @@
 ./lib/**/*.rb
 -
 README.md
+AUTHENTICATION.md
+LICENSE

data/AUTHENTICATION.md ADDED

@@ -0,0 +1,199 @@
+# Authentication
+In general, the google-cloud-dataproc library uses [Service
+Account](https://cloud.google.com/iam/docs/creating-managing-service-accounts)
+credentials to connect to Google Cloud services. When running within [Google
+Cloud Platform environments](#google-cloud-platform-environments)
+the credentials will be discovered automatically. When running on other
+environments, the Service Account credentials can be specified by providing the
+path to the [JSON
+keyfile](https://cloud.google.com/iam/docs/managing-service-account-keys) for
+the account (or the JSON itself) in [environment
+variables](#environment-variables). Additionally, Cloud SDK credentials can also
+be discovered automatically, but this is only recommended during development.
+## Quickstart
+1. [Create a service account and credentials](#creating-a-service-account).
+2. Set the [environment variable](#environment-variables).
+```sh
+export DATAPROC_CREDENTIALS=/path/to/json`
+```
+3. Initialize the client.
+```ruby
+require "google/cloud/dataproc"
+client = Google::Cloud::Dataproc.new
+```
+## Project and Credential Lookup
+The google-cloud-dataproc library aims to make authentication
+as simple as possible, and provides several mechanisms to configure your system
+without providing **Project ID** and **Service Account Credentials** directly in
+code.
+**Project ID** is discovered in the following order:
+1. Specify project ID in method arguments
+2. Specify project ID in configuration
+3. Discover project ID in environment variables
+4. Discover GCE project ID
+5. Discover project ID in credentials JSON
+**Credentials** are discovered in the following order:
+1. Specify credentials in method arguments
+2. Specify credentials in configuration
+3. Discover credentials path in environment variables
+4. Discover credentials JSON in environment variables
+5. Discover credentials file in the Cloud SDK's path
+6. Discover GCE credentials
+### Google Cloud Platform environments
+While running on Google Cloud Platform environments such as Google Compute
+Engine, Google App Engine and Google Kubernetes Engine, no extra work is needed.
+The **Project ID** and **Credentials** and are discovered automatically. Code
+should be written as if already authenticated. Just be sure when you [set up the
+GCE instance][gce-how-to], you add the correct scopes for the APIs you want to
+access. For example:
+  * **All APIs**
+    * `https://www.googleapis.com/auth/cloud-platform`
+    * `https://www.googleapis.com/auth/cloud-platform.read-only`
+  * **BigQuery**
+    * `https://www.googleapis.com/auth/bigquery`
+    * `https://www.googleapis.com/auth/bigquery.insertdata`
+  * **Compute Engine**
+    * `https://www.googleapis.com/auth/compute`
+  * **Datastore**
+    * `https://www.googleapis.com/auth/datastore`
+    * `https://www.googleapis.com/auth/userinfo.email`
+  * **DNS**
+    * `https://www.googleapis.com/auth/ndev.clouddns.readwrite`
+  * **Pub/Sub**
+    * `https://www.googleapis.com/auth/pubsub`
+  * **Storage**
+    * `https://www.googleapis.com/auth/devstorage.full_control`
+    * `https://www.googleapis.com/auth/devstorage.read_only`
+    * `https://www.googleapis.com/auth/devstorage.read_write`
+### Environment Variables
+The **Project ID** and **Credentials JSON** can be placed in environment
+variables instead of declaring them directly in code. Each service has its own
+environment variable, allowing for different service accounts to be used for
+different services. (See the READMEs for the individual service gems for
+details.) The path to the **Credentials JSON** file can be stored in the
+environment variable, or the **Credentials JSON** itself can be stored for
+environments such as Docker containers where writing files is difficult or not
+encouraged.
+The environment variables that google-cloud-dataproc checks for project ID are:
+1. `DATAPROC_PROJECT`
+2. `GOOGLE_CLOUD_PROJECT`
+The environment variables that google-cloud-dataproc checks for credentials are configured on {Google::Cloud::Dataproc::V1::Credentials}:
+1. `DATAPROC_CREDENTIALS` - Path to JSON file, or JSON contents
+2. `DATAPROC_KEYFILE` - Path to JSON file, or JSON contents
+3. `GOOGLE_CLOUD_CREDENTIALS` - Path to JSON file, or JSON contents
+4. `GOOGLE_CLOUD_KEYFILE` - Path to JSON file, or JSON contents
+5. `GOOGLE_APPLICATION_CREDENTIALS` - Path to JSON file
+```ruby
+require "google/cloud/dataproc"
+ENV["DATAPROC_PROJECT"]     = "my-project-id"
+ENV["DATAPROC_CREDENTIALS"] = "path/to/keyfile.json"
+client = Google::Cloud::Dataproc.new
+```
+### Configuration
+The **Project ID** and **Credentials JSON** can be configured instead of placing them in environment variables or providing them as arguments.
+```ruby
+require "google/cloud/dataproc"
+Google::Cloud::Dataproc.configure do |config|
+  config.project_id  = "my-project-id"
+  config.credentials = "path/to/keyfile.json"
+end
+client = Google::Cloud::Dataproc.new
+```
+### Cloud SDK
+This option allows for an easy way to authenticate during development. If
+credentials are not provided in code or in environment variables, then Cloud SDK
+credentials are discovered.
+To configure your system for this, simply:
+1. [Download and install the Cloud SDK](https://cloud.google.com/sdk)
+2. Authenticate using OAuth 2.0 `$ gcloud auth login`
+3. Write code as if already authenticated.
+**NOTE:** This is _not_ recommended for running in production. The Cloud SDK
+*should* only be used during development.
+[gce-how-to]: https://cloud.google.com/compute/docs/authentication#using
+[dev-console]: https://console.cloud.google.com/project
+[enable-apis]: https://raw.githubusercontent.com/GoogleCloudPlatform/gcloud-common/master/authentication/enable-apis.png
+[create-new-service-account]: https://raw.githubusercontent.com/GoogleCloudPlatform/gcloud-common/master/authentication/create-new-service-account.png
+[create-new-service-account-existing-keys]: https://raw.githubusercontent.com/GoogleCloudPlatform/gcloud-common/master/authentication/create-new-service-account-existing-keys.png
+[reuse-service-account]: https://raw.githubusercontent.com/GoogleCloudPlatform/gcloud-common/master/authentication/reuse-service-account.png
+## Creating a Service Account
+Google Cloud requires a **Project ID** and **Service Account Credentials** to
+connect to the APIs. You will use the **Project ID** and **JSON key file** to
+connect to most services with google-cloud-dataproc.
+If you are not running this client within [Google Cloud Platform
+environments](#google-cloud-platform-environments), you need a Google
+Developers service account.
+1. Visit the [Google Developers Console][dev-console].
+1. Create a new project or click on an existing project.
+1. Activate the slide-out navigation tray and select **API Manager**. From
+   here, you will enable the APIs that your application requires.
+   ![Enable the APIs that your application requires][enable-apis]
+   *Note: You may need to enable billing in order to use these services.*
+1. Select **Credentials** from the side navigation.
+   You should see a screen like one of the following.
+   ![Create a new service account][create-new-service-account]
+   ![Create a new service account With Existing Keys][create-new-service-account-existing-keys]
+   Find the "Add credentials" drop down and select "Service account" to be
+   guided through downloading a new JSON key file.
+   If you want to re-use an existing service account, you can easily generate a
+   new key file. Just select the account you wish to re-use, and click "Generate
+   new JSON key":
+   ![Re-use an existing service account][reuse-service-account]
+   The key file you download will be used by this library to authenticate API
+   requests and should be stored in a secure location.
+## Troubleshooting
+If you're having trouble authenticating you can ask for help by following the
+{file:TROUBLESHOOTING.md Troubleshooting Guide}.

data/lib/google/cloud/dataproc/v1/cluster_controller_client.rb CHANGED

@@ -234,10 +234,11 @@ module Google
           #   can also be provided.
           # @param request_id [String]
           #   Optional. A unique id used to identify the request. If the server
-          #   receives two {Google::Cloud::Dataproc::V1::CreateClusterRequest CreateClusterRequest} requests  with the same
-          #   id, then the second request will be ignored and the
-          #   first {Google::Longrunning::Operation} created and stored in the backend
-          #   is returned.
+          #   receives two
+          #   {Google::Cloud::Dataproc::V1::CreateClusterRequest CreateClusterRequest}
+          #   requests  with the same id, then the second request will be ignored and the
+          #   first {Google::Longrunning::Operation} created
+          #   and stored in the backend is returned.
           #
           #   It is recommended to always set this value to a
           #   [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier).
@@ -390,10 +391,11 @@ module Google
           #   can also be provided.
           # @param request_id [String]
           #   Optional. A unique id used to identify the request. If the server
-          #   receives two {Google::Cloud::Dataproc::V1::UpdateClusterRequest UpdateClusterRequest} requests  with the same
-          #   id, then the second request will be ignored and the
-          #   first {Google::Longrunning::Operation} created and stored in the
-          #   backend is returned.
+          #   receives two
+          #   {Google::Cloud::Dataproc::V1::UpdateClusterRequest UpdateClusterRequest}
+          #   requests  with the same id, then the second request will be ignored and the
+          #   first {Google::Longrunning::Operation} created
+          #   and stored in the backend is returned.
           #
           #   It is recommended to always set this value to a
           #   [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier).
@@ -496,10 +498,11 @@ module Google
           #   (with error NOT_FOUND) if cluster with specified UUID does not exist.
           # @param request_id [String]
           #   Optional. A unique id used to identify the request. If the server
-          #   receives two {Google::Cloud::Dataproc::V1::DeleteClusterRequest DeleteClusterRequest} requests  with the same
-          #   id, then the second request will be ignored and the
-          #   first {Google::Longrunning::Operation} created and stored in the
-          #   backend is returned.
+          #   receives two
+          #   {Google::Cloud::Dataproc::V1::DeleteClusterRequest DeleteClusterRequest}
+          #   requests  with the same id, then the second request will be ignored and the
+          #   first {Google::Longrunning::Operation} created
+          #   and stored in the backend is returned.
           #
           #   It is recommended to always set this value to a
           #   [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier).

data/lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/clusters.rb CHANGED

@@ -36,8 +36,9 @@ module Google
         #     Label **keys** must contain 1 to 63 characters, and must conform to
         #     [RFC 1035](https://www.ietf.org/rfc/rfc1035.txt).
         #     Label **values** may be empty, but, if present, must contain 1 to 63
-        #     characters, and must conform to [RFC 1035](https://www.ietf.org/rfc/rfc1035.txt).
-        #     No more than 32 labels can be associated with a cluster.
+        #     characters, and must conform to [RFC
+        #     1035](https://www.ietf.org/rfc/rfc1035.txt). No more than 32 labels can be
+        #     associated with a cluster.
         # @!attribute [rw] status
         #   @return [Google::Cloud::Dataproc::V1::ClusterStatus]
         #     Output only. Cluster status.
@@ -52,8 +53,8 @@ module Google
         #   @return [Google::Cloud::Dataproc::V1::ClusterMetrics]
         #     Contains cluster daemon metrics such as HDFS and YARN stats.
         #
-        #     **Beta Feature**: This report is available for testing purposes only. It may
-        #     be changed before final release.
+        #     **Beta Feature**: This report is available for testing purposes only. It
+        #     may be changed before final release.
         class Cluster; end
         # The cluster config.
@@ -89,9 +90,11 @@ module Google
         #     Optional. Commands to execute on each node after config is
         #     completed. By default, executables are run on master and all worker nodes.
         #     You can test a node's `role` metadata to run an executable on
-        #     a master or worker node, as shown below using `curl` (you can also use `wget`):
+        #     a master or worker node, as shown below using `curl` (you can also use
+        #     `wget`):
         #
-        #         ROLE=$(curl -H Metadata-Flavor:Google http://metadata/computeMetadata/v1/instance/attributes/dataproc-role)
+        #         ROLE=$(curl -H Metadata-Flavor:Google
+        #         http://metadata/computeMetadata/v1/instance/attributes/dataproc-role)
         #         if [[ "${ROLE}" == 'Master' ]]; then
         #           ... master specific actions ...
         #         else
@@ -150,11 +153,11 @@ module Google
         # @!attribute [rw] internal_ip_only
         #   @return [true, false]
         #     Optional. If true, all instances in the cluster will only have internal IP
-        #     addresses. By default, clusters are not restricted to internal IP addresses,
-        #     and will have ephemeral external IP addresses assigned to each instance.
-        #     This `internal_ip_only` restriction can only be enabled for subnetwork
-        #     enabled networks, and all off-cluster dependencies must be configured to be
-        #     accessible without external IP addresses.
+        #     addresses. By default, clusters are not restricted to internal IP
+        #     addresses, and will have ephemeral external IP addresses assigned to each
+        #     instance. This `internal_ip_only` restriction can only be enabled for
+        #     subnetwork enabled networks, and all off-cluster dependencies must be
+        #     configured to be accessible without external IP addresses.
         # @!attribute [rw] service_account
         #   @return [String]
         #     Optional. The service account of the instances. Defaults to the default
@@ -164,7 +167,8 @@ module Google
         #     * roles/logging.logWriter
         #     * roles/storage.objectAdmin
         #
-        #     (see https://cloud.google.com/compute/docs/access/service-accounts#custom_service_accounts
+        #     (see
+        #     https://cloud.google.com/compute/docs/access/service-accounts#custom_service_accounts
         #     for more information).
         #     Example: `[account_id]@[project_id].iam.gserviceaccount.com`
         # @!attribute [rw] service_account_scopes
@@ -190,7 +194,8 @@ module Google
         # @!attribute [rw] metadata
         #   @return [Hash{String => String}]
         #     The Compute Engine metadata entries to add to all instances (see
-        #     [Project and instance metadata](https://cloud.google.com/compute/docs/storing-retrieving-metadata#project_and_instance_metadata)).
+        #     [Project and instance
+        #     metadata](https://cloud.google.com/compute/docs/storing-retrieving-metadata#project_and_instance_metadata)).
         class GceClusterConfig; end
         # Optional. The config settings for Compute Engine resources in
@@ -219,7 +224,8 @@ module Google
         #     * `n1-standard-2`
         #
         #     **Auto Zone Exception**: If you are using the Cloud Dataproc
-        #     [Auto Zone Placement](https://cloud.google.com/dataproc/docs/concepts/configuring-clusters/auto-zone#using_auto_zone_placement)
+        #     [Auto Zone
+        #     Placement](/dataproc/docs/concepts/configuring-clusters/auto-zone#using_auto_zone_placement)
         #     feature, you must use the short name of the machine type
         #     resource, for example, `n1-standard-2`.
         # @!attribute [rw] disk_config
@@ -227,7 +233,8 @@ module Google
         #     Optional. Disk option config settings.
         # @!attribute [rw] is_preemptible
         #   @return [true, false]
-        #     Optional. Specifies that this instance group contains preemptible instances.
+        #     Optional. Specifies that this instance group contains preemptible
+        #     instances.
         # @!attribute [rw] managed_group_config
         #   @return [Google::Cloud::Dataproc::V1::ManagedGroupConfig]
         #     Output only. The config for Compute Engine Instance Group
@@ -258,7 +265,8 @@ module Google
         #   @return [String]
         #     Full URL, partial URI, or short name of the accelerator type resource to
         #     expose to this instance. See
-        #     [Compute Engine AcceleratorTypes](https://cloud.google.com/compute/docs/reference/beta/acceleratorTypes).
+        #     [Compute Engine
+        #     AcceleratorTypes](/compute/docs/reference/beta/acceleratorTypes).
         #
         #     Examples:
         #
@@ -267,7 +275,8 @@ module Google
         #     * `nvidia-tesla-k80`
         #
         #     **Auto Zone Exception**: If you are using the Cloud Dataproc
-        #     [Auto Zone Placement](https://cloud.google.com/dataproc/docs/concepts/configuring-clusters/auto-zone#using_auto_zone_placement)
+        #     [Auto Zone
+        #     Placement](/dataproc/docs/concepts/configuring-clusters/auto-zone#using_auto_zone_placement)
         #     feature, you must use the short name of the accelerator type
         #     resource, for example, `nvidia-tesla-k80`.
         # @!attribute [rw] accelerator_count
@@ -366,10 +375,12 @@ module Google
         # Specifies the selection and config of software inside the cluster.
         # @!attribute [rw] image_version
         #   @return [String]
-        #     Optional. The version of software inside the cluster. It must be one of the supported
-        #     [Cloud Dataproc Versions](https://cloud.google.com/dataproc/docs/concepts/versioning/dataproc-versions#supported_cloud_dataproc_versions),
+        #     Optional. The version of software inside the cluster. It must be one of the
+        #     supported [Cloud Dataproc
+        #     Versions](/dataproc/docs/concepts/versioning/dataproc-versions#supported_cloud_dataproc_versions),
         #     such as "1.2" (including a subminor version, such as "1.2.29"), or the
-        #     ["preview" version](https://cloud.google.com/dataproc/docs/concepts/versioning/dataproc-versions#other_versions).
+        #     ["preview"
+        #     version](/dataproc/docs/concepts/versioning/dataproc-versions#other_versions).
         #     If unspecified, it defaults to the latest version.
         # @!attribute [rw] properties
         #   @return [Hash{String => String}]
@@ -419,10 +430,11 @@ module Google
         # @!attribute [rw] request_id
         #   @return [String]
         #     Optional. A unique id used to identify the request. If the server
-        #     receives two {Google::Cloud::Dataproc::V1::CreateClusterRequest CreateClusterRequest} requests  with the same
-        #     id, then the second request will be ignored and the
-        #     first {Google::Longrunning::Operation} created and stored in the backend
-        #     is returned.
+        #     receives two
+        #     {Google::Cloud::Dataproc::V1::CreateClusterRequest CreateClusterRequest}
+        #     requests  with the same id, then the second request will be ignored and the
+        #     first {Google::Longrunning::Operation} created
+        #     and stored in the backend is returned.
         #
         #     It is recommended to always set this value to a
         #     [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier).
@@ -507,10 +519,11 @@ module Google
         # @!attribute [rw] request_id
         #   @return [String]
         #     Optional. A unique id used to identify the request. If the server
-        #     receives two {Google::Cloud::Dataproc::V1::UpdateClusterRequest UpdateClusterRequest} requests  with the same
-        #     id, then the second request will be ignored and the
-        #     first {Google::Longrunning::Operation} created and stored in the
-        #     backend is returned.
+        #     receives two
+        #     {Google::Cloud::Dataproc::V1::UpdateClusterRequest UpdateClusterRequest}
+        #     requests  with the same id, then the second request will be ignored and the
+        #     first {Google::Longrunning::Operation} created
+        #     and stored in the backend is returned.
         #
         #     It is recommended to always set this value to a
         #     [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier).
@@ -537,10 +550,11 @@ module Google
         # @!attribute [rw] request_id
         #   @return [String]
         #     Optional. A unique id used to identify the request. If the server
-        #     receives two {Google::Cloud::Dataproc::V1::DeleteClusterRequest DeleteClusterRequest} requests  with the same
-        #     id, then the second request will be ignored and the
-        #     first {Google::Longrunning::Operation} created and stored in the
-        #     backend is returned.
+        #     receives two
+        #     {Google::Cloud::Dataproc::V1::DeleteClusterRequest DeleteClusterRequest}
+        #     requests  with the same id, then the second request will be ignored and the
+        #     first {Google::Longrunning::Operation} created
+        #     and stored in the backend is returned.
         #
         #     It is recommended to always set this value to a
         #     [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier).

data/lib/google/cloud/dataproc/v1/doc/google/cloud/dataproc/v1/jobs.rb CHANGED

@@ -59,8 +59,10 @@ module Google
         end
         # A Cloud Dataproc job for running
-        # [Apache Hadoop MapReduce](https://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapReduceTutorial.html)
-        # jobs on [Apache Hadoop YARN](https://hadoop.apache.org/docs/r2.7.1/hadoop-yarn/hadoop-yarn-site/YARN.html).
+        # [Apache Hadoop
+        # MapReduce](https://hadoop.apache.org/docs/current/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapReduceTutorial.html)
+        # jobs on [Apache Hadoop
+        # YARN](https://hadoop.apache.org/docs/r2.7.1/hadoop-yarn/hadoop-yarn-site/YARN.html).
         # @!attribute [rw] main_jar_file_uri
         #   @return [String]
         #     The HCFS URI of the jar file containing the main class.
@@ -75,8 +77,8 @@ module Google
         # @!attribute [rw] args
         #   @return [Array<String>]
         #     Optional. The arguments to pass to the driver. Do not
-        #     include arguments, such as `-libjars` or `-Dfoo=bar`, that can be set as job
-        #     properties, since a collision may occur that causes an incorrect job
+        #     include arguments, such as `-libjars` or `-Dfoo=bar`, that can be set as
+        #     job properties, since a collision may occur that causes an incorrect job
         #     submission.
         # @!attribute [rw] jar_file_uris
         #   @return [Array<String>]
@@ -142,7 +144,8 @@ module Google
         class SparkJob; end
         # A Cloud Dataproc job for running
-        # [Apache PySpark](https://spark.apache.org/docs/0.9.0/python-programming-guide.html)
+        # [Apache
+        # PySpark](https://spark.apache.org/docs/0.9.0/python-programming-guide.html)
         # applications on YARN.
         # @!attribute [rw] main_python_file_uri
         #   @return [String]
@@ -210,8 +213,8 @@ module Google
         # @!attribute [rw] continue_on_failure
         #   @return [true, false]
         #     Optional. Whether to continue executing queries if a query fails.
-        #     The default value is `false`. Setting to `true` can be useful when executing
-        #     independent parallel queries.
+        #     The default value is `false`. Setting to `true` can be useful when
+        #     executing independent parallel queries.
         # @!attribute [rw] script_variables
         #   @return [Hash{String => String}]
         #     Optional. Mapping of query variable names to values (equivalent to the
@@ -229,8 +232,8 @@ module Google
         #     and UDFs.
         class HiveJob; end
-        # A Cloud Dataproc job for running [Apache Spark SQL](http://spark.apache.org/sql/)
-        # queries.
+        # A Cloud Dataproc job for running [Apache Spark
+        # SQL](http://spark.apache.org/sql/) queries.
         # @!attribute [rw] query_file_uri
         #   @return [String]
         #     The HCFS URI of the script that contains SQL queries.
@@ -265,8 +268,8 @@ module Google
         # @!attribute [rw] continue_on_failure
         #   @return [true, false]
         #     Optional. Whether to continue executing queries if a query fails.
-        #     The default value is `false`. Setting to `true` can be useful when executing
-        #     independent parallel queries.
+        #     The default value is `false`. Setting to `true` can be useful when
+        #     executing independent parallel queries.
         # @!attribute [rw] script_variables
         #   @return [Hash{String => String}]
         #     Optional. Mapping of query variable names to values (equivalent to the Pig
@@ -484,8 +487,8 @@ module Google
         #   @return [Array<Google::Cloud::Dataproc::V1::YarnApplication>]
         #     Output only. The collection of YARN applications spun up by this job.
         #
-        #     **Beta** Feature: This report is available for testing purposes only. It may
-        #     be changed before final release.
+        #     **Beta** Feature: This report is available for testing purposes only. It
+        #     may be changed before final release.
         # @!attribute [rw] driver_output_resource_uri
         #   @return [String]
         #     Output only. A URI pointing to the location of the stdout of the job's
@@ -501,8 +504,9 @@ module Google
         #     Label **keys** must contain 1 to 63 characters, and must conform to
         #     [RFC 1035](https://www.ietf.org/rfc/rfc1035.txt).
         #     Label **values** may be empty, but, if present, must contain 1 to 63
-        #     characters, and must conform to [RFC 1035](https://www.ietf.org/rfc/rfc1035.txt).
-        #     No more than 32 labels can be associated with a job.
+        #     characters, and must conform to [RFC
+        #     1035](https://www.ietf.org/rfc/rfc1035.txt). No more than 32 labels can be
+        #     associated with a job.
         # @!attribute [rw] scheduling
         #   @return [Google::Cloud::Dataproc::V1::JobScheduling]
         #     Optional. Job scheduling configuration.
@@ -540,8 +544,8 @@ module Google
         # @!attribute [rw] request_id
         #   @return [String]
         #     Optional. A unique id used to identify the request. If the server
-        #     receives two {Google::Cloud::Dataproc::V1::SubmitJobRequest SubmitJobRequest} requests  with the same
-        #     id, then the second request will be ignored and the
+        #     receives two {Google::Cloud::Dataproc::V1::SubmitJobRequest SubmitJobRequest}
+        #     requests  with the same id, then the second request will be ignored and the
         #     first {Google::Cloud::Dataproc::V1::Job Job} created and stored in the backend
         #     is returned.
         #