gcloud_alpha_dataproc_jobs_submit (1)
NAME
gcloud alpha dataproc jobs submit - submit Google Cloud Dataproc jobs to execute on a cluster
SYNOPSIS
gcloud alpha dataproc jobs submit COMMAND [--async] [--bucket=BUCKET] [--region=REGION] [GCLOUD_WIDE_FLAG ...]
DESCRIPTION
(ALPHA) Submit Google Cloud Dataproc jobs to execute on a cluster.
FLAGS
--async
Does not wait for the job to run.
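For example (the cluster name my_cluster is illustrative), to submit a Pig job and return immediately without waiting for it to finish, you might run:
$ gcloud alpha dataproc jobs submit pig --cluster my_cluster \
--file my_script.pig --async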
--bucket=BUCKET
The Cloud Storage bucket to stage files in. Defaults to the cluster's configured bucket.
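For example, assuming an existing Cloud Storage bucket named my-staging-bucket (a hypothetical name), local job files could be staged there instead of the cluster's default bucket:
$ gcloud alpha dataproc jobs submit pyspark --cluster my_cluster \
--bucket my-staging-bucket my_script.py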
--region=REGION
Cloud Dataproc region to use. Each Cloud Dataproc region constitutes an independent resource namespace constrained to deploying instances into Compute Engine zones inside the region. The default value of global is a special multi-region namespace which is capable of deploying instances into all Compute Engine zones globally, and is disjoint from other Cloud Dataproc regions. Overrides the default dataproc/region property value for this command invocation.
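For example, to run against a cluster in the us-central1 region (region and cluster names are illustrative) rather than the default global namespace, you might run:
$ gcloud alpha dataproc jobs submit spark --cluster my_cluster \
--region us-central1 --jar my_jar.jar
Alternatively, the region can be set once for later invocations with $ gcloud config set dataproc/region us-central1.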
GCLOUD WIDE FLAGS
These flags are available to all commands: --account, --configuration, --flags-file, --flatten, --format, --help, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity. Run $ gcloud help for details.
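For example (the project ID my-project is illustrative), a job could be submitted to a cluster in another project by adding the gcloud-wide --project flag:
$ gcloud alpha dataproc jobs submit hive --cluster my_cluster \
--file my_queries.q --project my-project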
COMMANDS
COMMAND is one of the following:
hadoop
(ALPHA) Submit a Hadoop job to a cluster.
hive
(ALPHA) Submit a Hive job to a cluster.
pig
(ALPHA) Submit a Pig job to a cluster.
pyspark
(ALPHA) Submit a PySpark job to a cluster.
spark
(ALPHA) Submit a Spark job to a cluster.
spark-r
(ALPHA) Submit a SparkR job to a cluster.
spark-sql
(ALPHA) Submit a Spark SQL job to a cluster.
EXAMPLES
To submit a Hadoop MapReduce job, run:
$ gcloud alpha dataproc jobs submit hadoop --cluster my_cluster \
--jar my_jar.jar -- arg1 arg2
To submit a Spark Scala or Java job, run:
$ gcloud alpha dataproc jobs submit spark --cluster my_cluster \
--jar my_jar.jar -- arg1 arg2
To submit a PySpark job, run:
$ gcloud alpha dataproc jobs submit pyspark --cluster my_cluster \
my_script.py -- arg1 arg2
To submit a Spark SQL job, run:
$ gcloud alpha dataproc jobs submit spark-sql --cluster my_cluster \
--file my_queries.q
To submit a Pig job, run:
$ gcloud alpha dataproc jobs submit pig --cluster my_cluster \
--file my_script.pig
To submit a Hive job, run:
$ gcloud alpha dataproc jobs submit hive --cluster my_cluster \
--file my_queries.q
NOTES
This command is currently in ALPHA and may change without notice. If this command fails with API permission errors despite specifying the right project, you will have to apply for early access and have your projects registered on the API whitelist to use it. To do so, contact Support at cloud.google.com/support. These variants are also available:
$ gcloud dataproc jobs submit
$ gcloud beta dataproc jobs submit