[Feb-2022] Pass Google Professional-Machine-Learning-Engineer Tests Engine pdf - All Free Dumps [Q25-Q50]

Share

[Feb-2022] Pass Google Professional-Machine-Learning-Engineer Tests Engine pdf - All Free Dumps

Google Professional Machine Learning Engineer Practice Tests 2022 | Pass Professional-Machine-Learning-Engineer with confidence!


Topics of Professional Machine Learning Engineer - Google

Candidates must know the exam topics before they start preparation. Because it will help them in hitting the core. Google Professional-Machine-Learning-Engineer exam dumps pdf will include the following topics:

  • Data Preparation and Processing
  • ML Problem Framing
  • ML Model Development

 

NEW QUESTION 25
You developed an ML model with Al Platform, and you want to move it to production. You serve a few thousand queries per second and are experiencing latency issues. Incoming requests are served by a load balancer that distributes them across multiple Kubeflow CPU-only pods running on Google Kubernetes Engine (GKE). Your goal is to improve the serving latency without changing the underlying infrastructure. What should you do?

  • A. Significantly increase the max_batch_size TensorFlow Serving parameter
  • B. Recompile TensorFlow Serving using the source to support CPU-specific optimizations Instruct GKE to choose an appropriate baseline minimum CPU platform for serving nodes
  • C. Switch to the tensorflow-model-server-universal version of TensorFlow Serving
  • D. Significantly increase the max_enqueued_batches TensorFlow Serving parameter

Answer: B

 

NEW QUESTION 26
A Machine Learning Specialist at a company sensitive to security is preparing a dataset for model training. The dataset is stored in Amazon S3 and contains Personally Identifiable Information (PII).
The dataset:
* Must be accessible from a VPC only.
* Must not traverse the public internet.
How can these requirements be satisfied?

  • A. Create a VPC endpoint and apply a bucket access policy that allows access from the given VPC endpoint and an Amazon EC2 instance.
  • B. Create a VPC endpoint and apply a bucket access policy that restricts access to the given VPC endpoint and the VPC.
  • C. Create a VPC endpoint and use Network Access Control Lists (NACLs) to allow traffic between only the given VPC endpoint and an Amazon EC2 instance.
  • D. Create a VPC endpoint and use security groups to restrict access to the given VPC endpoint and an Amazon EC2 instance

Answer: B

 

NEW QUESTION 27
You are training a Resnet model on Al Platform using TPUs to visually categorize types of defects in automobile engines. You capture the training profile using the Cloud TPU profiler plugin and observe that it is highly input-bound. You want to reduce the bottleneck and speed up your model training process. Which modifications should you make to the tf .data dataset?
Choose 2 answers

  • A. Decrease the batch size argument in your transformation
  • B. Use the interleave option for reading data
  • C. Increase the buffer size for the shuffle option.
  • D. Reduce the value of the repeat parameter
  • E. Set the prefetch option equal to the training batch size

Answer: B,E

 

NEW QUESTION 28
You have been asked to develop an input pipeline for an ML training model that processes images from disparate sources at a low latency. You discover that your input data does not fit in memory. How should you create a dataset following Google-recommended best practices?

  • A. Convert the images to tf .Tensor Objects, and then run tf. data. Dataset. from_tensors ().
  • B. Convert the images Into TFRecords, store the images in Cloud Storage, and then use the tf. data API to read the images for training
  • C. Create a tf.data.Dataset.prefetch transformation
  • D. Convert the images to tf .Tensor Objects, and then run Dataset. from_tensor_slices{).

Answer: D

 

NEW QUESTION 29
You need to build classification workflows over several structured datasets currently stored in BigQuery. Because you will be performing the classification several times, you want to complete the following steps without writing code: exploratory data analysis, feature selection, model building, training, and hyperparameter tuning and serving. What should you do?

  • A. Configure AutoML Tables to perform the classification task
  • B. Use Al Platform to run the classification model job configured for hyperparameter tuning
  • C. Run a BigQuery ML task to perform logistic regression for the classification
  • D. Use Al Platform Notebooks to run the classification model with pandas library

Answer: C

Explanation:
BigQuery ML supports supervised learning with the logistic regression model type.

 

NEW QUESTION 30
Your team needs to build a model that predicts whether images contain a driver's license, passport, or credit card. The data engineering team already built the pipeline and generated a dataset composed of 10,000 images with driver's licenses, 1,000 images with passports, and 1,000 images with credit cards. You now have to train a model with the following label map: ['driversjicense', 'passport', 'credit_card']. Which loss function should you use?

  • A. Binary cross-entropy
  • B. Categorical hinge
  • C. Sparse categorical cross-entropy
  • D. Categorical cross-entropy

Answer: C

Explanation:
se sparse_categorical_crossentropy. Examples for above 3-class classification problem: [1] , [2], [3]

 

NEW QUESTION 31
You work with a data engineering team that has developed a pipeline to clean your dataset and save it in a Cloud Storage bucket. You have created an ML model and want to use the data to refresh your model as soon as new data is available. As part of your CI/CD workflow, you want to automatically run a Kubeflow Pipelines training job on Google Kubernetes Engine (GKE). How should you architect this workflow?

  • A. Configure your pipeline with Dataflow, which saves the files in Cloud Storage After the file is saved, start the training job on a GKE cluster
  • B. Configure a Cloud Storage trigger to send a message to a Pub/Sub topic when a new file is available in a storage bucket. Use a Pub/Sub-triggered Cloud Function to start the training job on a GKE cluster
  • C. Use App Engine to create a lightweight python client that continuously polls Cloud Storage for new files As soon as a file arrives, initiate the training job
  • D. Use Cloud Scheduler to schedule jobs at a regular interval. For the first step of the job. check the timestamp of objects in your Cloud Storage bucket If there are no new files since the last run, abort the job.

Answer: B

 

NEW QUESTION 32
A Mobile Network Operator is building an analytics platform to analyze and optimize a company's operations using Amazon Athena and Amazon S3.
The source systems send data in .CSV format in real time. The Data Engineering team wants to transform the data to the Apache Parquet format before storing it on Amazon S3.
Which solution takes the LEAST effort to implement?

  • A. Ingest .CSV data from Amazon Kinesis Data Streams and use Amazon Glue to convert data into Parquet.
  • B. Ingest .CSV data from Amazon Kinesis Data Streams and use Amazon Kinesis Data Firehose to convert data into Parquet.
  • C. Ingest .CSV data using Apache Spark Structured Streaming in an Amazon EMR cluster and use Apache Spark to convert data into Parquet.
  • D. Ingest .CSV data using Apache Kafka Streams on Amazon EC2 instances and use Kafka Connect S3 to serialize data as Parquet

Answer: A

Explanation:
Explanation/Reference:

 

NEW QUESTION 33
You are building a linear model with over 100 input features, all with values between -1 and 1. You suspect that many features are non-informative. You want to remove the non-informative features from your model while keeping the informative ones in their original form. Which technique should you use?

  • A. After building your model, use Shapley values to determine which features are the most informative.
  • B. Use L1 regularization to reduce the coefficients of uninformative features to 0.
  • C. Use an iterative dropout technique to identify which features do not degrade the model when removed.
  • D. Use Principal Component Analysis to eliminate the least informative features.

Answer: A

 

NEW QUESTION 34
A Data Science team is designing a dataset repository where it will store a large amount of training data commonly used in its machine learning models. As Data Scientists may create an arbitrary number of new datasets every day, the solution has to scale automatically and be cost-effective. Also, it must be possible to explore the data using SQL.
Which storage scheme is MOST adapted to this scenario?

  • A. Store datasets as files in an Amazon EBS volume attached to an Amazon EC2 instance.
  • B. Store datasets as files in Amazon S3.
  • C. Store datasets as global tables in Amazon DynamoDB.
  • D. Store datasets as tables in a multi-node Amazon Redshift cluster.

Answer: B

 

NEW QUESTION 35
You work for a bank and are building a random forest model for fraud detection. You have a dataset that includes transactions, of which 1% are identified as fraudulent.
Which data transformation strategy would likely improve the performance of your classifier?

  • A. Write your data in TFRecords.
  • B. Oversample the fraudulent transaction 10 times.
  • C. Use one-hot encoding on all categorical features.
  • D. Z-normalize all the numeric features.

Answer: B

 

NEW QUESTION 36
You recently designed and built a custom neural network that uses critical dependencies specific to your organization's framework. You need to train the model using a managed training service on Google Cloud. However, the ML framework and related dependencies are not supported by Al Platform Training. Also, both your model and your data are too large to fit in memory on a single machine. Your ML framework of choice uses the scheduler, workers, and servers distribution structure. What should you do?

  • A. Use a built-in model available on Al Platform Training
  • B. Reconfigure your code to a ML framework with dependencies that are supported by Al Platform Training
  • C. Build your custom container to run jobs on Al Platform Training
  • D. Build your custom containers to run distributed training jobs on Al Platform Training

Answer: D

 

NEW QUESTION 37
You developed an ML model with Al Platform, and you want to move it to production. You serve a few thousand queries per second and are experiencing latency issues. Incoming requests are served by a load balancer that distributes them across multiple Kubeflow CPU-only pods running on Google Kubernetes Engine (GKE). Your goal is to improve the serving latency without changing the underlying infrastructure. What should you do?

  • A. Recompile TensorFlow Serving using the source to support CPU-specific optimizations Instruct GKE to choose an appropriate baseline minimum CPU platform for serving nodes
  • B. Switch to the tensorflow-model-server-universal version of TensorFlow Serving
  • C. Significantly increase the max_batch_size TensorFlow Serving parameter
  • D. Significantly increase the max_enqueued_batches TensorFlow Serving parameter

Answer: C

 

NEW QUESTION 38
A Machine Learning team uses Amazon SageMaker to train an Apache MXNet handwritten digit classifier model using a research dataset. The team wants to receive a notification when the model is overfitting.
Auditors want to view the Amazon SageMaker log activity report to ensure there are no unauthorized API calls.
What should the Machine Learning team do to address the requirements with the least amount of code and fewest steps?

  • A. Use AWS CloudTrail to log Amazon SageMaker API calls to Amazon S3. Add code to push a custom metric to Amazon CloudWatch. Create an alarm in CloudWatch with Amazon SNS to receive a notification when the model is overfitting.
  • B. Implement an AWS Lambda function to log Amazon SageMaker API calls to AWS CloudTrail. Add code to push a custom metric to Amazon CloudWatch. Create an alarm in CloudWatch with Amazon SNS to receive a notification when the model is overfitting.
  • C. Use AWS CloudTrail to log Amazon SageMaker API calls to Amazon S3. Set up Amazon SNS to receive a notification when the model is overfitting
  • D. Implement an AWS Lambda function to log Amazon SageMaker API calls to Amazon S3. Add code to push a custom metric to Amazon CloudWatch. Create an alarm in CloudWatch with Amazon SNS to receive a notification when the model is overfitting.

Answer: B

 

NEW QUESTION 39
You are working on a Neural Network-based project. The dataset provided to you has columns with different ranges. While preparing the data for model training, you discover that gradient optimization is having difficulty moving weights to a good solution. What should you do?

  • A. Improve the data cleaning step by removing features with missing values.
  • B. Change the partitioning step to reduce the dimension of the test set and have a larger training set.
  • C. Use the representation transformation (normalization) technique.
  • D. Use feature construction to combine the strongest features.

Answer: A

 

NEW QUESTION 40
A company is using Amazon Textract to extract textual data from thousands of scanned text-heavy legal documents daily. The company uses this information to process loan applications automatically. Some of the documents fail business validation and are returned to human reviewers, who investigate the errors. This activity increases the time to process the loan applications.
What should the company do to reduce the processing time of loan applications?

  • A. Use an Amazon Textract synchronous operation instead of an asynchronous operation.
  • B. Configure Amazon Textract to route low-confidence predictions to Amazon Augmented AI (Amazon A2I).
    Perform a manual review on those words before performing a business validation.
  • C. Use Amazon Rekognition's feature to detect text in an image to extract the data from scanned images. Use this information to process the loan applications.
  • D. Configure Amazon Textract to route low-confidence predictions to Amazon SageMaker Ground Truth.
    Perform a manual review on those words before performing a business validation.

Answer: B

 

NEW QUESTION 41
A Data Scientist needs to analyze employment data. The dataset contains approximately 10 million observations on people across 10 different features. During the preliminary analysis, the Data Scientist notices that income and age distributions are not normal. While income levels shows a right skew as expected, with fewer individuals having a higher income, the age distribution also show a right skew, with fewer older individuals participating in the workforce.
Which feature transformations can the Data Scientist apply to fix the incorrectly skewed data? (Choose two.)

  • A. One hot encoding
  • B. High-degree polynomial transformation
  • C. Cross-validation
  • D. Logarithmic transformation
  • E. Numerical value binning

Answer: C,E

 

NEW QUESTION 42
You need to design a customized deep neural network in Keras that will predict customer purchases based on their purchase history. You want to explore model performance using multiple model architectures, store training data, and be able to compare the evaluation metrics in the same dashboard. What should you do?

  • A. Run multiple training jobs on Al Platform with similar job names
  • B. Create an experiment in Kubeflow Pipelines to organize multiple runs
  • C. Automate multiple training runs using Cloud Composer
  • D. Create multiple models using AutoML Tables

Answer: A

 

NEW QUESTION 43
During batch training of a neural network, you notice that there is an oscillation in the loss. How should you adjust your model to ensure that it converges?

  • A. Increase the learning rate hyperparameter
  • B. Increase the size of the training batch
  • C. Decrease the learning rate hyperparameter
  • D. Decrease the size of the training batch

Answer: A

 

NEW QUESTION 44
A Machine Learning Specialist is implementing a full Bayesian network on a dataset that describes public transit in New York City. One of the random variables is discrete, and represents the number of minutes New Yorkers wait for a bus given that the buses cycle every 10 minutes, with a mean of 3 minutes.
Which prior probability distribution should the ML Specialist use for this variable?

  • A. Binomial distribution
  • B. Uniform distribution
  • C. Normal distribution
  • D. Poisson distribution

Answer: A

 

NEW QUESTION 45
You have a functioning end-to-end ML pipeline that involves tuning the hyperparameters of your ML model using Al Platform, and then using the best-tuned parameters for training. Hypertuning is taking longer than expected and is delaying the downstream processes. You want to speed up the tuning job without significantly compromising its effectiveness. Which actions should you take?
Choose 2 answers

  • A. Decrease the range of floating-point values
  • B. Change the search algorithm from Bayesian search to random search.
  • C. Decrease the number of parallel trials
  • D. Decrease the maximum number of trials during subsequent training phases.
  • E. Set the early stopping parameter to TRUE

Answer: B,D

 

NEW QUESTION 46
A Data Scientist needs to create a serverless ingestion and analytics solution for high-velocity, real-time streaming data.
The ingestion process must buffer and convert incoming records from JSON to a query-optimized, columnar format without data loss. The output datastore must be highly available, and Analysts must be able to run SQL queries against the data and connect to existing business intelligence dashboards.
Which solution should the Data Scientist build to satisfy the requirements?

  • A. Use Amazon Kinesis Data Analytics to ingest the streaming data and perform real-time SQL queries to convert the records to Apache Parquet before delivering to Amazon S3. Have the Analysts query the data directly from Amazon S3 using Amazon Athena and connect to BI tools using the Athena Java Database Connectivity (JDBC) connector.
  • B. Write each JSON record to a staging location in Amazon S3. Use the S3 Put event to trigger an AWS Lambda function that transforms the data into Apache Parquet or ORC format and writes the data to a processed data location in Amazon S3. Have the Analysts query the data directly from Amazon S3 using Amazon Athena, and connect to BI tools using the Athena Java Database Connectivity (JDBC) connector.
  • C. Create a schema in the AWS Glue Data Catalog of the incoming data format. Use an Amazon Kinesis Data Firehose delivery stream to stream the data and transform the data to Apache Parquet or ORC format using the AWS Glue Data Catalog before delivering to Amazon S3. Have the Analysts query the data directly from Amazon S3 using Amazon Athena, and connect to BI tools using the Athena Java Database Connectivity (JDBC) connector.
  • D. Write each JSON record to a staging location in Amazon S3. Use the S3 Put event to trigger an AWS Lambda function that transforms the data into Apache Parquet or ORC format and inserts it into an Amazon RDS PostgreSQL database. Have the Analysts query and run dashboards from the RDS database.

Answer: C

Explanation:
Explanation/Reference:

 

NEW QUESTION 47
You work for a large technology company that wants to modernize their contact center. You have been asked to develop a solution to classify incoming calls by product so that requests can be more quickly routed to the correct support team. You have already transcribed the calls using the Speech-to-Text API. You want to minimize data preprocessing and development time. How should you build the model?

  • A. Use the Al Platform Training built-in algorithms to create a custom model
  • B. Build a custom model to identify the product keywords from the transcribed calls, and then run the keywords through a classification algorithm
  • C. Use AutoML Natural Language to extract custom entities for classification
  • D. Use the Cloud Natural Language API to extract custom entities for classification

Answer: A

 

NEW QUESTION 48
A financial services company is building a robust serverless data lake on Amazon S3. The data lake should be flexible and meet the following requirements:
* Support querying old and new data on Amazon S3 through Amazon Athena and Amazon Redshift Spectrum.
* Support event-driven ETL pipelines
* Provide a quick and easy way to understand metadata
Which approach meets these requirements?

  • A. Use an AWS Glue crawler to crawl S3 data, an Amazon CloudWatch alarm to trigger an AWS Batch job, and an AWS Glue Data Catalog to search and discover metadata.
  • B. Use an AWS Glue crawler to crawl S3 data, an Amazon CloudWatch alarm to trigger an AWS Glue ETL job, and an external Apache Hive metastore to search and discover metadata.
  • C. Use an AWS Glue crawler to crawl S3 data, an AWS Lambda function to trigger an AWS Batch job, and an external Apache Hive metastore to search and discover metadata.
  • D. Use an AWS Glue crawler to crawl S3 data, an AWS Lambda function to trigger an AWS Glue ETL job, and an AWS Glue Data catalog to search and discover metadata.

Answer: C

 

NEW QUESTION 49
You have trained a text classification model in TensorFlow using Al Platform. You want to use the trained model for batch predictions on text data stored in BigQuery while minimizing computational overhead. What should you do?

  • A. Export the model to BigQuery ML.
  • B. Submit a batch prediction job on Al Platform that points to the model location in Cloud Storage.
  • C. Use Dataflow with the SavedModel to read the data from BigQuery
  • D. Deploy and version the model on Al Platform.

Answer: A

 

NEW QUESTION 50
......


Understanding functional and technical aspects of Professional Machine Learning Engineer - Google ML Model Development

The following will be discussed in Google Professional-Machine-Learning-Engineer exam dumps:

  • Model generalization
  • Choice of framework and model
  • Model performance against baselines, simpler models, and across the time dimension
  • Retraining/redeployment evaluation
  • Overfitting
  • Training a model as a job in different environments
  • Build a model
  • Hardware accelerators
  • Modeling techniques given interpretability requirements
  • Unit tests for model training and serving
  • Distributed training
  • Tracking metrics during training
  • Transfer learning
  • Productionizing

 

Online Exam Practice Tests with detailed explanations!: https://certtree.2pass4sure.com/Google-Certification/Professional-Machine-Learning-Engineer-actual-exam-braindumps.html