Data Engineering

Unit 2 • Chapter 4

Machine Learning with Pyspark

Summary

Concept Check

What is a common algorithm used for classification in Pyspark Machine Learning?

What does the term feature engineering refer to in Pyspark Machine Learning?

Which evaluation metric is commonly used for regression tasks in Pyspark Machine Learning?

What is the purpose of cross-validation in Pyspark Machine Learning?

In Pyspark Machine Learning, what is an ensemble method used for improving model performance?