Skip to main content

Building models with Pega machine learning

6 Tasks

15 mins

Visible to: All users
Beginner Pega Customer Decision Hub '23 English


U+ Bank implements Customer Decision Hub™ to determine which credit card offer to show a customer on the bank's website. To reduce the number of clients that leave the bank, the business wants to make use of the historical data that the bank has collected on customers that have churned in the past to predict which customers are likely to leave the bank soon. The bank wants to show retention offers to the potential churners, instead of credit card offers.

As a data scientist, your task is to create a predictive model that predicts churn. You decide to create the model by using Pega machine learning.

Use the following credentials to log in to the exercise system:

Role User name Password
Data Scientist DataScientist rules

Your assignment consists of the following tasks:

Task 1: Create a new predictive model

Create a new predictive model, ChurnPegaML, by using the Churn Modeling template in the Retention category.

Task 2: Prepare the data

Load the data set by using the file. Set the type of predictors that have no predictive power, such as CustomerID, to Not used. In the Outcome definition, use a Binary outcome type and Outcome as the outcome field. Map the values of the outcome field to the outcome categories.

Task 3: Analyze the data

Examine the trends exhibited by the best-performing predictors. Create a virtual field by combining several numerical predictors, and then examine the trend exhibited by this new predictor.

Task 4: Develop predictive models

For predictor grouping, use the best predictor of each group. Create a new bivariate model.

Task 5: Analyze the models

Compare the scores of the three models. Pay particular attention to Discrimination.

Task 6: Select model

Select the Regression model. Ensure that all predictors are mapped to customer properties. Reclassify the classes into a loyal class and a churned class. Save the model.


You must initiate your own Pega instance to complete this Challenge.

Initialization may take up to 5 minutes so please be patient.

Challenge Walkthrough

Detailed Tasks

1 Create a predictive model

  1. On the exercise system landing page, click Launch Pega Infinity™ to log in to Customer Decision Hub.
  2. Log in as a data scientist:
    1. In the User name field, enter DataScientist
    2. In the Password field, enter rules.
  3. In the Work from 1:1 Operations Manager section, click CR-1 to open the change request.
  4. In the Scope of changes section, click Create, and then select Decision > Predictive model.
  5. In the Predictive Model Record Configuration section, in the Label field, enter ChurnPegaML.
  6. In the Context section, in the Apply to field, enter or select UBank-CDH-Data-Customer.
  7. In the upper-right corner, click Create and open, and then click Create new model.
  8. In the Category list, select Retention.
  9. In the Template list, select Churn Modeling.
    The setting for the new predictive model
  10. Click Start to proceed to the data preparation step.

2 Prepare the data

  1. Download and extract the CustomerData.csv file.
  2. In the Source selection section, click Choose File, and then select the CustomerData.csv file.
  3. Check the data, and then click Next to proceed to the sample construction step.
  4. In the CustomerID field, change the type to Not used.
    The CustomerID field set to Not used
  5. Click Next to proceed to the outcome definition step.
  6. In the Outcome definition section, in the Outcome type list, confirm that Binary is selected.
  7. In the Outcome field to predict list, select Outcome.
  8. In the Churn row, in the Outcome category column, select churned.
  9. In the Loyal row, in the Outcome category column, confirm that loyal Is selected.
    The mapping of the outcomes
  10. Confirm that the number of cases in the development, validation, and test sets is approximately equal for both outcome categories.
    The outcome field values for the data sets
  11. Click Next to proceed to the data analysis step.

3 Analyze the data

  1. In the list of predictors, click Age and examine the grouping for this predictor.
    The grouping for the Age predictor
  2. In the upper-right corner, click Cancel to close the predictor report.
  3. Click New virtual field to open the Virtual field dialog box.
  4. In the Virtual field dialog box, in the Name field, enter Age*CreditScore.
  5. Click Fields, select Age, click Insert, and then enter *.
  6. Click Fields, select CreditScore, and then click Insert.
  7. Click Save & close.
  8. Confirm that the newly created predictor outperforms the two original predictors.
    The virtual field outperforms
  9. Click Next to proceed to the model development step.

4 Develop predictive models

  1. In the Predictor grouping section, select Use best of each group.
    The best of each group button
  2. Click Next to proceed to the model creation step.
  3. In the Model creation section, in the Create model list, select Bivariate.
    The model creation options
  4. In the upper-right corner, click Submit to add the model to the model list.
  5. Click Next to proceed to the model analysis step.

5 Analyze the models

  1. On the Score comparison page, ensure that all the model checkboxes are selected.
  2. Click Analyze charts to access the model analysis.
  3. On the Discrimination tab, examine the results.
    The discrimination graph
    Note: The regression model outperforms the decision tree model and the bivariate model as it has the largest area under the curve (AUC). However, before you choose a model, consider the number of predictors required by the model. Under certain circumstances, you might decide to select a predicting model that is performing lower but has fewer predictors.
  1. In the upper-left corner, click the arrow next to Model analysis charts to return to the Score comparison page.
    The Back to the PML Wizard button
  2. Click Next. On this page, you can analyze the score distribution.
  3. Click Next. On this page, you can analyze class comparison.
  4. In the Model selection section, ensure that the Regression model is selected.
  5. Click Finish & Update to proceed to the model selection step.

6 Select the model

  1. In the navigation pane of Prediction Studio, click Models to view the model list.
  2. Click ChurnPegaML to configure the model.
  3. On the Model tab, click the Gear icon in the Outcome definition section.
  4. In the Label 'churned' as field, enter Churn.
  5. In the Label 'loyal' as field, enter Loyal, and then click OK to submit the outcome definitions.
  6. In the Confirm outcome update dialog box, click Confirm.
  7. In the Expected score distribution section, click the area between class 6 and class 7 in the score distribution chart.
    The score distribution graph
  8. In the Classification groups section, in the class 1-6 row, in the Name column, enter Loyal.
  9. In the Class 7-10 row, in the Name column, enter Churn.
    The classification groups
  10. On the Mapping tab, ensure that all predictors are mapped to the appropriate customer fields.
  11. In the upper-right corner, click Save.

This Challenge is to practice what you learned in the following Module:

Available in the following mission:

If you are having problems with your training, please review the Pega Academy Support FAQs.

Did you find this content helpful?

Want to help us improve this content?

We'd prefer it if you saw us at our best.

Pega Academy has detected you are using a browser which may prevent you from experiencing the site as intended. To improve your experience, please update your browser.

Close Deprecation Notice