0% found this document useful (0 votes)
150 views10 pages

Pentaho Data Integration Fundamentals: Course Code DI1000 7.0

Uploaded by

Yonathan Gambin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
150 views10 pages

Pentaho Data Integration Fundamentals: Course Code DI1000 7.0

Uploaded by

Yonathan Gambin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Pentaho Data Integration Fundamentals

Course Code DI1000


Version 7.0
Student Guide

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Pentaho Data Integration Fundamentals Course Code DI1000

Table of Contents

Guided Demo 1: Launching and Customizing PDI............................................................... 3


Guided Demo 2: Creating a “Hello World” Transformation ............................................... 7
Guided Demo 3: Error Handling and Basic Logging .......................................................... 14
Exercise 1: Generate Rows, Add Sequence, and Select Values ........................................ 20
Guided Demo 4: Saving a Transformation in the Repository ........................................... 27
Guided Demo 5: Combining Several Inputs into One Output .......................................... 35
Guided Demo 6: Creating kettle.properties Variables ..................................................... 44
Exercise 2: CSV Input to Multiple Text Output Using Switch / Case................................. 51
Exercise 2 Advanced: CSV Input to Multiple Text Output Using Switch / Case ................ 62
Exercise 3: Serializing Multiple Text Files ......................................................................... 64
Exercise 3 Advanced: Serializing Multiple Text Files ........................................................ 69
Exercise 4: De-serializing a File ......................................................................................... 70
Exercise 4 Advanced: De-serializing Multiple Text Files ................................................... 75
Guided Demo 7: Connections and the Database Explorer ............................................... 76
Exercise 5: Reading and Writing to Database Tables ....................................................... 84
Exercise 5 Advanced: Reading and Writing to Database Tables....................................... 95
Guided Demo 8: Data Cleansing ....................................................................................... 96
Exercise 6: Input with Parameters and Table Copy Wizard ............................................ 106
Exercise 6 Advanced: Input with Parameters and Table Copy Wizard ........................... 118
Exercise 7: Parallel Processing ........................................................................................ 120
Guided Demo 9: Choosing Adequate Sample Size for ‘Get Fields’ ................................. 129
Exercise 8: Lookups and Data Formatting ...................................................................... 136
Guided Demo 10: Creating Summary Fields Using Group By ......................................... 145
Exercise 9: Calculating and Aggregating Order Quantity................................................ 151
Exercise 9 Advanced: Calculating and Aggregating the Quantity ................................... 158
Guided Demo 11: Loading JVM Data into a Table .......................................................... 160
Guided Demo 12: Using the Pentaho Enterprise Repository ......................................... 169
Guided Demo 13: Scheduling and Monitoring ............................................................... 173
Guided Demo 14: Detailed Logging Throughout Execution ........................................... 177

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |1
Pentaho Data Integration Fundamentals Course Code DI1000

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |2
Pentaho Data Integration Fundamentals Course Code DI1000

Guided Demo 1: Launching and Customizing PDI

Introduction In this guided demonstration, we launch Spoon, PDI’s graphical interface, and
customize some of its options and default behavior.

Objectives After completing this guided demonstration, you will be able to:
• Launch Spoon.
• Turn the Welcome Screen on and off.
• Open Spoon’s Options dialog.
• Describe the common options and look & feel settings.
• Change the grid settings.

Continued on next page

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |3
Pentaho Data Integration Fundamentals Course Code DI1000

Guided Demo 1: Launching and Customizing PDI, Continued

Launch and Step Action


Customize PDI 1 To launch Spoon, from the Windows task bar, select:

Start>All Programs>Pentaho Enterprise Edition>Design


Tools>Data Integration

TIP: Create a shortcut to Spoon on the Desktop to easily start


the interface throughout this course.
2 To view the Welcome Screen, from the menu, select Help>
Welcome Screen.
3 Scroll through the Welcome Screen to familiarize yourself with
its contents.

4 To close the Welcome Screen, on the Welcome! tab, click X.


5 To view the Kettle Options, from the menu, select
Tools>Options.

Continued on next page

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |4
Pentaho Data Integration Fundamentals Course Code DI1000

Guided Demo 1: Launching and Customizing PDI, Continued

Launch and Step Action


Customize PDI, 6 Review the options on the General tab, and then click the Look
continued & Feel tab.

7 To modify the grid settings, on the Look & Feel tab:


• Change the Canvas Grid Size to 32.
• Click to select the Show Canvas Grid checkbox.
• Click OK.

Continued on next page


Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |5
Pentaho Data Integration Fundamentals Course Code DI1000

Guided Demo 1: Launching and Customizing PDI, Continued

Launch and Step Action


Customize PDI, 8 To close the Info dialog, click OK.
continued
NOTE: Although restarting Spoon after changing options is the
best habit to have, not all options require a restart for the
changes to take effect.
9 To close Spoon, from the menu, select File>Exit.

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |6
Pentaho Data Integration Fundamentals Course Code DI1000

Guided Demo 2: Creating a “Hello World” Transformation

Introduction In this guided demonstration, we use Spoon to create a new transformation


with data containing “Hello World.” Although this is probably not a task you
will be asked to do in the real world, the concepts learned in this guided
demonstration help to build the foundation necessary for creating any
transformation.

Objectives After completing this guided demonstration, you will be able to:
• Create a new transformation.
• Add steps and hops.
• Configure the ‘Generate Rows’ step.
• Preview the ‘Generate Rows’ step.
• Add a ‘Dummy (do nothing)’ step.
• Add a note to a transformation.
• Preview the transformation.

Steps Used This guided demonstration uses the following steps:


• Generate Rows
• Dummy (do nothing)

Model

Continued on next page

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |7
Pentaho Data Integration Fundamentals Course Code DI1000

Guided Demo 2: Creating a “Hello World” Transformation, Continued

Creating the In this section of the guided demonstration, we create a new transformation,
Transformation and then add and configure the ‘Generate Rows’ step.
and Adding a
Generate Rows Step Action
Step
1 To create a new transformation, from the menu, select
File>New> Transformation.

Alternatively, on the toolbar, click the New file button, and then
select Transformation.
2 To add a ‘Generate Rows’ step to the transformation:
• Click to view the Design tab.
• Expand the Input category.
• Drag the Generate Rows step from the Design tab to the
canvas.

3 To configure the ‘Generate Rows’ step, on the canvas, double-


click the Generate Rows step.

TIP: You might find it helpful to enlarge the step dialog


windows.
4 Enter values in the properties of the Generate Rows dialog as
follows:

Property Name Value


Step name Create 15 rows
Limit 15

Continued on next page

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |8
Pentaho Data Integration Fundamentals Course Code DI1000

Guided Demo 2: Creating a “Hello World” Transformation, Continued

Creating the Step Action


Transformation 5 Complete the Fields grid of the Generate Rows dialog as follows:
and Adding a
Generate Rows Column Name Value
Step, continued Name Greeting Message
Type 15
Value Hello, World!

6 To preview this step to verify it generates the data we expect:


• Click the Preview button.
• In the Enter preview size dialog, click OK.
• Verify 15 rows of data with Hello, World! are displayed.
• In the Examine preview data dialog, click OK.

TIP: Previewing data and testing steps along the way can help to
minimize errors and trouble shooting time later in the
transformation creation process.
7 To close the Generate Rows configuation dialog, click OK.

Continued on next page

Copyright © 2017 Pentaho Corporation. All trademarks are the property of their respective owners.
Course books may not be reproduced or distributed, in whole or in part, without the prior written permission of Pentaho Training.
www.pentaho.com/services/training or email: [email protected]
Page |9

You might also like