Install Guide for Kettle/Pentaho Data Integration 4.4 (Community Edition) with MySQL on Windows

For details on setting up MySQL, see my earlier entry on this blog about setting up the Pentaho BI Server.

File Descriptions

Download location: http://sourceforge.net/projects/pentaho/files/Data%20Integration/4.4.0-stable/pdi-ce-4.4.0-stable.zip/download/
  • Download pdi-ce-4.4.0-stable.zip and unzip it to C:\pentaho\data-integration\
  • Download the MySQL JDBC connector and copy its .jar file to C:\pentaho\data-integration\libext\JDBC\
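
If you'd rather script these two steps, here is a minimal Python sketch. It assumes the archive contains a top-level data-integration folder and that you have already downloaded both files; the function name install_pdi and the file locations in the usage comment are hypothetical, so adjust them to match your machine.

```python
import shutil
import zipfile
from pathlib import Path


def install_pdi(zip_path, connector_jar, install_root):
    """Unpack the PDI archive and copy the JDBC driver into libext/JDBC.

    Returns the path of the copied driver jar.
    """
    install_root = Path(install_root)
    with zipfile.ZipFile(zip_path) as archive:
        # Assumption: the archive holds a top-level data-integration/ folder,
        # so extracting into C:\pentaho yields C:\pentaho\data-integration.
        archive.extractall(install_root)

    jdbc_dir = install_root / "data-integration" / "libext" / "JDBC"
    jdbc_dir.mkdir(parents=True, exist_ok=True)
    # copy2 returns the full path of the new file inside jdbc_dir.
    return Path(shutil.copy2(connector_jar, jdbc_dir))


# Example call (both download locations are placeholders):
# install_pdi(r"C:\Downloads\pdi-ce-4.4.0-stable.zip",
#             r"C:\Downloads\mysql-connector.jar",
#             r"C:\pentaho")
```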

Starting the Server/Carte

  • Kettle has a 2-tier architecture: the client (Spoon) connects directly to the repository
  • The server (Carte) is a lightweight process for remote execution. You can kick off remote jobs either through its web interface or from Spoon, which sends commands to the remote host over HTTP
  • To launch the Carte server, run the C:\pentaho\data-integration\Carte.bat script, passing it the IP address and the port number you'd like it to listen on (passing localhost as the host won't work):
C:\pentaho\data-integration\Carte.bat 127.0.0.1 8081
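
To check that Carte actually came up, you can request its status page over HTTP. The sketch below assumes the status page lives at /kettle/status/ and that the default cluster/cluster login is still in place. Both are assumptions about a stock install, and the helper names are my own.

```python
import base64
import urllib.request


def carte_status_url(host, port):
    """Return the URL of Carte's status page for the given host and port."""
    return f"http://{host}:{port}/kettle/status/"


def basic_auth_header(user, password):
    """Build the value of an HTTP Basic Authorization header."""
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return f"Basic {token}"


def fetch_carte_status(host, port, user="cluster", password="cluster", timeout=5):
    """Fetch the status page as text.

    Raises urllib.error.URLError if nothing is listening on host:port.
    The cluster/cluster default is an assumption about an unmodified install.
    """
    request = urllib.request.Request(carte_status_url(host, port))
    request.add_header("Authorization", basic_auth_header(user, password))
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


# Example, matching the command line above:
# print(fetch_carte_status("127.0.0.1", 8081))
```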

Repository Setup

  • In MySQL Workbench, create a database called kettle and a user called kettle
  • Grant the kettle user appropriate schema rights on the kettle database
  • Launch C:\pentaho\data-integration\Spoon.bat
  • You'll be prompted for a Repository Connection
  • Click the little green icon with the '+' in the top right corner to add a new connection
  • You will be prompted to select a Repository type. Select 'Kettle database repository' and click 'OK'
  • You'll be prompted to enter Repository information. Click on 'New'
  • You'll be presented with a Database Connection dialog. Use the following values (I'm assuming that MySQL is installed on the same host)
    • Connection Name: Pentaho MySQL
    • Connection Type: MySQL
    • Access: Native (JDBC)
    • Host Name: localhost
    • Database name: kettle
    • Port Number: 3306
    • User Name: kettle
    • Password: ****
  • Click on 'Test' and you should get a dialog that says Connection to [Pentaho MySQL] is OK
  • Click 'OK' to close the Database Connection dialog. This will bring you back to the 'Repository Information' dialog
  • Enter your userid and password (kettle/****) and click on 'Create or Upgrade'
  • You'll be prompted to review the SQL generated. Go ahead and execute it. The Kettle Repository has now been created
  • You will now be presented with the login dialog with the 'kettle' repository highlighted
  • The Admin user is called 'Admin' and the default password is 'admin'
  • Select Tools -> Repository -> Explore... to invoke the Repository Explorer, which is where you administer the Repository
  • Repository and other config files are stored under C:\Users\<username>\.kettle\ (Note: there is a leading dot before kettle)
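
For anyone who prefers the command line to MySQL Workbench, the first two steps above can be sketched as plain SQL, generated here by a small Python helper. The GRANT ALL statement is my assumption about what "appropriate schema rights" means, the password is a placeholder, and the function names are hypothetical. The second helper builds the JDBC URL that corresponds to the host, port and database values entered in the Database Connection dialog.

```python
def repository_setup_sql(database="kettle", user="kettle",
                         password="change-me", host="localhost"):
    """Return the SQL statements that create the repository database and user.

    Assumption: the user gets all privileges on the repository schema only.
    """
    return [
        f"CREATE DATABASE IF NOT EXISTS `{database}`;",
        f"CREATE USER '{user}'@'{host}' IDENTIFIED BY '{password}';",
        f"GRANT ALL PRIVILEGES ON `{database}`.* TO '{user}'@'{host}';",
    ]


def mysql_jdbc_url(host="localhost", port=3306, database="kettle"):
    """Build the MySQL JDBC URL for the given connection values."""
    return f"jdbc:mysql://{host}:{port}/{database}"


# Print the statements so they can be pasted into a MySQL client as root.
for statement in repository_setup_sql():
    print(statement)
print(mysql_jdbc_url())
```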
