Client installation

Introduction

Integrator sandbox virtual machine contains following applications pre-installed.

  • Integrator Server

  • Integrator Hadoop Agent

  • Apache Hadoop with spark

Please note that this is not an ideal configuration for production use and is intended for testing purposes only.

Installation Requirements:

Step1: Import Virtual Machine

  • Launch Oracle VirtualBox
  • Click [File] -> [Import Appliance]

  • Browse/Select downloaded .ova file then click [Next]

  • Click [Import]

Step2: Configure Virtual Machine (don't launch VM yet)

  • Select VM and click [Settings]

  • Select [System] on left panel

  • On [Motherboard] tab set base memory to 8192MB or higher

  • On [Processor] tab set processor count to 4 (ideal) or higher. Recommended processor count is 4 but if your system only has 2 then set it to 2.

  • Select [Network] on left panel

  • On [Adapter 1] tab check [Enable Network Adapter]

  • Set Attached to: [Bridged Adapter]

  • Set Name: <Your network adapter name> (if you are on wifi connection select appropriate wifi device name)

The adapter name is the name of network adapter your computer is using to connect to network(internet). If you are on wireless network, select appropriate wireless device name.

  • Expand [Advanced]

  • Set MAC Address: 0800271FB789 (Important!!)

  • Set Check [Cable Connected]

  • Click [OK]

  • Click [Start] to launch Virtual Machine

  • On Virtual Machine Console

  • Note: Click anywhere on console to start using it. Right-Ctrl key to get mouse control back.

  • Use following credentials to log in

  • User Name: hadoop

  • Password: welcome123

  • Note down IP Address: 192.168.1.22 (your server ip address may be different than shown), this address may/may not change after rebooting. Note down the ip address if it changes after reboot

  • We will refer this IP Address as SANDBOX_IP_ADDRESS for remainder of documentation

Step3: Begin Using Sandbox

It may take few minutes after server has started for all applications to initialize. A log file is updated when the services start during startup. For debugging purpose you can check contents of log file /tmp/sandbox.log

The last line on log file should contain "********* Spark Submit [Found]"

  • Launch Integrator Windows Client application.

  • Server: SANDBOX_IP_ADDRESS

  • Port: 8181

  • User Name: Integrator

  • Password: Integrator

  • Click [Login] to start using Integrator Client

Default SSH Credentials

User Name Password ————— ————– root welcome123 hadoop welcome123

You can use ssh client like Putty to connect to SANDBOX_IP_ADDRESS using above credentials

Default Integrator Windows Client Login

User Name Password
Integrator Integrator

Hadoop Web Access (Using Web Browser Like Chrome/FireFox)

URL Type
http://SANDBOX_IP_ADDRESS:50070/ Hadoop Service
http://SANDBOX_IP_ADDRESS:8088/ Hadoop Applications
http://SANDBOX_IP_ADDRESS:50090/ Secondary Name Node
http://SANDBOX_IP_ADDRESS:50075/ Data Node

Shut Down Procedure

  • SSH Login as user [root]

  • Run Following Commands

su - hadoop -c "/home/hadoop/scripts/stop_services.sh"

halt

Start Up Procedure

Integrator and Hadoop related services are set to automatically start when Sandbox Virtual Machine reboots. It may take few minutes for all services to come online after server has been started.

Sample DVD Rental Database