We are going to install Sqoop. Ensure you have Hadoop installed already.
This assumes your hostname is “hadoop”
Install Java JDK
- apt-get update
- apt-get upgrade
- apt-get install default-jdk
Download Sqoop:
- wget https://archive.apache.org/dist/sqoop/1.99.7/sqoop-1.99.7-bin-hadoop200.tar.gz
- tar -zxvf sqoop-1.99.7-bin-hadoop200.tar.gz
- sudo mv sqoop-1.99.7-bin-hadoop200 /usr/local/sqoop/
- sudo chown -R root:hadoopuser /usr/local/sqoop/
Setup .bashrc:
- sudo nano ~/.bashrc
Add the following to the end of the file.
#SQOOP VARIABLES START
export SQOOP_HOME=/usr/local/sqoop
export PATH=$PATH:$SQOOP_HOME/bin
export SQOOP_CONF_DIR=$SQOOP_HOME/conf
export SQOOP_CLASS_PATH=$SQOOP_CONF_DIR
#SQOOP VARIABLES STOP
- source ~/.bashrc
Initialise Repository
- ./bin/sqoop2-tool upgrade
Modify sqoop2-server
If you are running Hadoop on the same server as Sqoop Server you will need to modify this file. The reason is because Sqoop needs you to point to the lib directory for common, hdfs, mapreduce and yarn.
- nano /usr/loca/sqoop/bin/sqoop.sh
- #Modify these lines
- HADOOP_COMMON_HOME=${HADOOP_COMMON_HOME:-${HADOOP_HOME}/share/hadoop/common}
- HADOOP_HDFS_HOME=${HADOOP_HDFS_HOME:-${HADOOP_HOME}/share/hadoop/hdfs}
- HADOOP_MAPRED_HOME=${HADOOP_MAPRED_HOME:-${HADOOP_HOME}/share/hadoop/mapreduce}
- HADOOP_YARN_HOME=${HADOOP_YARN_HOME:-${HADOOP_HOME}/share/hadoop/yarn}
- #TO
- HADOOP_COMMON_HOME=${HADOOP_HOME}/share/hadoop/common
- HADOOP_HDFS_HOME=${HADOOP_HOME}/share/hadoop/hdfs
- HADOOP_MAPRED_HOME=${HADOOP_HOME}/share/hadoop/mapreduce
- HADOOP_YARN_HOME=${HADOOP_HOME}/share/hadoop/yarn
Configuration
- nano /usr/local/sqoop/conf/sqoop.properties
- #Update the following line
- org.apache.sqoop.submission.engine.mapreduce.configuration.directory=/usr/local/hadoop/etc/hadoop/
Start Sqoop Server
- ./bin/sqoop2-server start
References
https://linoxide.com/tools/install-apache-sqoop-ubuntu-16-04/