Chris Agocs learns something new: Hadoop: JAVA

This post is mostly expounding on my findings here: http://stackoverflow.com/questions/10824462/hadoop-java-home-is-not-set

Background

After installing Java, I wanted to install Hadoop on this Ubuntu 12.04 Server. There are a million tutorials, but I went with this one: http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/. I wanted to be able to run Hadoop as all users, so I set JAVA_HOME inside of /etc/profile. That way, all profiles get that environmental variable.

I got down to the part where you format the HDFS filesystem through the following command:

/usr/local/hadoop/bin/hadoop namenode -format

And everything went just fine... NOT.

user@linux01:~$ sudo $HADOOP_INSTALL/bin/hadoop namenode -format
Error: JAVA_HOME is not set.

I was indignant! JAVA_HOME is totally set!

user@linux01:~$ tail -n 4 /etc/profile
export JAVA_HOME=/usr/local/jdk1.6.0_32/bin

export JDK_HOME=$JAVA_HOME
export PATH=$PATH:/usr/local/jdk1.6.0_32/bin

export HADOOP_INSTALL=/usr/local/hadoop/hadoop-1.0.3

user@linux01:~$ echo $JAVA_HOME
/usr/local/jdk1.6.0_32/bin

user@linux01:~$ ls $JAVA_HOME
appletviewer extcheck jar javac and so forth...

The Symptoms

Some geeks on StackOverflow pointed me in the right direction. Environmental variables sometimes don't persist when you sudo. I su'd as my hadoop user and ran the command again. Success!

The Cure

Of course, that's not the end of it. I went to start Hadoop using:

/usr/local/hadoop/bin/start-all.sh

I can't remember the exact error I got. Namenode and Jobtracker started right up, but Datanode, SecondaryNamenode and Tasktracker didn't. I did some digging, and the ones that worked are part of the Namenode, started by hadoop-daemon.sh. The ones that didn't are part of the Hadoop Datanode, and are started by hadoop-daemons.sh. The processes that were not starting all had error logs complaining about, guess what, JAVA_HOME not being set. Finally, I bit the bullet and hard-coded JAVA_HOME in conf/hadoop-env.sh.

Too long, didn't read

The moral of the story is, hard code JAVA_HOME in conf/hadoop-env.sh.

5 comments:

UnknownOctober 15, 2013 at 9:06 AM
Thanks man! I tried all methods, put JAVA_HOME in /etc/profile, .bash_profile, even in start-all.sh, still no luck.
I used the wrong file :-)
mareddyonlineJuly 19, 2014 at 9:49 PM
This comment has been removed by a blog administrator.
UnknownAugust 4, 2015 at 5:27 AM
This comment has been removed by a blog administrator.
AnonymousOctober 26, 2015 at 10:17 PM
This comment has been removed by a blog administrator.
UnknownDecember 27, 2015 at 11:12 PM
This comment has been removed by a blog administrator.

Chris Agocs learns something new

Thursday, June 7, 2012

Hadoop: JAVA_HOME is not set... Yes it is!

Background

The Symptoms

The Cure

Too long, didn't read

5 comments:

About Me