Installing Apache Pig on Ubuntu Linux 11.10 (Oneiric Ocelot)

I am running Ubuntu Linux and wanted to install Apache Pig. Cloudera offer some cool ubuntu and debian hadoop packages, however they don't, currently, offer oneiric ocelot packages. Other people have run into this problem.

If you get errors like this:

W: Failed to fetch http://archive.cloudera.com/debian/dists/oneiric-cdh3/contrib/source/Sources
404  Not Found

When running sudo aptitude update after following the Cloudera ubuntu instructions, then this is the solution for you.

It's possible to hack it. I have only played a little bit around with this, so there might be some stupid and subtle bugs that will feck things up later for you. This is based on this suggestion

To install it on Oneiric Ocelot 11.10, just pretend you're running Maverick Meerkat 11.04!

Add a file called, /etc/apt/source.list.d/cloudera.list with content:

deb http://archive.cloudera.com/debian maverick-cdh3 contrib
deb-src http://archive.cloudera.com/debian maverick-cdh3 contrib

Then update your apt to contact the new repository.

sudo aptitude update

Install Apache Pig with:

sudo aptitude install hadoop-pig

After installing it, you can run it with pig on the command line. Find out how to fix JAVA_HOME is not set errors.

This entry is tagged: