I was trying to use Hadoop's streaming pattern to use python code on a largish data set. However, uploading my data to the cluster actually takes approximately forever (I've not yet actually succeeded ...
Hello, I'm trying to pickle a third-party lib object, and I get the following error:<BR><BR>(cPickle)<BR><BR>cPickle.PicklingError: Can't pickle <type 'function ...