Sunday 15 April 2012

hadoop - distcp2 in CDH5.2 with MR1 -



hadoop - distcp2 in CDH5.2 with MR1 -

we have requirement restrict mappers bandwidth when distcp s3 local cluster.

so downloaded hadoop-distcp-2.5.0-cdh5.2.0-20141009.063640-188.jar https://repository.cloudera.com

here link : https://repository.cloudera.com/artifactory/public/org/apache/hadoop/hadoop-distcp/2.5.0-cdh5.2.0-snapshot/hadoop-distcp-2.5.0-cdh5.2.0-20141009.063640-188.jar

then ran next distcp command got error. don't know if doing wrong g

export hadoop_user_classpath_first=true && hadoop_classpath=hadoop-distcp-2.5.0-cdh5.2.0-20141009.063640-188.jar hadoop org.apache.hadoop.tools.distcp -bandwidth 1 s3n://com.xyz/2014/10/23/ hdfs:///user/abc/2014-10-23/ 14/11/05 09:54:55 info tools.distcp: input options: distcpoptions{atomiccommit=false, syncfolder=false, deletemissing=false, ignorefailures=false, maxmaps=20, sslconfigurationfile='null', copystrategy='uniformsize', sourcefilelisting=null, sourcepaths=[s3n://com.xyz.rtb/2014/10/23], targetpath=hdfs:/user/abc/2014-10-23, targetpathexists=true, preserverawxattrs=false} exception in thread "main" java.lang.nosuchmethoderror: org.apache.hadoop.mapreduce.jobsubmissionfiles.getstagingdir(lorg/apache/hadoop/mapreduce/cluster;lorg/apache/hadoop/conf/configuration;)lorg/apache/hadoop/fs/path; @ org.apache.hadoop.tools.distcp.createmetafolderpath(distcp.java:379) @ org.apache.hadoop.tools.distcp.execute(distcp.java:155) @ org.apache.hadoop.tools.distcp.run(distcp.java:121) @ org.apache.hadoop.util.toolrunner.run(toolrunner.java:70) @ org.apache.hadoop.tools.distcp.main(distcp.java:401)

any help

https://groups.google.com/a/cloudera.org/forum/#!topic/cdh-user/ld60ty_z-jg

the distcp2 default , distcp in mr2. isn't called distcp2, distcp.

the command 'hadoop distcp' in mr2 runs distcp2.

hadoop cloudera-cdh distcp

No comments:

Post a Comment