下载hadoop:Apache Hadoop
下载下来之后以管理员身份解压:或者使用7-zip
配置环境变量:
4.0.0
com.xuan
hadoopdemo
1.0-SNAPSHOT
hadoopdemo
http://www.example.com
UTF-8
1.8
1.8
junit
junit
4.11
test
org.apache.hadoop
hadoop-common
2.5.2
org.apache.hadoop
hadoop-hdfs
2.5.2
org.apache.hadoop
hadoop-client
2.5.2
junit
junit
4.11
test
下载:winutils
package com.hadoop;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import java.io.IOException;
import java.util.StringTokenizer;
public class WordCount {
public static class TokenizerMapper extends Mapper
运行程序可能找不到job:hadoop.dll放到C:WindowsSystem32下面
file1.txt
q
w
ww
q
we
qwe
as
q
w
ww
q
w
we
点击运行:
提示错误:
Error: JAVA_HOME is incorrectly set.
H:yingjianhadoophadoop-2.10.1etchadoophadoop-env
JAVA_HOME目录是C:Program FilesJavajdk1.8.0_121,因为Program Files中存在空格,所以出现错误,只需要用
PROGRA~1代替Program Files即可,即改为C:PROGRA~1Javajdk1.8.0_121
记录存在一个问题:
上面的output输出目录为空,找了一晚上资料,最后把idea重启了一下突然出来了,我操
!!