빅데이터공부하기 56_1 MyMapper.map

행복한짱짱이 2014. 11. 21. 15:34

2014. 11. 21. 15:34

MyMapper1.map

- 2M.Title.ID가 이 Mapper에 의해 처리

- map 함수의 키로는 Title스트링이, 밸류로는 그 타이틀을 갖는 문서의 ID가 들어옴

- ID를 키로 그리고 Title+"＼t+1"을 밸류로 Reducer로 넘긴다.

public static class MyMapper1 extends Mapper<Text, Text, Text, Text>

{

@Override

protected void map (Text key, Text value, final Context context) throws IOException, InterruptedException

{

context.write(value, new Text(key+"＼t+1"));

context.getCounter("Stats", "Number of Title+DocID").increment(1);

}

MyMapper2.map

- CountCitation(과 TopN)의 결과가 이 Mapper에 의해 처리

- map 함수의 키로는 ID가 들어오고 밸류로는 빈도수가 들어온다.

- Reducer로 출력할 때는 ID를 그대로 키로 그리고 빈도수 +"＼t"+2을 밸류로 넘긴다.

public static class MyMapper2 extends Mapper<Text, Text, Text, Text>

{

@Override

protected void map(Text key, Text value, final Context context)

throws IOException, Interrupted Exception

{

context.write(key, new Text(value+"＼t"+2));

context.getCounter("Stats",DocID+Citation").increment(1);

}

it개발자스터디공간