Java MultipleOutputs Class Code Examples


This article collects and summarizes typical usage examples of the Java class org.apache.hadoop.mapred.lib.MultipleOutputs. If you have been wondering what exactly the MultipleOutputs class does, how to use it, or what working code looks like, the curated examples below should help.



The MultipleOutputs class belongs to the org.apache.hadoop.mapred.lib package. Thirteen code examples of the class are presented below, ordered by popularity.
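Before working through the examples, it helps to see the canonical MultipleOutputs workflow in the old mapred API: register a named output while building the JobConf, fetch a collector for it inside a map or reduce task via getCollector(), and close the instance when the task finishes. The following is a minimal sketch; the reducer class and its routing logic are illustrative, and only the MultipleOutputs calls are the actual API.

import java.io.IOException;
import java.util.Iterator;

import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;
import org.apache.hadoop.mapred.lib.MultipleOutputs;

// Illustrative reducer that routes negative sums to a side output named "errors".
// Job setup elsewhere must have registered the named output first:
//   MultipleOutputs.addNamedOutput(conf, "errors",
//           TextOutputFormat.class, Text.class, LongWritable.class);
public class SideOutputReducer extends MapReduceBase
        implements Reducer<Text, LongWritable, Text, LongWritable> {

    private MultipleOutputs mos;

    @Override
    public void configure(JobConf conf) {
        mos = new MultipleOutputs(conf);
    }

    @Override
    @SuppressWarnings("unchecked")
    public void reduce(Text key, Iterator<LongWritable> values,
            OutputCollector<Text, LongWritable> output, Reporter reporter)
            throws IOException {
        long sum = 0;
        while (values.hasNext()) {
            sum += values.next().get();
        }
        if (sum >= 0) {
            output.collect(key, new LongWritable(sum));            // default output
        } else {
            // side output: records land in files named errors-r-xxxxx
            mos.getCollector("errors", reporter).collect(key, new LongWritable(sum));
        }
    }

    @Override
    public void close() throws IOException {
        mos.close();  // flushes and closes every side-output record writer
    }
}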

Example 1: getMultipleConvertedOutputs

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
public static CollectMultipleConvertedOutputs getMultipleConvertedOutputs(JobConf job)
{
	byte[] resultIndexes=MRJobConfiguration.getResultIndexes(job);
	Converter[] outputConverters=new Converter[resultIndexes.length];
	MatrixCharacteristics[] stats=new MatrixCharacteristics[resultIndexes.length];
	// map each result tag to the list of output indexes that share it
	HashMap<Byte, ArrayList<Integer>> tagMapping=new HashMap<>();
	for(int i=0; i<resultIndexes.length; i++)
	{
		byte output=resultIndexes[i];
		ArrayList<Integer> vec=tagMapping.get(output);
		if(vec==null)
		{
			vec=new ArrayList<>();
			tagMapping.put(output, vec);
		}
		vec.add(i);

		// one converter and one matrix-metadata entry per output
		outputConverters[i]=getOuputConverter(job, i);
		stats[i]=MRJobConfiguration.getMatrixCharacteristicsForOutput(job, output);
	}

	MultipleOutputs multipleOutputs=new MultipleOutputs(job);

	return new CollectMultipleConvertedOutputs(outputConverters, stats, multipleOutputs);
}
 
Developer: apache | Project: systemml | Lines: 27 | Source: MRJobConfiguration.java


Example 2: createJobConf

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
/**
 * Create a job configuration
 */
@SuppressWarnings("rawtypes")
public static JobConf createJobConf(String name, String topic, Props props, Class classobj) 
throws Exception {
    JobConf conf = getJobConf(name, props, classobj);
    
    conf.set("topic", topic);
    
    // input format
    conf.setInputFormat(KafkaETLInputFormat.class);

    //turn off mapper speculative execution
    conf.setMapSpeculativeExecution(false);
    
    // setup multiple outputs
    MultipleOutputs.addMultiNamedOutput(conf, "offsets", SequenceFileOutputFormat.class, 
                KafkaETLKey.class, BytesWritable.class);


    return conf;
}
 
Developer: yanfang724 | Project: hadoop-consumer | Lines: 24 | Source: KafkaETLJob.java
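Example 2 only registers the multi-named "offsets" output; the code that writes to it is not part of the snippet. The sketch below shows how such an output is typically consumed. The OffsetWriter class, the partSuffix parameter, and the kafka.etl package for KafkaETLKey are assumptions for illustration; getCollector() and close() are the real MultipleOutputs API.

import java.io.IOException;

import kafka.etl.KafkaETLKey;   // package assumed from the kafka contrib consumer layout
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.Reporter;
import org.apache.hadoop.mapred.lib.MultipleOutputs;

// Hypothetical writer side for the "offsets" multi-named output above.
public class OffsetWriter {

    private final MultipleOutputs mos;

    public OffsetWriter(JobConf job) {
        mos = new MultipleOutputs(job);   // usually created in configure()
    }

    @SuppressWarnings("unchecked")
    public void writeOffset(KafkaETLKey key, BytesWritable value,
            String partSuffix, Reporter reporter) throws IOException {
        // With a multi-named output the extra argument selects the concrete
        // file, producing part files such as offsets_<partSuffix>-r-00000.
        mos.getCollector("offsets", partSuffix, reporter).collect(key, value);
    }

    public void close() throws IOException {
        mos.close();                      // usually called in close()
    }
}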


Example 3: configure

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
@Override
public void configure(JobConf conf) {
    multipleOutputs = new MultipleOutputs(conf);
    currentTable = conf.get("org.acacia.partitioner.hbase.table");
    zookeeperhost = getZookeeperLocation();
    contactHost = conf.get("org.acacia.partitioner.index.contacthost");
    totalVertexCount = Long.parseLong(conf.get("vert-count"));
    initalPartitionID = Integer.parseInt(conf.get("initpartition-id"));
    zeroFlag = Boolean.parseBoolean(conf.get("zero-flag"));
    loadIndex();
}
 
Developer: miyurud | Project: Acacia | Lines: 12 | Source: EdgeDistributor.java
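The configure() above acquires the MultipleOutputs instance, but the task must also release it; without a matching close() the side-output writers may never be flushed. A minimal sketch of the counterpart method in the same reducer, assuming the class extends MapReduceBase and no other resources need cleanup:

@Override
public void close() throws IOException {
    // Counterpart to configure(): flush and close all named-output writers.
    if (multipleOutputs != null) {
        multipleOutputs.close();
    }
}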


Example 4: main

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
@SuppressWarnings("unused")
public static void main(String[] args) throws IOException {
    JobConf conf = new JobConf(EdgelistPartitioner.class);

    if (conf == null) {
        return;
    }
    String dir1 = "/user/miyuru/merged";
    String dir2 = "/user/miyuru/merged-out";

    // We first delete the temporary directories if they exist on the HDFS
    FileSystem fs1 = FileSystem.get(new JobConf());
    // only delete dir2 because dir1 is uploaded externally.
    if (fs1.exists(new Path(dir2))) {
        fs1.delete(new Path(dir2), true);
    }

    conf.setInputFormat(WholeFileInputFormat.class);
    conf.setOutputFormat(TextOutputFormat.class);

    WholeFileInputFormat.setInputPaths(conf, new Path(dir1));
    SequenceFileOutputFormat.setOutputPath(conf, new Path(dir2));

    conf.setOutputKeyClass(Text.class);
    conf.setOutputValueClass(Text.class);

    conf.setMapperClass(SequenceFileMapper.class);
    conf.setReducerClass(MultipleOutputsInvertedReducer.class);
    conf.setOutputFormat(NullOutputFormat.class);

    conf.setJobName("EdgelistPartitioner");

    MultipleOutputs.addMultiNamedOutput(conf, "partition",
            TextOutputFormat.class, NullWritable.class, Text.class);

    JobClient.runJob(conf);
}
 
Developer: miyurud | Project: Acacia | Lines: 38 | Source: EdgelistPartitioner.java
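Note that Example 4 calls conf.setOutputFormat() twice; the later NullOutputFormat assignment wins, so the job's default output is suppressed and records are emitted only through the "partition" multi-named output registered at the end. The output path set through SequenceFileOutputFormat.setOutputPath() is still required, because MultipleOutputs writes its named-output files under the job's output directory.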


Example 5: setUpMultipleOutputs

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
public static void setUpMultipleOutputs(JobConf job, byte[] resultIndexes, byte[] resultDimsUnknown, String[] outputs, 
		OutputInfo[] outputInfos, boolean inBlockRepresentation, boolean mayContainCtable) 
throws Exception
{
	if(resultIndexes.length!=outputs.length)
		throw new Exception("number of outputs and result indexes does not match");
	if(outputs.length!=outputInfos.length)
		throw new Exception("number of outputs and outputInfos indexes does not match");
	
	job.set(RESULT_INDEXES_CONFIG, MRJobConfiguration.getIndexesString(resultIndexes));
	job.set(RESULT_DIMS_UNKNOWN_CONFIG, MRJobConfiguration.getIndexesString(resultDimsUnknown));
	job.setStrings(OUTPUT_MATRICES_DIRS_CONFIG, outputs);
	job.setOutputCommitter(MultipleOutputCommitter.class);
	
	for(int i=0; i<outputs.length; i++)
	{
		MapReduceTool.deleteFileIfExistOnHDFS(new Path(outputs[i]), job);
		if ( mayContainCtable && resultDimsUnknown[i] == (byte) 1 )  {
			setOutputInfo(job, i, outputInfos[i], false);
		}
		else {
			setOutputInfo(job, i, outputInfos[i], inBlockRepresentation);
		}
		MultipleOutputs.addNamedOutput(job, Integer.toString(i), 
				outputInfos[i].outputFormatClass, outputInfos[i].outputKeyClass, 
				outputInfos[i].outputValueClass);
	}
	job.setOutputFormat(NullOutputFormat.class);
	
	// configure temp output
	Path tempOutputPath = new Path( constructTempOutputFilename() );
	FileOutputFormat.setOutputPath(job, tempOutputPath);
	MapReduceTool.deleteFileIfExistOnHDFS(tempOutputPath, job);
}
 
Developer: apache | Project: systemml | Lines: 35 | Source: MRJobConfiguration.java
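Example 5 applies the same pattern at scale: each result matrix i is registered as a named output called Integer.toString(i) (named outputs may contain only letters and digits, so a plain index is a safe name), the job's own output format is NullOutputFormat so that nothing bypasses the named outputs, and a throw-away temporary output path is configured because the framework still requires an output directory for task setup and commit.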


Example 6: CollectMultipleConvertedOutputs

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
public CollectMultipleConvertedOutputs(Converter[] convts, MatrixCharacteristics[] stats, 
		MultipleOutputs outputs)
{
	outputConverters=convts;
	multipleOutputs=outputs;
	matrixStats=stats;
}
 
Developer: apache | Project: systemml | Lines: 8 | Source: CollectMultipleConvertedOutputs.java


Example 7: KafkaETLContext

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
/**
 * construct using input string
 */
@SuppressWarnings("unchecked")
public KafkaETLContext(JobConf job, Props props, Reporter reporter, 
                                MultipleOutputs mos, int index, String input) 
throws Exception {
    
    _bufferSize = getClientBufferSize(props);
    _timeout = getClientTimeout(props);
    System.out.println("bufferSize=" +_bufferSize);
    System.out.println("timeout=" + _timeout);
    _reporter = reporter;
    _mos = mos;
    
    // read topic and current offset from input
    _index= index; 
    _input = input;
    _request = new KafkaETLRequest(input.trim());
    
    // read data from queue
    URI uri = _request.getURI();
    _consumer = new SimpleConsumer(uri.getHost(), uri.getPort(), _timeout, _bufferSize);
    
    // get available offset range
    _offsetRange = getOffsetRange();
    System.out.println("Connected to node " + uri 
            + " beginning reading at offset " + _offsetRange[0]
            + " latest offset=" + _offsetRange[1]);

    _offset = _offsetRange[0];
    _count = 0;
    _requestTime = 0;
    _retry = 0;
    
    _startTime = System.currentTimeMillis();
}
 
Developer: yanfang724 | Project: hadoop-consumer | Lines: 38 | Source: KafkaETLContext.java


Example 8: KafkaETLRecordReader

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
public KafkaETLRecordReader(InputSplit split, JobConf job, Reporter reporter) 
throws IOException {
   super(job, (FileSplit) split);
   
   _props = KafkaETLUtils.getPropsFromJob(job);
   _contextList = new ArrayList<KafkaETLContext>();
   _job = job;
   _reporter = reporter;
   _contextIndex = -1;
   _mos = new MultipleOutputs(job);
   try {
       _limit = _props.getInt("kafka.request.limit", -1);
       
       /* get attempt id */
       String taskId = _job.get("mapred.task.id");
       if (taskId == null) {
           throw new IllegalArgumentException(
                             "Configuration does not contain the property mapred.task.id");
       }
       String[] parts = taskId.split("_");
       if (    parts.length != 6 || !parts[0].equals("attempt") 
            || (!"m".equals(parts[3]) && !"r".equals(parts[3]))) {
               throw new IllegalArgumentException(
                             "TaskAttemptId string : " + taskId + " is not properly formed");
       }
      _attemptId = parts[4]+parts[3];
   }catch (Exception e) {
       throw new IOException (e);
   }
}
 
Developer: yanfang724 | Project: hadoop-consumer | Lines: 31 | Source: KafkaETLRecordReader.java
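To see what the attempt-ID parsing in Example 8 produces, take an illustrative task attempt ID such as attempt_200707121733_0003_r_000004_0: splitting on "_" yields the six parts ["attempt", "200707121733", "0003", "r", "000004", "0"], the format check passes, and _attemptId becomes parts[4] + parts[3] = "000004r".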


Example 9: main

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
public static void main(String[] args) throws IOException,
        InterruptedException, ClassNotFoundException {
    String dir1 = "/user/miyuru/input";
    String dir2 = "/user/miyuru/edgedistributed-out";

    // We first delete the temporary directories if they exist on the HDFS
    FileSystem fs1 = FileSystem.get(new JobConf());
    if (fs1.exists(new Path(dir2))) {
        fs1.delete(new Path(dir2), true);
    }

    // First job scans through the edge list and splits the edges in to
    // separate files based on the partitioned vertex files.

    JobConf conf = new JobConf(EdgeDistributor.class);
    conf.set("org.acacia.partitioner.hbase.zookeeper.quorum", args[0]);
    conf.set("org.acacia.partitioner.hbase.table", args[1]);
    conf.set("org.acacia.partitioner.index.contacthost", args[2]);
    conf.set("vert-count", args[3]);
    conf.set("initpartition-id", args[4]);
    conf.set("zero-flag", args[5]);
    conf.setOutputKeyClass(LongWritable.class);
    conf.setOutputValueClass(Text.class);
    conf.setMapperClass(FileMapper.class);
    conf.setReducerClass(FileReducer.class);
    // conf.setInputFormat(TextInputFormat.class);
    conf.setInputFormat(NLinesInputFormat.class);
    conf.setOutputFormat(TextOutputFormat.class);
    conf.setNumReduceTasks(96); // Need to specify the number of reduce
                                // tasks explicitly. Otherwise it creates
                                // only one reduce task.

    FileInputFormat.setInputPaths(conf, new Path(dir1));
    FileOutputFormat.setOutputPath(conf, new Path(dir2));

    MultipleOutputs.addMultiNamedOutput(conf, "partition",
            TextOutputFormat.class, NullWritable.class, Text.class);

    Job job = new Job(conf, "EdgeDistributor");
    job.waitForCompletion(true);

    System.out.println("Done job EdgeDistribution");
}
 
Developer: miyurud | Project: Acacia | Lines: 44 | Source: EdgeDistributor.java


Example 10: configure

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
@Override
public void configure(JobConf conf) {
    multipleOutputs = new MultipleOutputs(conf);
}
 
Developer: miyurud | Project: Acacia | Lines: 5 | Source: EdgelistPartitioner.java


Example 11: MultipleOutputsCloseableAdapter

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
public MultipleOutputsCloseableAdapter(MultipleOutputs mo) {
  this.mo = mo;
}
 
Developer: saradelrio | Project: Chi-FRBCS-BigDataCS | Lines: 4 | Source: IOUtils.java
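Example 11 shows only the constructor. The rest of the adapter presumably just forwards close(); a sketch of what the complete class likely looks like follows, where everything except the MultipleOutputs.close() call is an assumption.

import java.io.Closeable;
import java.io.IOException;

import org.apache.hadoop.mapred.lib.MultipleOutputs;

// Sketch: wraps MultipleOutputs in java.io.Closeable so it can be handled
// by generic resource-management utilities (e.g. IOUtils-style cleanup).
public class MultipleOutputsCloseableAdapter implements Closeable {

    private final MultipleOutputs mo;

    public MultipleOutputsCloseableAdapter(MultipleOutputs mo) {
        this.mo = mo;
    }

    @Override
    public void close() throws IOException {
        if (mo != null) {
            mo.close();  // the only cleanup MultipleOutputs requires
        }
    }
}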


Example 12: run

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
@Override
public int run(String[] args) throws Exception {

	JobConf conf = new JobConf(RedirectStep.class);
	DumpExtractor.configureJob(conf, args) ;
	
	conf.setJobName("WM: resolve redirects");

	conf.setOutputKeyClass(IntWritable.class);
	conf.setOutputValueClass(DbIntList.class);		

	conf.setMapperClass(Step2Mapper.class);
	conf.setCombinerClass(Step2Reducer.class) ;
	conf.setReducerClass(Step2Reducer.class) ;

	// set up input

	conf.setInputFormat(TextInputFormat.class);
	FileInputFormat.setInputPaths(conf, new Path(conf.get(DumpExtractor.KEY_OUTPUT_DIR) + "/" + DumpExtractor.getDirectoryName(ExtractionStep.page) + "/" + PageStep.Output.tempRedirect + "*"));

	//set up output

	conf.setOutputFormat(RedirectOutputFormat.class);
	FileOutputFormat.setOutputPath(conf, new Path(conf.get(DumpExtractor.KEY_OUTPUT_DIR) + "/" + DumpExtractor.getDirectoryName(ExtractionStep.redirect)));

	//set up distributed cache

	DistributedCache.addCacheFile(new Path(conf.get(DumpExtractor.KEY_OUTPUT_DIR) + "/" + DumpExtractor.OUTPUT_SITEINFO).toUri(), conf);
	DistributedCache.addCacheFile(new Path(conf.get(DumpExtractor.KEY_LANG_FILE)).toUri(), conf);

	//cache page files created in previous step, so we can look up pages by title
	Path pageStepPath = new Path(conf.get(DumpExtractor.KEY_OUTPUT_DIR) + "/" + DumpExtractor.getDirectoryName(ExtractionStep.page)) ;
	for (FileStatus fs:FileSystem.get(conf).listStatus(pageStepPath)) {

		if (fs.getPath().getName().startsWith(PageStep.Output.tempPage.name())) {
			Logger.getLogger(RedirectStep.class).info("Cached page file " + fs.getPath()) ;
			DistributedCache.addCacheFile(fs.getPath().toUri(), conf);
		}
	}
	
	MultipleOutputs.addNamedOutput(conf, Output.redirectTargetsBySource.name(), TextOutputFormat.class,
			IntWritable.class, IntWritable.class);
	
	conf.set("mapred.textoutputformat.separator", ",");
	
	//run job
	JobClient.runJob(conf);
	return 0;
}
 
Developer: busk | Project: WikipediaMiner | Lines: 50 | Source: RedirectStep.java


Example 13: configure

import org.apache.hadoop.mapred.lib.MultipleOutputs; // import the required package/class
@Override
public void configure(JobConf job) {

	HashSet<PageType> pageTypesToCache = new HashSet<PageType>() ;
	pageTypesToCache.add(PageType.article) ;
	pageTypesToCache.add(PageType.redirect) ;
	pageTypesToCache.add(PageType.disambiguation) ;

	try {

		Path[] cacheFiles = DistributedCache.getLocalCacheFiles(job);

		for (Path cf:cacheFiles) {

			if (cf.getName().equals(new Path(DumpExtractor.OUTPUT_SITEINFO).getName())) {
				si = new SiteInfo(cf) ;
			}

			if (cf.getName().equals(new Path(job.get(DumpExtractor.KEY_LANG_FILE)).getName())) {
				lc = new LanguageConfiguration(job.get(DumpExtractor.KEY_LANG_CODE), cf) ;
			}

			if (cf.getName().startsWith(PageStep.Output.tempPage.name())) {
				Logger.getLogger(Step2Mapper.class).info("Located cached page file " + cf.toString()) ;
				pageFiles.add(cf) ;
			}
		}

		if (si == null) 
			throw new Exception("Could not locate '" + DumpExtractor.OUTPUT_SITEINFO + "' in DistributedCache") ;

		if (lc == null) 
			throw new Exception("Could not locate '" + job.get(DumpExtractor.KEY_LANG_FILE) + "' in DistributedCache") ;

		if (pageFiles.isEmpty())
			throw new Exception("Could not gather page summary files produced in step 1") ;
		
		mos = new MultipleOutputs(job);

	} catch (Exception e) {
		Logger.getLogger(Step2Mapper.class).error("Could not configure mapper", e);
		System.exit(1) ;
	}
}
 
Developer: busk | Project: WikipediaMiner | Lines: 45 | Source: RedirectStep.java
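Examples 12 and 13 register the redirectTargetsBySource named output and acquire the MultipleOutputs instance (mos), but the code that writes to it is not shown. Based on the registration in Example 12, emitting a record would look roughly like the following sketch inside the same mapper class; the helper method and its arguments are illustrative, and only the MultipleOutputs call is real API.

// Hypothetical emit helper inside Step2Mapper; the surrounding method is a sketch.
@SuppressWarnings("unchecked")
private void emitRedirect(int sourceId, int targetId, Reporter reporter)
        throws IOException {
    mos.getCollector(Output.redirectTargetsBySource.name(), reporter)
       .collect(new IntWritable(sourceId), new IntWritable(targetId));
    // With "mapred.textoutputformat.separator" set to "," in Example 12,
    // each record is written as "sourceId,targetId" into the named output's
    // part files (-m- or -r-, depending on the task) under the job output directory.
}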



Note: The org.apache.hadoop.mapred.lib.MultipleOutputs examples in this article were collected from open-source projects on platforms such as GitHub and MSDocs. The copyright of each snippet remains with its original author; before redistributing or reusing the code, consult the license of the corresponding project. Do not reproduce this article without permission.

