Java FileIterator Class Code Examples


This article collects typical usage examples of the Java class cc.mallet.pipe.iterator.FileIterator. If you are wondering how FileIterator is used in practice, or are looking for working examples, the selected snippets below may help.

The FileIterator class belongs to the cc.mallet.pipe.iterator package. Nine code examples are shown below, sorted by popularity by default.

Example 1: main

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public static void main(String[] args) {
	String htmldir = args[0];
	Pipe pipe = new SerialPipes(new Pipe[] { new Input2CharSequence(),
			new CharSequenceRemoveHTML() });
	InstanceList list = new InstanceList(pipe);
	list.addThruPipe(new FileIterator(htmldir, FileIterator.STARTING_DIRECTORIES));

	for (int index = 0; index < list.size(); index++) {
		Instance inst = list.get(index);
		System.err.println(inst.getData());
	}

}

Developer ID: kostagiolasn, Project: NucleosomePatternClassifier, Lines of code: 14, Source: CharSequenceRemoveHTML.java
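Example 1 passes FileIterator.STARTING_DIRECTORIES, which asks for each file to be labeled with the directory the iteration started from. The block below is a standard-library sketch of that idea only, not MALLET's implementation: the names DirectoryWalkSketch, LabeledFile, and listFiles are hypothetical, and the sketch simply collects regular files recursively and pairs each one with its starting directory.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Stream;

// Hypothetical helper (not part of MALLET): walk directories and
// label every file with the directory the walk started from.
class DirectoryWalkSketch {

    /** A file paired with the label derived from its starting directory. */
    static final class LabeledFile {
        final Path file;
        final String label;

        LabeledFile(Path file, String label) {
            this.file = file;
            this.label = label;
        }
    }

    /** Recursively collects regular files under each starting directory. */
    static List<LabeledFile> listFiles(List<Path> startingDirectories) {
        List<LabeledFile> result = new ArrayList<>();
        for (Path dir : startingDirectories) {
            if (!Files.isDirectory(dir)) {
                throw new IllegalArgumentException(dir + " is not a directory.");
            }
            try (Stream<Path> paths = Files.walk(dir)) {
                paths.filter(Files::isRegularFile)
                     .sorted()
                     .forEach(p -> result.add(new LabeledFile(p, dir.toString())));
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
        return result;
    }

    public static void main(String[] args) throws IOException {
        // Build a tiny throwaway corpus: two class directories, one file each.
        Path root = Files.createTempDirectory("corpus");
        Path a = Files.createDirectories(root.resolve("a"));
        Path b = Files.createDirectories(root.resolve("b"));
        Files.write(a.resolve("doc1.txt"), "hello".getBytes(StandardCharsets.UTF_8));
        Files.write(b.resolve("doc2.txt"), "goodbye".getBytes(StandardCharsets.UTF_8));

        for (LabeledFile lf : listFiles(Arrays.asList(a, b))) {
            System.out.println(lf.label + " -> " + lf.file.getFileName());
        }
    }
}
```

In the sketch the label is the full starting path; a real pipeline would normally map such labels to class indices in a later step, as Target2Label does in the examples that follow.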

Example 2: testThree

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public void testThree ()
{
	InstanceList il = new InstanceList (
		new SerialPipes(new Pipe[] {
			new Target2Label(),
			new CharSequence2TokenSequence(),
			new TokenSequenceLowercase(),
			new TokenSequenceRemoveStopwords(),
			new TokenSequence2FeatureSequence(),
			new FeatureSequence2FeatureVector()
		}));
	Iterator<Instance> pi = new FileIterator(new File("foo/bar"), null, Pattern.compile("^([^/]*)/"));
	il.addThruPipe (pi);
}

Developer ID: mimno, Project: Mallet, Lines of code: 15, Source: TestRainbowStyle.java
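Example 2 supplies its own target pattern, `^([^/]*)/`, instead of one of the predefined constants. The sketch below shows what that regular expression captures from a path, assuming the pattern is applied with find() and the first capturing group is taken as the label. The class TargetPatternSketch and the method extractLabel are hypothetical names used only for illustration.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper showing what the pattern from Example 2 captures.
class TargetPatternSketch {

    /** Returns capturing group 1 if the pattern is found in the path, else null. */
    static String extractLabel(Pattern targetPattern, String path) {
        Matcher m = targetPattern.matcher(path);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        // Same expression as in Example 2: everything before the first '/'.
        Pattern firstSegment = Pattern.compile("^([^/]*)/");

        System.out.println(extractLabel(firstSegment, "foo/bar/doc.txt")); // prints "foo"
        System.out.println(extractLabel(firstSegment, "doc.txt"));         // prints "null" (no '/' in the path)
    }
}
```

Because `[^/]*` may match zero characters, a path that begins with `/` yields an empty label under this pattern, which is worth keeping in mind when absolute paths are used.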

Example 3: testIncrementallyTrainedGrowingAlphabets

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public void testIncrementallyTrainedGrowingAlphabets()
{
	System.out.println("testIncrementallyTrainedGrowingAlphabets");
	String[]    args = new String[] {
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/a",
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/b"
	};

	File[] directories = new File[args.length];
	for (int i = 0; i < args.length; i++)
		directories[i] = new File (args[i]);

	SerialPipes instPipe =
		// MALLET pipeline for converting instances to feature vectors
		new SerialPipes(new Pipe[] {
				new Target2Label(),
				new Input2CharSequence(),
				//SKIP_HEADER only works for Unix
				//new CharSubsequence(CharSubsequence.SKIP_HEADER),
				new CharSequence2TokenSequence(),
				new TokenSequenceLowercase(),
				new TokenSequenceRemoveStopwords(),
				new TokenSequence2FeatureSequence(),
				new FeatureSequence2FeatureVector() });

	InstanceList instList = new InstanceList(instPipe);
	instList.addThruPipe(new
			FileIterator(directories, FileIterator.STARTING_DIRECTORIES));

	System.out.println("Training 1");
	NaiveBayesTrainer trainer = new NaiveBayesTrainer();
	NaiveBayes classifier = trainer.trainIncremental(instList);

	//instList.getDataAlphabet().stopGrowth();

	// incrementally train...
	String[] t2directories = {
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/b"
	};

	System.out.println("data alphabet size " + instList.getDataAlphabet().size());
	System.out.println("target alphabet size " + instList.getTargetAlphabet().size());
	InstanceList instList2 = new InstanceList(instPipe);
	instList2.addThruPipe(new
			FileIterator(t2directories, FileIterator.STARTING_DIRECTORIES));

	System.out.println("Training 2");

	System.out.println("data alphabet size " + instList2.getDataAlphabet().size());
	System.out.println("target alphabet size " + instList2.getTargetAlphabet().size());

	NaiveBayes classifier2 = (NaiveBayes) trainer.trainIncremental(instList2);
}

Developer ID: kostagiolasn, Project: NucleosomePatternClassifier, Lines of code: 54, Source: TestNaiveBayes.java
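Example 3 builds both instance lists with the same pipe and leaves the stopGrowth() call commented out, so the alphabet is allowed to gain entries between the two training rounds. The toy class below illustrates that "growing alphabet" idea with plain collections. It is not MALLET's Alphabet class; GrowingAlphabetSketch and its methods are hypothetical, and returning -1 for unknown entries after freezing is a choice made for this sketch.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy stand-in for a feature alphabet; NOT MALLET's Alphabet class.
class GrowingAlphabetSketch {
    private final Map<String, Integer> indexOf = new HashMap<>();
    private final List<String> entries = new ArrayList<>();
    private boolean growthStopped = false;

    /** Returns the entry's index, adding it if growth is allowed; -1 if unknown and frozen. */
    int lookupIndex(String entry) {
        Integer existing = indexOf.get(entry);
        if (existing != null) {
            return existing;
        }
        if (growthStopped) {
            return -1;
        }
        int index = entries.size();
        indexOf.put(entry, index);
        entries.add(entry);
        return index;
    }

    /** Freezes the alphabet so that later lookups no longer add entries. */
    void stopGrowth() {
        growthStopped = true;
    }

    int size() {
        return entries.size();
    }

    public static void main(String[] args) {
        GrowingAlphabetSketch alphabet = new GrowingAlphabetSketch();

        // First batch: "hello" appears twice but is stored once.
        for (String token : "hello everybody hello".split(" ")) {
            alphabet.lookupIndex(token);
        }
        System.out.println("size after batch 1: " + alphabet.size()); // 2

        // Second batch adds two unseen tokens, so the alphabet grows.
        for (String token : "goodbye now".split(" ")) {
            alphabet.lookupIndex(token);
        }
        System.out.println("size after batch 2: " + alphabet.size()); // 4

        alphabet.stopGrowth();
        System.out.println("unknown after freeze: " + alphabet.lookupIndex("later")); // -1
    }
}
```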

Example 4: testIncrementallyTrained

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public void testIncrementallyTrained()
{
	System.out.println("testIncrementallyTrained");
	String[]    args = new String[] {
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/a",
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/b"
	};

	File[] directories = new File[args.length];
	for (int i = 0; i < args.length; i++)
		directories[i] = new File (args[i]);

	SerialPipes instPipe =
		// MALLET pipeline for converting instances to feature vectors
		new SerialPipes(new Pipe[] {
				new Target2Label(),
				new Input2CharSequence(),
				//SKIP_HEADER only works for Unix
				//new CharSubsequence(CharSubsequence.SKIP_HEADER),
				new CharSequence2TokenSequence(),
				new TokenSequenceLowercase(),
				new TokenSequenceRemoveStopwords(),
				new TokenSequence2FeatureSequence(),
				new FeatureSequence2FeatureVector() });

	InstanceList instList = new InstanceList(instPipe);
	instList.addThruPipe(new
			FileIterator(directories, FileIterator.STARTING_DIRECTORIES));

	System.out.println("Training 1");
	NaiveBayesTrainer trainer = new NaiveBayesTrainer();
	NaiveBayes classifier = (NaiveBayes) trainer.trainIncremental(instList);

	Classification initialClassification = classifier.classify("Hello Everybody");
	Classification initial2Classification = classifier.classify("Goodbye now");
	System.out.println("Initial Classification = ");
	initialClassification.print();
	initial2Classification.print();
	System.out.println("data alphabet " + classifier.getAlphabet());
	System.out.println("label alphabet " + classifier.getLabelAlphabet());


	// incrementally train...
	String[] t2directories = {
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/b"
	};

	System.out.println("data alphabet size " + instList.getDataAlphabet().size());
	System.out.println("target alphabet size " + instList.getTargetAlphabet().size());
	InstanceList instList2 = new InstanceList(instPipe);
	instList2.addThruPipe(new
			FileIterator(t2directories, FileIterator.STARTING_DIRECTORIES));

	System.out.println("Training 2");

	System.out.println("data alphabet size " + instList2.getDataAlphabet().size());
	System.out.println("target alphabet size " + instList2.getTargetAlphabet().size());

	NaiveBayes classifier2 = (NaiveBayes) trainer.trainIncremental(instList2);


}

Developer ID: kostagiolasn, Project: NucleosomePatternClassifier, Lines of code: 63, Source: TestNaiveBayes.java

Example 5: testEmptyStringBug

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public void testEmptyStringBug()
{
	System.out.println("testEmptyStringBug");
	String[]    args = new String[] {
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/a",
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/b"
	};

	File[] directories = new File[args.length];
	for (int i = 0; i < args.length; i++)
		directories[i] = new File (args[i]);

	SerialPipes instPipe =
		// MALLET pipeline for converting instances to feature vectors
		new SerialPipes(new Pipe[] {
				new Target2Label(),
				new Input2CharSequence(),
				//SKIP_HEADER only works for Unix
				//new CharSubsequence(CharSubsequence.SKIP_HEADER),
				new CharSequence2TokenSequence(),
				new TokenSequenceLowercase(),
				new TokenSequenceRemoveStopwords(),
				new TokenSequence2FeatureSequence(),
				new FeatureSequence2FeatureVector() });

	InstanceList instList = new InstanceList(instPipe);
	instList.addThruPipe(new
			FileIterator(directories, FileIterator.STARTING_DIRECTORIES));

	System.out.println("Training 1");
	NaiveBayesTrainer trainer = new NaiveBayesTrainer();
	NaiveBayes classifier = (NaiveBayes) trainer.trainIncremental(instList);

	Classification initialClassification = classifier.classify("Hello Everybody");
	Classification initial2Classification = classifier.classify("Goodbye now");
	System.out.println("Initial Classification = ");
	initialClassification.print();
	initial2Classification.print();
	System.out.println("data alphabet " + classifier.getAlphabet());
	System.out.println("label alphabet " + classifier.getLabelAlphabet());


	// test
	String[] t2directories = {
			"src/cc/mallet/classify/tests/NaiveBayesData/learn/b"
	};

	System.out.println("data alphabet size " + instList.getDataAlphabet().size());
	System.out.println("target alphabet size " + instList.getTargetAlphabet().size());
	InstanceList instList2 = new InstanceList(instPipe);
	instList2.addThruPipe(new
			FileIterator(t2directories, FileIterator.STARTING_DIRECTORIES, true));

	System.out.println("Training 2");

	System.out.println("data alphabet size " + instList2.getDataAlphabet().size());
	System.out.println("target alphabet size " + instList2.getTargetAlphabet().size());

	NaiveBayes classifier2 = (NaiveBayes) trainer.trainIncremental(instList2);
	Classification secondClassification = classifier.classify("Goodbye now");
	secondClassification.print();

}

Developer ID: kostagiolasn, Project: NucleosomePatternClassifier, Lines of code: 64, Source: TestNaiveBayes.java

Example 6: main

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public static void main (String[] args) throws IOException {
	CommandOption.setSummary(Text2Clusterings.class,
			"A tool to convert a list of text files to a Clusterings.");
	CommandOption.process(Text2Clusterings.class, args);

	if (classDirs.value.length == 0) {
		// Opening quote and the space before "list" were missing in the message.
		logger.warning("You must include '--input DIR1 DIR2 ...' in order to specify a "
				+ "list of directories containing the documents for each class.");
		System.exit(-1);
	}

	Clustering[] clusterings = new Clustering[classDirs.value.length];
	int fi = 0;
	for (int i = 0; i < classDirs.value.length; i++) {
		Alphabet fieldAlph = new Alphabet();
		Alphabet valueAlph = new Alphabet();
		File directory = new File(classDirs.value[i]);
		File[] subdirs = getSubDirs(directory);
		Alphabet clusterAlph = new Alphabet();
		InstanceList instances = new InstanceList(new Noop());
		TIntArrayList labels = new TIntArrayList();
		for (int j = 0; j < subdirs.length; j++) {
			ArrayList<File> records = new FileIterator(subdirs[j]).getFileArray();
			int label = clusterAlph.lookupIndex(subdirs[j].toString());
			for (int k = 0; k < records.size(); k++) {
				if (fi % 100 == 0) System.out.print(fi);
				else if (fi % 10 == 0) System.out.print(".");
				if (fi % 1000 == 0 && fi > 0) System.out.println();
				System.out.flush();
				fi++;


				File record = records.get(k);
				labels.add(label);
				instances.add(new Instance(new Record(fieldAlph, valueAlph, parseFile(record)),
											new Integer(label), record.toString(),
											record.toString()));
			}
		}
		clusterings[i] =
				new Clustering(instances, subdirs.length, labels.toNativeArray());
	}

	logger.info("\nread " + fi + " objects in " + clusterings.length + " clusterings.");
	try {
		ObjectOutputStream oos =
				new ObjectOutputStream(new FileOutputStream(outputFile.value));
		oos.writeObject(new Clusterings(clusterings));
		oos.close();
	} catch (Exception e) {
		logger.warning("Exception writing clustering to file " + outputFile.value
										+ " " + e);
		e.printStackTrace();
	}

}

Developer ID: kostagiolasn, Project: NucleosomePatternClassifier, Lines of code: 59, Source: Text2Clusterings.java
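Example 6 ends by writing the result with an ObjectOutputStream and closing it by hand, which leaves the stream open if writeObject throws. A common alternative is try-with-resources. The sketch below shows that pattern with an in-memory buffer instead of a file so that it runs on its own. SerializationSketch and its two methods are hypothetical, and an int array stands in for the Clusterings object.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.UncheckedIOException;
import java.util.Arrays;

// Hypothetical helper: Java serialization with automatic stream closing.
class SerializationSketch {

    /** Serializes a value; the stream is closed even if writeObject fails. */
    static byte[] serialize(Object value) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream oos = new ObjectOutputStream(bytes)) {
            oos.writeObject(value);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    /** Reads back a value written by serialize. */
    static Object deserialize(byte[] data) {
        try (ObjectInputStream ois = new ObjectInputStream(new ByteArrayInputStream(data))) {
            return ois.readObject();
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException("Could not read serialized object", e);
        }
    }

    public static void main(String[] args) {
        int[] labels = {0, 0, 1, 2};
        int[] restored = (int[]) deserialize(serialize(labels));
        System.out.println(Arrays.toString(restored)); // [0, 0, 1, 2]
    }
}
```

To write to disk as the example does, the ByteArrayOutputStream would be replaced by a FileOutputStream opened inside the same try header.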

Example 7: main

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public static void main (String[] args) throws IOException {
	CommandOption.setSummary(Text2Clusterings.class,
			"A tool to convert a list of text files to a Clusterings.");
	CommandOption.process(Text2Clusterings.class, args);

	if (classDirs.value.length == 0) {
		// Opening quote and the space before "list" were missing in the message.
		logger.warning("You must include '--input DIR1 DIR2 ...' in order to specify a "
				+ "list of directories containing the documents for each class.");
		System.exit(-1);
	}

	Clustering[] clusterings = new Clustering[classDirs.value.length];
	int fi = 0;
	for (int i = 0; i < classDirs.value.length; i++) {
		Alphabet fieldAlph = new Alphabet();
		Alphabet valueAlph = new Alphabet();
		File directory = new File(classDirs.value[i]);
		File[] subdirs = getSubDirs(directory);
		Alphabet clusterAlph = new Alphabet();
		InstanceList instances = new InstanceList(new Noop());
		TIntArrayList labels = new TIntArrayList();
		for (int j = 0; j < subdirs.length; j++) {
			ArrayList<File> records = new FileIterator(subdirs[j]).getFileArray();
			int label = clusterAlph.lookupIndex(subdirs[j].toString());
			for (int k = 0; k < records.size(); k++) {
				if (fi % 100 == 0) System.out.print(fi);
				else if (fi % 10 == 0) System.out.print(".");
				if (fi % 1000 == 0 && fi > 0) System.out.println();
				System.out.flush();
				fi++;


				File record = records.get(k);
				labels.add(label);
				instances.add(new Instance(new Record(fieldAlph, valueAlph, parseFile(record)),
											new Integer(label), record.toString(),
											record.toString()));
			}
		}
		clusterings[i] =
				new Clustering(instances, subdirs.length, labels.toArray());
	}

	logger.info("\nread " + fi + " objects in " + clusterings.length + " clusterings.");
	try {
		ObjectOutputStream oos =
				new ObjectOutputStream(new FileOutputStream(outputFile.value));
		oos.writeObject(new Clusterings(clusterings));
		oos.close();
	} catch (Exception e) {
		logger.warning("Exception writing clustering to file " + outputFile.value
										+ " " + e);
		e.printStackTrace();
	}

}

Developer ID: iamxiatian, Project: wikit, Lines of code: 59, Source: Text2Clusterings.java

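Example 8: getInstanceList

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
/**
 * Builds an InstanceList from the files found under the given directory.
 *
 * @param data_dir path of the directory to read files from
 * @return the instances produced by passing each file through getPipe()
 */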
public InstanceList getInstanceList(String data_dir) {
	InstanceList instances = new InstanceList(getPipe());
	instances.addThruPipe(new FileIterator(new File[] { new File(data_dir) }, FileIterator.STARTING_DIRECTORIES, true));
	return instances;
}

Developer ID: hakchul77, Project: irnlp_toolkit, Lines of code: 11, Source: MalletWrapper.java
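Examples 5 and 8 pass `true` as a third constructor argument, which in FileIterator controls whether a prefix shared by the starting directories is removed from the labels. The sketch below only illustrates what stripping a shared leading prefix from directory names could look like. It is not MALLET's code, and CommonPrefixSketch and commonPrefixLength are hypothetical names.

```java
// Hypothetical helper: strip the prefix shared by all directory names.
class CommonPrefixSketch {

    /** Length of the longest prefix shared by all strings (0 for an empty array). */
    static int commonPrefixLength(String[] strings) {
        if (strings.length == 0) {
            return 0;
        }
        int length = strings[0].length();
        for (int i = 1; i < strings.length; i++) {
            length = Math.min(length, strings[i].length());
            for (int j = 0; j < length; j++) {
                if (strings[0].charAt(j) != strings[i].charAt(j)) {
                    length = j; // first mismatch bounds the shared prefix
                    break;
                }
            }
        }
        return length;
    }

    public static void main(String[] args) {
        String[] dirs = { "data/learn/a", "data/learn/b" };
        int cut = commonPrefixLength(dirs);
        for (String dir : dirs) {
            System.out.println(dir.substring(cut)); // prints "a", then "b"
        }
    }
}
```

Under this sketch's definition, a single directory name is entirely "shared", so nothing would remain after the cut. Whether the library behaves the same way for a single starting directory is not something the examples above confirm.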

Example 9: main

import cc.mallet.pipe.iterator.FileIterator; // import the required package/class
public static void main (String[] args) throws IOException {
	CommandOption.setSummary(Text2Clusterings.class,
			"A tool to convert a list of text files to a Clusterings.");
	CommandOption.process(Text2Clusterings.class, args);

	if (classDirs.value.length == 0) {
		// Opening quote and the space before "list" were missing in the message.
		logger.warning("You must include '--input DIR1 DIR2 ...' in order to specify a "
				+ "list of directories containing the documents for each class.");
		System.exit(-1);
	}

	Clustering[] clusterings = new Clustering[classDirs.value.length];
	int fi = 0;
	for (int i = 0; i < classDirs.value.length; i++) {
		Alphabet fieldAlph = new Alphabet();
		Alphabet valueAlph = new Alphabet();
		File directory = new File(classDirs.value[i]);
		File[] subdirs = getSubDirs(directory);
		Alphabet clusterAlph = new Alphabet();
		InstanceList instances = new InstanceList(new Noop());
		IntArrayList labels = new IntArrayList();
		for (int j = 0; j < subdirs.length; j++) {
			ArrayList<File> records = new FileIterator(subdirs[j]).getFileArray();
			int label = clusterAlph.lookupIndex(subdirs[j].toString());
			for (int k = 0; k < records.size(); k++) {
				if (fi % 100 == 0) System.out.print(fi);
				else if (fi % 10 == 0) System.out.print(".");
				if (fi % 1000 == 0 && fi > 0) System.out.println();
				System.out.flush();
				fi++;


				File record = records.get(k);
				labels.add(label);
				instances.add(new Instance(new Record(fieldAlph, valueAlph, parseFile(record)),
											new Integer(label), record.toString(),
											record.toString()));
			}
		}
		clusterings[i] =
				new Clustering(instances, subdirs.length, labels.toArray());
	}

	logger.info("\nread " + fi + " objects in " + clusterings.length + " clusterings.");
	try {
		ObjectOutputStream oos =
				new ObjectOutputStream(new FileOutputStream(outputFile.value));
		oos.writeObject(new Clusterings(clusterings));
		oos.close();
	} catch (Exception e) {
		logger.warning("Exception writing clustering to file " + outputFile.value
										+ " " + e);
		e.printStackTrace();
	}

}

Developer ID: cmoen, Project: mallet, Lines of code: 59, Source: Text2Clusterings.java

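Note: The cc.mallet.pipe.iterator.FileIterator examples in this article were compiled from source-code and documentation hosting platforms such as GitHub and MSDocs. The snippets were selected from open-source projects, and copyright remains with the original authors. For distribution and use, refer to each project's License. Do not reproduce without permission.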
