Java CmdLineUtil类代码示例

OGeek|极客世界-中国程序员成长平台 › 门户 › 编程› Java›Java编程经验

原作者: [db:作者] 来自: [db:来源] 收藏邀请

本文整理汇总了Java中opennlp.tools.cmdline.CmdLineUtil类的典型用法代码示例。如果您正苦于以下问题：Java CmdLineUtil类的具体用法？Java CmdLineUtil怎么用？Java CmdLineUtil使用的例子？那么恭喜您, 这里精选的类代码示例或许可以为您提供帮助。

CmdLineUtil类属于opennlp.tools.cmdline包，在下文中一共展示了CmdLineUtil类的8个代码示例，这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞，您的评价将有助于我们的系统推荐出更棒的Java代码示例。

示例1: run

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
public void run(String[] args) {
  Params params = validateAndParseParams(args, Params.class);

  File dictInFile = params.getInputFile();

  CmdLineUtil.checkInputFile("dictionary input file", dictInFile);
  Path metadataPath = DictionaryMetadata.getExpectedMetadataLocation(dictInFile.toPath());
  CmdLineUtil.checkInputFile("dictionary metadata (.info) input file", metadataPath.toFile());

  MorfologikDictionayBuilder builder = new MorfologikDictionayBuilder();
  try {
    builder.build(dictInFile.toPath(), params.getOverwrite(),
        params.getValidate(), params.getAcceptBOM(), params.getAcceptCR(),
        params.getIgnoreEmpty());
  } catch (Exception e) {
    throw new TerminateToolException(-1,
        "Error while creating Morfologik POS Dictionay: " + e.getMessage(), e);
  }

}

开发者ID:apache，项目名称:opennlp-addons，代码行数:21，代码来源:MorfologikDictionaryBuilderTool.java

示例2: main

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
public static void main(String[] args) {
if (args.length < 2) {
    System.out.println("usage: <input> <output>\n");
    System.exit(0);
}

String input = args[0];
String output = args[1];

TrainingParameters params = new TrainingParameters();
params.put(TrainingParameters.CUTOFF_PARAM, Integer.toString(0));
params.put(TrainingParameters.ITERATIONS_PARAM, Integer.toString(100));
//params.put(TrainingParameters.ALGORITHM_PARAM, NaiveBayesTrainer.NAIVE_BAYES_VALUE);

AgeClassifyModel model;
try {
    model = AgeClassifySparkTrainer.createModel("en", input, 
        "opennlp.tools.tokenize.SentenceTokenizer", "opennlp.tools.tokenize.BagOfWordsTokenizer", params);
} catch (IOException e) {
    throw new TerminateToolException(-1,
        "IO error while reading training data or indexing data: " + e.getMessage(), e);
}
CmdLineUtil.writeModel("age classifier", new File(output), model);
   }

开发者ID:USCDataScience，项目名称:AgePredictor，代码行数:25，代码来源:AgeClassifySparkTrainer.java

示例3: serializeEntityGazetteers

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
public static void serializeEntityGazetteers(Path dictionaryFile)
    throws IOException {
  Map<String, String> dictionary = new HashMap<String, String>();
  InputStream inputStream = CmdLineUtil.openInFile(dictionaryFile.toFile());
  BufferedReader breader = new BufferedReader(
      new InputStreamReader(inputStream, Charset.forName("UTF-8")));
  String line;
  while ((line = breader.readLine()) != null) {
    String[] lineArray = tabPattern.split(line);
    if (lineArray.length == 2) {
      String normalizedToken = dotInsideI.matcher(lineArray[0])
          .replaceAll("i");
      dictionary.put(normalizedToken.toLowerCase(), lineArray[1].intern());
    } else {
      System.err.println(lineArray[0] + " is not well formed!");
    }
  }
  String outputFile = dictionaryFile.toString() + SER_GZ;
  IOUtils.writeClusterToFile(dictionary, outputFile, IOUtils.TAB_DELIMITER);
  breader.close();
}

开发者ID:ragerri，项目名称:ixa-pipe-convert，代码行数:22，代码来源:SerializeResources.java

示例4: serializeLemmaDictionary

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
public static void serializeLemmaDictionary(Path lemmaDict)
    throws IOException {
  Map<List<String>, String> dictMap = new HashMap<List<String>, String>();
  InputStream inputStream = CmdLineUtil.openInFile(lemmaDict.toFile());
  BufferedReader breader = new BufferedReader(
      new InputStreamReader(inputStream, Charset.forName("UTF-8")));
  String line;
  while ((line = breader.readLine()) != null) {
    final String[] elems = tabPattern.split(line);
    if (elems.length == 3) {
      String normalizedToken = dotInsideI.matcher(elems[0]).replaceAll("I");
      dictMap.put(Arrays.asList(normalizedToken, elems[2]), elems[1]);
    } else {
      System.err.println(elems[0] + " is not well formed!");
    }
  }
  String outputFile = lemmaDict.toString() + SER_GZ;
  IOUtils.writeDictionaryLemmatizerToFile(dictMap, outputFile,
      IOUtils.TAB_DELIMITER);
  breader.close();
}

开发者ID:ragerri，项目名称:ixa-pipe-convert，代码行数:22，代码来源:SerializeResources.java

示例5: train

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
/**
 * Main entry point for training.
 * 
 * @throws IOException
 *           throws an exception if errors in the various file inputs.
 */
public final void train() throws IOException {
  // load training parameters file
  final String paramFile = this.parsedArguments.getString("params");
  final TrainingParameters params = InputOutputUtils
      .loadTrainingParameters(paramFile);
  String outModel = null;
  if (params.getSettings().get("OutputModel") == null
      || params.getSettings().get("OutputModel").length() == 0) {
    outModel = Files.getNameWithoutExtension(paramFile) + ".bin";
    params.put("OutputModel", outModel);
  } else {
    outModel = Flags.getModel(params);
  }
  final Trainer chunkerTrainer = new DefaultTrainer(params);
  final ChunkerModel trainedModel = chunkerTrainer.train(params);
  CmdLineUtil.writeModel("ixa-pipe-chunk", new File(outModel), trainedModel);
}

开发者ID:ixa-ehu，项目名称:ixa-pipe-chunk，代码行数:24，代码来源:CLI.java

示例6: openSampleData

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
static ObjectStream<POSSample> openSampleData(String sampleDataName, File sampleDataFile, Charset encoding) {
    CmdLineUtil.checkInputFile(sampleDataName + " Data", sampleDataFile);
    FileInputStream sampleDataIn = CmdLineUtil.openInFile(sampleDataFile);
    ObjectStream<String> lineStream = new PlainTextByLineStream(sampleDataIn.getChannel(), encoding);
    return new WordTagSampleStream(lineStream);
}

开发者ID:radsimu，项目名称:UaicNlpToolkit，代码行数:7，代码来源:POStrainer.java

示例7: train

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
public void train() throws IOException {
    if (languageCode == null) {
        throw new IllegalStateException("languageCode is not provided");
    }
    if (modelOutFile == null) {
        throw new IllegalStateException("model output path is not provided");
    }
    if (trainParams == null) {
        throw new IllegalStateException("training parameters are not set");
    }
    if (sentenceStream == null) {
        throw new IllegalStateException("sentence stream is not configured");
    }
    if (taggerFactory == null) {
        throw new IllegalStateException("tagger factory is not configured");
    }
    Map<String, String> manifestInfoEntries = new HashMap<>();
    BeamSearchContextGenerator<Token> contextGenerator = taggerFactory.getContextGenerator();

    MaxentModel posModel;
    try {
        if (TrainerFactory.TrainerType.EVENT_MODEL_TRAINER.equals(
                TrainerFactory.getTrainerType(trainParams.getSettings()))) {

            ObjectStream<Event> es = new POSTokenEventStream<>(sentenceStream, contextGenerator);
            EventTrainer trainer = TrainerFactory.getEventTrainer(trainParams.getSettings(), manifestInfoEntries);
            posModel = trainer.train(es);
        } else {
            throw new UnsupportedOperationException("Sequence training");
            //POSSampleSequenceStream ss = new POSSampleSequenceStream(samples, contextGenerator);
            // posModel = TrainUtil.train(ss, trainParams.getSettings(), manifestInfoEntries);
        }
    } finally {
        sentenceStream.close();
    }
    POSModel modelAggregate = new POSModel(languageCode,
            posModel, manifestInfoEntries, taggerFactory);
    CmdLineUtil.writeModel("PoS-tagger", modelOutFile, modelAggregate);
}

开发者ID:textocat，项目名称:textokit-core，代码行数:40，代码来源:OpenNLPPosTaggerTrainer.java

示例8: brownCleanUpperCase

import opennlp.tools.cmdline.CmdLineUtil; //导入依赖的package包/类
/**
 * Do not print a sentence if is less than 90% lowercase.
 * 
 * @param sentences
 *          the list of sentences
 * @throws IOException
 */
private static void brownCleanUpperCase(Path inFile) throws IOException {
  StringBuilder precleantext = new StringBuilder();
  InputStream inputStream = CmdLineUtil.openInFile(inFile.toFile());
  BufferedReader breader = new BufferedReader(
      new InputStreamReader(inputStream, Charset.forName("UTF-8")));
  String line;
  while ((line = breader.readLine()) != null) {
    double lowercaseCounter = 0;
    StringBuilder sb = new StringBuilder();
    String[] lineArray = line.split(" ");
    for (String word : lineArray) {
      if (lineArray.length > 0) {
        sb.append(word);
      }
    }
    char[] lineCharArray = sb.toString().toCharArray();
    for (char lineArr : lineCharArray) {
      if (Character.isLowerCase(lineArr)) {
        lowercaseCounter++;
      }
    }
    double percent = lowercaseCounter / (double) lineCharArray.length;
    if (percent >= 0.90) {
      precleantext.append(line).append("\n");
    }
  }
  Path outfile = Files.createFile(Paths.get(inFile.toString() + ".clean"));
  Files.write(outfile,
      precleantext.toString().getBytes(StandardCharsets.UTF_8));
  System.err.println(">> Wrote clean document to " + outfile);
  breader.close();
}

开发者ID:ragerri，项目名称:ixa-pipe-convert，代码行数:40，代码来源:Convert.java

注：本文中的opennlp.tools.cmdline.CmdLineUtil类示例整理自Github/MSDocs等源码及文档管理平台，相关代码片段筛选自各路编程大神贡献的开源项目，源码版权归原作者所有，传播和使用请参考对应项目的License；未经允许，请勿转载。

鲜花

握手

雷人

路过

鸡蛋

该文章已有0人参与评论

请发表评论

全部评论

专题导读

More+

10-27 六六分期app的软件客服如何联系？(六六分期

11-06 可心卡盟:win10系统火狐flash插件崩溃怎么

11-06 亲亲特价:怎么删除回收站图标

11-06 济南大学虚拟社区:鲁大师节能降温的具体办

11-06 xlueops.exe:无线网络安装向导

11-06 女斗合众国:win7系统cf与主机连接不稳定怎

11-06 0xc000022-[cf烟雾头]cf怎么调烟雾头

11-06 qizideyouhuo:应用程序无法正常启动0xc0000

11-06 ipz-185:win7系统vcf文件怎么打开

11-06 傻哥蹦迪:win10系统s4怎么打开usb调试

11-06 八神浩树gtaste:回收站清空了怎么恢复

11-06 妖尾之黑色守护:win10系统电脑没有1440x900

11-06 校园至尊魔王小说:win7系统浏览网页时字体

11-06 女斗合众国:win10系统访问共享文件夹提示请

11-06 tokyo hot n0654:恢复win7系统默认字体一招

11-06 雨酷仙境:设置win7系统转移临时文件夹腾出

11-06 阿穆纳伊之杖:win7系统开始菜单在右边还原

11-06 tunespotting:win10系统火狐flash插件总是

11-06 甘尔葛分析师：计谋网站seo关键词暴涨有什

11-06 蔡贵霖: 计谋网站seo关键词暴涨有什么秘密

11-06 博益网首页:ao3网页版进入不了解决方法

11-06 漏斗子专栏: 网站数据分析小白易懂精华篇

11-06 见证双虹怎么做:win7系统开启telnet命令的

11-06 颾狐蝶蜋:系统资源不足无法完成请求的服务

11-06 国光中学校歌:提交网站到alexa查询详细步骤

11-06 西安有情天:静态网页和动态网页的区别

11-06 红木雅尚斋:外部链接构造对网站的好处

11-06 前官礼遇：防止域名劫持–增强域安全性的10

11-06 密传二转答案: 中文分词算法有哪些

11-06 金泉家园邮编:百度快照劫持的表现及应对方

Java Tinker类代码示例发布时间：2022-05-23

Java StringList类代码示例发布时间：2022-05-23

剪的笔顺,诠释剪的笔画,认识剪的部首

1 六六分期app的软件客服如何联系？(六六分期

六六分期app的软件客服如何联系？不知道吗？加qq群【895510560】即可！标题：六六分期

阅读：18003|2023-10-27

2 可心卡盟:win10系统火狐flash插件崩溃怎么

今天小编告诉大家如何处理win10系统火狐flash插件总是崩溃的问题，可能很多用户都不知

阅读：9584|2022-11-06

3 亲亲特价:怎么删除回收站图标

今天小编告诉大家如何对win10系统删除桌面回收站图标进行设置，可能很多用户都不知道

阅读：8134|2022-11-06

4 济南大学虚拟社区:鲁大师节能降温的具体办

今天小编告诉大家如何对win10系统电脑设置节能降温的设置方法，想必大家都遇到过需要

阅读：8518|2022-11-06

5 xlueops.exe:无线网络安装向导

我们在使用xp系统的过程中,经常需要对xp系统无线网络安装向导设置进行设置，可能很多

阅读：8422|2022-11-06

6 女斗合众国:win7系统cf与主机连接不稳定怎

今天小编告诉大家如何处理win7系统玩cf老是与主机连接不稳定的问题，可能很多用户都不

阅读：9321|2022-11-06

7 0xc000022-[cf烟雾头]cf怎么调烟雾头

电脑对日常生活的重要性小编就不多说了，可是一旦碰到win7系统设置cf烟雾头的问题，很

阅读：8385|2022-11-06

8 qizideyouhuo:应用程序无法正常启动0xc0000

我们在日常使用电脑的时候，有的小伙伴们可能在打开应用的时候会遇见提示应用程序无法

阅读：7818|2022-11-06

9 ipz-185:win7系统vcf文件怎么打开

今天小编告诉大家如何对win7系统打开vcf文件进行设置，可能很多用户都不知道怎么对win

阅读：8373|2022-11-06

10 傻哥蹦迪:win10系统s4怎么打开usb调试

今天小编告诉大家如何对win10系统s4开启USB调试模式进行设置，可能很多用户都不知道怎

阅读：7368|2022-11-06

客服电话

电子邮件

Java CmdLineUtil类代码示例

示例1: run

示例2: main

示例3: serializeEntityGazetteers

示例4: serializeLemmaDictionary

示例5: train

示例6: openSampleData

示例7: train

示例8: brownCleanUpperCase

请发表评论

全部评论

上一篇：

下一篇：

librespeed/speedtest: Self-hosted Speedt

avehtari/BDA_m_demos: Bayesian Data Anal

四维彩超怎么看性别？四维看男孩女孩诀窍

CVE-2022-24659

medfreeman/markdown-it-toc-and-anchor: m

剪的笔顺,诠释剪的笔画,认识剪的部首

六六分期app的软件客服如何联系？(六六分期

florent37/ViewAnimator: A fluent Android

florent37/Shrine-MaterialDesign2: implem

CVE-2020-36276

SimpleSoftwareIO/simple-sms: Send and re

关于我们

产品与服务

解决方案

139-2527-9053