Java MorphoFeatureSpecification Class Code Examples

This article collects typical usage examples of the Java class edu.stanford.nlp.international.morph.MorphoFeatureSpecification. If you are wondering how the class is used in practice, the selected examples below may help.

MorphoFeatureSpecification belongs to the edu.stanford.nlp.international.morph package. Ten code examples of the class are shown below, sorted by popularity by default.

Example 1: transformTree

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
@Override
public Tree transformTree(Tree t, Tree root) {

  String baseCat = t.value();
  StringBuilder newCategory = new StringBuilder();

  //Add manual state splits
  for (Pair<TregexPattern,Function<TregexMatcher,String>> e : activeAnnotations) {
    TregexMatcher m = e.first().matcher(root);
    if (m.matchesAt(t))
      newCategory.append(e.second().apply(m));
  }

  //Add morphosyntactic features if this is a POS tag
  if(t.isPreTerminal() && tagSpec != null) {
    if( !(t.firstChild().label() instanceof CoreLabel) || ((CoreLabel) t.firstChild().label()).originalText() == null )
      throw new RuntimeException(String.format("%s: Term lacks morpho analysis: %s",this.getClass().getName(),t.toString()));

    String morphoStr = ((CoreLabel) t.firstChild().label()).originalText();
    Pair<String,String> lemmaMorph = MorphoFeatureSpecification.splitMorphString("", morphoStr);
    MorphoFeatures feats = tagSpec.strToFeatures(lemmaMorph.second());
    baseCat = feats.getTag(baseCat);
  }

  //Update the label(s)
  String newCat = baseCat + newCategory.toString();
  t.setValue(newCat);
  if (t.isPreTerminal() && t.label() instanceof HasTag)
    ((HasTag) t.label()).setTag(newCat);

  return t;
}

Developer: benblamey, Project: stanford-nlp, Lines of code: 33, Source: FrenchTreebankParserParams.java

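Example 2: tokenToDatums

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
/**
 * Convert token to a sequence of datums and add to iobList.
 *
 * @param iobList list that receives one datum per character of the token
 * @param token the token text to split into characters
 * @param tokType the token type (not used in this excerpt)
 * @param tokenLabel the CoreLabel carrying the word, tag, offsets, and original text
 * @param lastToken the preceding token, consulted by rewrite rule #2
 * @param charIndex running character index passed to createDatum
 * @param applyRewriteRules whether to apply the Arabic-specific rewrite rules
 */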
private static void tokenToDatums(List<CoreLabel> iobList, String token, TokenType tokType,
        CoreLabel tokenLabel, String lastToken, int charIndex, boolean applyRewriteRules) {
    String lastLabel = ContinuationSymbol;
    String firstLabel = BeginSymbol;
    if (applyRewriteRules) {
        // Apply Arabic-specific re-write rules
        String rawToken = tokenLabel.word();
        String tag = tokenLabel.tag();
        MorphoFeatureSpecification featureSpec = new ArabicMorphoFeatureSpecification();
        featureSpec.activate(MorphoFeatureType.NGEN);
        featureSpec.activate(MorphoFeatureType.NNUM);
        MorphoFeatures features = featureSpec.strToFeatures(tag);

        // Rule #1 : ت --> ة
        if (features.getValue(MorphoFeatureType.NGEN).equals("F")
                && features.getValue(MorphoFeatureType.NNUM).equals("SG") && rawToken.endsWith("ت-")) {
            lastLabel = RewriteTahSymbol;
        }

        // Rule #2 : لل --> ل ال
        if (lastToken.equals("ل") && rawToken.startsWith("-ل")) {
            firstLabel = RewriteTareefSymbol;
        }
    }
    int index = tokenLabel.get(CoreAnnotations.CharacterOffsetBeginAnnotation.class);
    String origToken = tokenLabel.get(CoreAnnotations.OriginalTextAnnotation.class);
    // Create datums and add to iobList
    String firstChar = String.valueOf(token.charAt(0));
    iobList.add(createDatum(firstChar, firstLabel, charIndex++, firstChar,
        String.valueOf(origToken.charAt(0)), index++,
        index, tokenLabel.get(CoreAnnotations.BeforeAnnotation.class)));
    final int numChars = token.length();
    for (int j = 1; j < numChars; ++j) {
        String thisChar = String.valueOf(token.charAt(j));
        String charLabel = (j == numChars - 1) ? lastLabel : ContinuationSymbol;
        iobList.add(createDatum(thisChar, charLabel, charIndex++, thisChar, 
            String.valueOf(origToken.charAt(j)),index++, index, ""));
    }
}

Developer: westei, Project: stanbol-stanfordnlp, Lines of code: 50, Source: IOBUtils.java
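
The per-character labeling at the end of tokenToDatums can be read in isolation: the first character gets the "first" label, the final character gets the "last" label, and everything in between gets the continuation label. The following is a minimal sketch of that scheme only, not the IOBUtils implementation. The class name CharLabelSketch, the method labelChars, and the label strings are hypothetical, and the empty-token guard is an addition that the original excerpt does not have.

```java
import java.util.ArrayList;
import java.util.List;

final class CharLabelSketch {
    // Hypothetical label values; the original uses constants such as
    // BeginSymbol and ContinuationSymbol whose values are not shown.
    static final String BEGIN = "BEGIN";
    static final String CONT = "CONT";

    /** Returns one label per character of the token. */
    static List<String> labelChars(String token, String firstLabel, String lastLabel) {
        List<String> labels = new ArrayList<>();
        if (token.isEmpty()) {
            return labels; // guard added for this sketch only
        }
        // The first character always receives the first label.
        labels.add(firstLabel);
        // Remaining characters: continuation, except the final one.
        for (int j = 1; j < token.length(); ++j) {
            labels.add(j == token.length() - 1 ? lastLabel : CONT);
        }
        return labels;
    }

    public static void main(String[] args) {
        // prints [BEGIN, CONT, REWRITE]
        System.out.println(labelChars("abc", BEGIN, "REWRITE"));
    }
}
```

Note that a single-character token receives only the first label, which matches the loop bounds in the original method.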

Example 3: morphFeatureSpec

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
/**
 * Returns a morphological feature specification for words in this language.
 */
@Override
public MorphoFeatureSpecification morphFeatureSpec() {
  return null;
}

Developer: paulirwin, Project: Stanford.NER.Net, Lines of code: 8, Source: AbstractTreebankLanguagePack.java

Example 4: FactoredLexicon

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
public FactoredLexicon(MorphoFeatureSpecification morphoSpec, Index<String> wordIndex, Index<String> tagIndex) {
  super(wordIndex, tagIndex);
  this.morphoSpec = morphoSpec;
}

Developer: benblamey, Project: stanford-nlp, Lines of code: 5, Source: FactoredLexicon.java

Example 5: train

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
/**
 * This method should populate wordIndex, tagIndex, and morphIndex.
 */
@Override 
public void train(Collection<Tree> trees, Collection<Tree> rawTrees) {
  double weight = 1.0;
  // Train uw model on words
  uwModelTrainer.train(trees, weight);
  
  final double numTrees = trees.size();
  Iterator<Tree> rawTreesItr = rawTrees == null ? null : rawTrees.iterator();
  Iterator<Tree> treeItr = trees.iterator();
  
  // Train factored lexicon on lemmas and morph tags
  int treeId = 0;
  while (treeItr.hasNext()) {
    Tree tree = treeItr.next();
    // CoreLabels, with morph analysis in the originalText annotation
    List<Label> yield = rawTrees == null ? tree.yield() : rawTreesItr.next().yield();
    // Annotated, binarized tree for the tags (labels are usually CategoryWordTag)
    List<Label> pretermYield = tree.preTerminalYield();

    int yieldLen = yield.size();
    for (int i = 0; i < yieldLen; ++i) {
      String word = yield.get(i).value();
      int wordId = wordIndex.indexOf(word, true); // Don't do anything with words
      String tag = pretermYield.get(i).value();
      int tagId = tagIndex.indexOf(tag, true);

      // Use the word as backup if there is no lemma
      String featureStr = ((CoreLabel) yield.get(i)).originalText();
      Pair<String,String> lemmaMorph = MorphoFeatureSpecification.splitMorphString(word, featureStr);
      String lemma = lemmaMorph.first();
      int lemmaId = wordIndex.indexOf(lemma, true);
      String richMorphTag = lemmaMorph.second();
      String reducedMorphTag = morphoSpec.strToFeatures(richMorphTag).toString().trim();
      reducedMorphTag = reducedMorphTag.length() == 0 ? NO_MORPH_ANALYSIS : reducedMorphTag;
      int morphId = morphIndex.indexOf(reducedMorphTag, true);
      
      // Seen event counts
      wordTag.incrementCount(wordId, tagId);
      lemmaTag.incrementCount(lemmaId, tagId);
      morphTag.incrementCount(morphId, tagId);
      tagCounter.incrementCount(tagId);
      
      // Unseen event counts
      if (treeId > op.trainOptions.fractionBeforeUnseenCounting*numTrees) {
        if (! wordTag.firstKeySet().contains(wordId) || wordTag.getCounter(wordId).totalCount() < 2) {
          wordTagUnseen.incrementCount(tagId);
        }
        if (! lemmaTag.firstKeySet().contains(lemmaId) || lemmaTag.getCounter(lemmaId).totalCount() < 2) {
          lemmaTagUnseen.incrementCount(tagId);
        }
        if (! morphTag.firstKeySet().contains(morphId) || morphTag.getCounter(morphId).totalCount() < 2) {
          morphTagUnseen.incrementCount(tagId);
        }
      }
    }
    ++treeId;

    if (DEBUG && (treeId % 100) == 0) {
      System.err.printf("[%d]",treeId);
    }
    if (DEBUG && (treeId % 10000) == 0) {
      System.err.println();
    }
  }
}

Developer: benblamey, Project: stanford-nlp, Lines of code: 69, Source: FactoredLexicon.java
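
The "unseen event counts" step in train follows a simple pattern: once training has passed a threshold, any item whose running total is still below 2 contributes a count to the "unseen" tally for its tag. Below is a minimal sketch of that pattern using plain maps in place of the counter classes from the original. The class name UnseenCountSketch, the observe method, and the pastThreshold flag are hypothetical, and only the word counter is modeled.

```java
import java.util.HashMap;
import java.util.Map;

final class UnseenCountSketch {
    // Running total per word, summed over all tags.
    final Map<String, Integer> wordCounts = new HashMap<>();
    // Number of "unseen" events recorded per tag.
    final Map<String, Integer> unseenTagCounts = new HashMap<>();

    /**
     * Records one (word, tag) event. pastThreshold stands in for the
     * treeId > fraction * numTrees check in the original method.
     */
    void observe(String word, String tag, boolean pastThreshold) {
        // Seen event count: increment first, as the original does.
        int total = wordCounts.merge(word, 1, Integer::sum);
        // A word with a total below 2 is treated as unseen for this tag.
        if (pastThreshold && total < 2) {
            unseenTagCounts.merge(tag, 1, Integer::sum);
        }
    }

    public static void main(String[] args) {
        UnseenCountSketch s = new UnseenCountSketch();
        s.observe("le", "DET", false);  // before the threshold: never unseen
        s.observe("le", "DET", true);   // total is 2: not unseen
        s.observe("chat", "N", true);   // total is 1: counted as unseen
        System.out.println(s.unseenTagCounts);
    }
}
```

Because the seen count is incremented before the check, an item is counted as unseen exactly when its first occurrence falls after the threshold.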

Example 6: morphFeatureSpec

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
@Override
public MorphoFeatureSpecification morphFeatureSpec() {
  return new ArabicMorphoFeatureSpecification();
}

Developer: benblamey, Project: stanford-nlp, Lines of code: 5, Source: ArabicTreebankLanguagePack.java

Example 7: normalizeWholeTree

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
@Override
public Tree normalizeWholeTree(Tree tree, TreeFactory tf) {
  tree = tree.prune(emptyFilter, tf).spliceOut(aOverAFilter, tf);

  for(Tree t : tree) {
    //Map punctuation tags back like the PTB
    if(t.isPreTerminal()) {
      String posStr = normalizePreterminal(t);
      t.setValue(posStr);
      if(t.label() instanceof HasTag) ((HasTag) t.label()).setTag(posStr);

    } else if(t.isLeaf()) {
      //Strip off morphological analyses and place them in the OriginalTextAnnotation, which is
      //specified by HasContext.
      if(t.value().contains(MorphoFeatureSpecification.MORPHO_MARK)) {
        String[] toks = t.value().split(MorphoFeatureSpecification.MORPHO_MARK);
        if(toks.length != 2)
          System.err.printf("%s: Word contains malformed morph annotation: %s%n",this.getClass().getName(),t.value());

        else if(t.label() instanceof CoreLabel) {
          ((CoreLabel) t.label()).setValue(toks[0].trim().intern());
          ((CoreLabel) t.label()).setWord(toks[0].trim().intern());
          ((CoreLabel) t.label()).setOriginalText(toks[1].trim().intern());
        } else {
          System.err.printf("%s: Cannot store morph analysis in non-CoreLabel: %s%n",this.getClass().getName(),t.label().getClass().getName());
        }
      }
    }
  }

  //Add start symbol so that the root has only one sub-state. Escape any enclosing brackets.
  //If the "tree" consists entirely of enclosing brackets e.g. ((())) then this method
  //will return null. In this case, readers e.g. PennTreeReader will try to read the next tree.
  while(tree != null && (tree.value() == null || tree.value().equals("")) && tree.numChildren() <= 1)
    tree = tree.firstChild();

  //Ensure that the tree has a top-level unary rewrite
  if(tree != null && !tree.value().equals(rootLabel))
    tree = tf.newTreeNode(rootLabel, Collections.singletonList(tree));

  return tree;
}

Developer: benblamey, Project: stanford-nlp, Lines of code: 43, Source: FrenchTreeNormalizer.java
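
The leaf-handling branch in normalizeWholeTree splits a leaf value on a marker and accepts the result only when exactly two parts come back. The following self-contained sketch shows that check on plain strings. The delimiter value, the class name MorphMarkSplitSketch, and the method splitLeaf are assumptions made for illustration; real code should use MorphoFeatureSpecification.MORPHO_MARK rather than a hard-coded string. Pattern.quote is also an addition here: String.split takes a regular expression, so quoting keeps the split literal whatever the delimiter contains.

```java
import java.util.regex.Pattern;

final class MorphMarkSplitSketch {
    // Hypothetical delimiter for illustration only.
    static final String MARK = "~#~";

    /**
     * Returns {word, analysis}, or null when the value does not split
     * into exactly two parts around MARK.
     */
    static String[] splitLeaf(String value) {
        // Quote the delimiter so it is matched literally, not as a regex.
        String[] toks = value.split(Pattern.quote(MARK));
        if (toks.length != 2) {
            return null; // malformed or missing annotation
        }
        return new String[] { toks[0].trim(), toks[1].trim() };
    }

    public static void main(String[] args) {
        String[] parts = splitLeaf("chats~#~N-mp");
        System.out.println(parts[0] + " / " + parts[1]);
    }
}
```

Returning null for the malformed case is a choice made for this sketch; the original method logs a message to System.err and leaves the leaf unchanged.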

Example 8: morphFeatureSpec

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
@Override
public MorphoFeatureSpecification morphFeatureSpec() {
  return new FrenchMorphoFeatureSpecification();
}

Developer: benblamey, Project: stanford-nlp, Lines of code: 5, Source: FrenchTreebankLanguagePack.java

Example 9: morphFeatureSpec

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
/**
 * Returns a morphological feature specification for words in this language.
 */
public MorphoFeatureSpecification morphFeatureSpec() {
  return null;
}

Developer: benblamey, Project: stanford-nlp, Lines of code: 7, Source: AbstractTreebankLanguagePack.java

Example 10: morphFeatureSpec

import edu.stanford.nlp.international.morph.MorphoFeatureSpecification; // import the required package/class
/**
 * The morphological feature specification for the language.
 *
 * @return A language-specific MorphoFeatureSpecification
 */
public abstract MorphoFeatureSpecification morphFeatureSpec();

Developer: paulirwin, Project: Stanford.NER.Net, Lines of code: 7, Source: TreebankLanguagePack.java

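Note: The edu.stanford.nlp.international.morph.MorphoFeatureSpecification examples in this article were collected from source-code and documentation hosting platforms such as GitHub and MSDocs. The snippets were selected from open-source projects, and copyright belongs to the original authors. For distribution and use, refer to each project's License. Do not reproduce without permission.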
