public class ArabicTreebankParserParams extends AbstractTreebankParserParams

A TreebankLangParserParams implementing class for
the Penn Arabic Treebank. The baseline feature set works with either
UTF-8 or Buckwalter input, although the behavior of some unused features depends
on the input encoding.

| Modifier and Type | Class and Description |
|---|---|
| protected class | ArabicTreebankParserParams.ArabicSubcategoryStripper |
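
The class description mentions two input encodings. Buckwalter is an ASCII transliteration of Arabic script, so the same token looks quite different depending on the encoding. The sketch below is only an illustration of that difference: `BuckwalterSketch` is a hypothetical helper that covers six letters, and it is not the converter the parser uses.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper for illustration; not part of the parser API.
class BuckwalterSketch {
  // Partial Buckwalter-to-Unicode table (six letters only).
  private static final Map<Character, Character> TO_UNICODE = new HashMap<>();
  static {
    TO_UNICODE.put('A', '\u0627'); // alif
    TO_UNICODE.put('b', '\u0628'); // ba
    TO_UNICODE.put('t', '\u062A'); // ta
    TO_UNICODE.put('k', '\u0643'); // kaf
    TO_UNICODE.put('l', '\u0644'); // lam
    TO_UNICODE.put('m', '\u0645'); // mim
  }

  // Maps each known Buckwalter character to its Arabic letter;
  // characters outside the partial table pass through unchanged.
  static String toUnicode(String buckwalter) {
    StringBuilder sb = new StringBuilder(buckwalter.length());
    for (int i = 0; i < buckwalter.length(); i++) {
      char c = buckwalter.charAt(i);
      Character mapped = TO_UNICODE.get(c);
      sb.append(mapped != null ? mapped.charValue() : c);
    }
    return sb.toString();
  }

  public static void main(String[] args) {
    String word = toUnicode("ktAb"); // "book" in Buckwalter transliteration
    for (char c : word.toCharArray()) {
      System.out.printf("U+%04X ", (int) c); // print code points to avoid console encoding issues
    }
    System.out.println();
  }
}
```

Running `main` prints the four code points U+0643 U+062A U+0627 U+0628, which is the UTF-8 form a parser configured for Unicode input would expect to see instead of `ktAb`.
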

Nested classes inherited from class AbstractTreebankParserParams:
AbstractTreebankParserParams.AnnotatePunctuationFunction, AbstractTreebankParserParams.RemoveGFSubcategoryStripper, AbstractTreebankParserParams.SubcategoryStripper

Fields inherited from class AbstractTreebankParserParams:
evalGF, generateOriginalDependencies, inputEncoding, outputEncoding, tlp

| Constructor and Description |
|---|
| ArabicTreebankParserParams() |

| Modifier and Type | Method and Description |
|---|---|
| TreeTransformer | collinizer() - The collinizer eliminates punctuation. |
| TreeTransformer | collinizerEvalb() - Stand-in collinizer does nothing to the tree. |
| java.util.List<? extends HasWord> | defaultTestSentence() - Return a default sentence for the language (for testing). |
| DiskTreebank | diskTreebank() - Returns a DiskTreebank appropriate to the treebank source. |
| void | display() - Display language-specific settings. |
| HeadFinder | headFinder() - The HeadFinder to use for your treebank. |
| Lexicon | lex(Options op, Index<java.lang.String> wordIndex, Index<java.lang.String> tagIndex) - Returns a lexicon for Arabic. |
| static void | main(java.lang.String[] args) |
| MemoryTreebank | memoryTreebank() - Returns a MemoryTreebank appropriate to the treebank source. |
| int | setOptionFlag(java.lang.String[] args, int i) - Some options for setOptionFlag: |
| java.lang.String[] | sisterSplitters() - Returns the splitting strings used for selective splits. |
| TreeTransformer | subcategoryStripper() - Returns a TreeTransformer that retains categories according to the following options supported by setOptionFlag: |
| Tree | transformTree(Tree t, Tree root) - This method does language-specific tree transformations such as annotating particular nodes with language-relevant features. |
| TreeReaderFactory | treeReaderFactory() - Creates an ArabicTreeReaderFactory with parameters set via options passed in from the command line. |
| HeadFinder | typedDependencyHeadFinder() - The HeadFinder to use when extracting typed dependencies. |
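
The method summary above says that the collinizer eliminates punctuation. A minimal sketch of what such a transformation might look like follows. Everything in it is an assumption made for illustration: `Node` is a hypothetical stand-in for the parser's own Tree type, the punctuation tag is passed in by the caller (the example uses `PUNC`), and pruning constituents that become empty is a design choice of this sketch, not documented behavior.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration; not the library's collinizer.
class CollinizerSketch {
  // Minimal tree node: a label plus ordered children (leaves are words).
  static final class Node {
    final String label;
    final List<Node> children = new ArrayList<>();

    Node(String label, Node... kids) {
      this.label = label;
      for (Node k : kids) {
        children.add(k);
      }
    }

    boolean isLeaf() {
      return children.isEmpty();
    }

    @Override
    public String toString() {
      if (isLeaf()) {
        return label;
      }
      StringBuilder sb = new StringBuilder("(").append(label);
      for (Node c : children) {
        sb.append(' ').append(c);
      }
      return sb.append(')').toString();
    }
  }

  // Returns a copy of t without subtrees labeled puncTag.
  // Returns null when nothing is left under t.
  static Node stripPunctuation(Node t, String puncTag) {
    if (t.isLeaf()) {
      return new Node(t.label);
    }
    if (t.label.equals(puncTag)) {
      return null;
    }
    Node copy = new Node(t.label);
    for (Node c : t.children) {
      Node kept = stripPunctuation(c, puncTag);
      if (kept != null) {
        copy.children.add(kept);
      }
    }
    return copy.children.isEmpty() ? null : copy;
  }

  public static void main(String[] args) {
    Node tree = new Node("S",
        new Node("NP", new Node("NN", new Node("ktAb"))),
        new Node("PUNC", new Node(".")));
    System.out.println(stripPunctuation(tree, "PUNC"));
  }
}
```

The sketch copies the tree rather than editing it in place, so the input can still be compared against the result during evaluation.
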

Methods inherited from class AbstractTreebankParserParams:
defaultCoreNLPFlags, dependencyGrammarExtractor, dependencyObjectify, generateOriginalDependencies, getGrammaticalStructure, getInputEncoding, getOutputEncoding, isEvalGF, MLEDependencyGrammarSmoothingParams, parsevalObjectify, parsevalObjectify, ppAttachmentEval, processHeadWord, pw, pw, readGrammaticalStructureFromFile, setEvalGF, setEvaluateGrammaticalFunctions, setGenerateOriginalDependencies, setInputEncoding, setOutputEncoding, supportsBasicDependencies, testMemoryTreebank, treebank, treebankLanguagePack, treeTokenizerFactory, typedDependencyClasser, typedDependencyObjectify, unorderedTypedDependencyObjectify, unorderedUntypedDependencyObjectify, untypedDependencyObjectify

public TreeReaderFactory treeReaderFactory()
Creates an ArabicTreeReaderFactory with parameters set
via options passed in from the command line.
Returns: an ArabicTreeReaderFactory

public MemoryTreebank memoryTreebank()
Description copied from class: AbstractTreebankParserParams
Returns a MemoryTreebank appropriate to the treebank source.
memoryTreebank in interface TreebankLangParserParams
memoryTreebank in class AbstractTreebankParserParams

public DiskTreebank diskTreebank()
Description copied from class: AbstractTreebankParserParams
Returns a DiskTreebank appropriate to the treebank source.
diskTreebank in interface TreebankLangParserParams
diskTreebank in class AbstractTreebankParserParams

public HeadFinder headFinder()
Description copied from class: AbstractTreebankParserParams
The HeadFinder to use for your treebank.
headFinder in interface TreebankLangParserParams
headFinder in class AbstractTreebankParserParams

public HeadFinder typedDependencyHeadFinder()
Description copied from class: AbstractTreebankParserParams
The HeadFinder to use when extracting typed dependencies.
typedDependencyHeadFinder in interface TreebankLangParserParams
typedDependencyHeadFinder in class AbstractTreebankParserParams

public Lexicon lex(Options op, Index<java.lang.String> wordIndex, Index<java.lang.String> tagIndex)
Returns a lexicon for Arabic.
lex in interface TreebankLangParserParams
lex in class AbstractTreebankParserParams
Parameters:
op - Lexicon options

public java.util.List<? extends HasWord> defaultTestSentence()

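public TreeTransformer subcategoryStripper()
Returns a TreeTransformer that retains categories
according to the following options supported by setOptionFlag:
-retainNPTmp Retain temporal NP marking on NPs.
-retainNPSbj Retain NP subject function tags.
-markPRDverbs Retain PRD verbs.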
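
To make the effect of the retain options concrete, here is a minimal sketch of label stripping under two of them. It is an assumption-laden illustration, not the behavior of ArabicSubcategoryStripper: `StripperSketch` is a hypothetical class, it works on label strings rather than trees, it assumes functional tags follow the first hyphen (as in `NP-SBJ`), and it leaves out -markPRDverbs entirely.

```java
// Hypothetical illustration; only the option names come from the list above.
class StripperSketch {
  boolean retainNPTmp = false; // mirrors -retainNPTmp
  boolean retainNPSbj = false; // mirrors -retainNPSbj

  // Reduces an annotated label to its basic category, keeping the
  // NP-TMP or NP-SBJ marking when the matching option is switched on.
  String strip(String label) {
    if (retainNPTmp && label.startsWith("NP-TMP")) {
      return "NP-TMP";
    }
    if (retainNPSbj && label.startsWith("NP-SBJ")) {
      return "NP-SBJ";
    }
    int dash = label.indexOf('-');
    // A leading hyphen (e.g. "-NONE-") is part of the category itself.
    return dash > 0 ? label.substring(0, dash) : label;
  }

  public static void main(String[] args) {
    StripperSketch s = new StripperSketch();
    System.out.println(s.strip("NP-SBJ")); // NP
    s.retainNPSbj = true;
    System.out.println(s.strip("NP-SBJ")); // NP-SBJ
    System.out.println(s.strip("PP-LOC")); // PP
  }
}
```
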
subcategoryStripper in interface TreebankLangParserParams
subcategoryStripper in class AbstractTreebankParserParams

public TreeTransformer collinizer()
The collinizer eliminates punctuation.
collinizer in interface TreebankLangParserParams
collinizer in class AbstractTreebankParserParams

public TreeTransformer collinizerEvalb()
Stand-in collinizer does nothing to the tree.
collinizerEvalb in interface TreebankLangParserParams
collinizerEvalb in class AbstractTreebankParserParams

public java.lang.String[] sisterSplitters()
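Description copied from class: AbstractTreebankParserParams
Returns the splitting strings used for selective splits.
sisterSplitters in interface TreebankLangParserParams
sisterSplitters in class AbstractTreebankParserParams

public Tree transformTree(Tree t, Tree root)
Description copied from class: AbstractTreebankParserParams
This method does language-specific tree transformations such
as annotating particular nodes with language-relevant features.
It changes both labels and the tree shape of the input tree t.
transformTree in interface TreebankLangParserParams
transformTree in class AbstractTreebankParserParams
Parameters:
t - The input tree (with non-language-specific annotation already
done, so you need to strip back to basic categories)
root - The root of the current tree (can be null for words)

public void display()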
Description copied from class: AbstractTreebankParserParams
Display language-specific settings.
display in interface TreebankLangParserParams
display in class AbstractTreebankParserParams

public int setOptionFlag(java.lang.String[] args, int i)
Some options for setOptionFlag:
-retainNPTmp Retain temporal NP marking on NPs.
-retainNPSbj Retain NP subject function tags.
-markGappedVP Mark gapped VPs.
-collinizerRetainsPunctuation The collinizer retains punctuation.
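
The signature takes an argument array and a start index, which suggests an index-advancing style of flag parsing. The sketch below shows one way a caller might drive such a method. It rests on an assumption that is not stated in this section: that the method returns the index of the next unprocessed argument when it recognizes the flag at `args[i]`, and returns `i` unchanged otherwise. `OptionFlagSketch` and `parseAll` are hypothetical names; only the four flag names come from the list above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for illustration; not the class documented here.
class OptionFlagSketch {
  boolean retainNPTmp;
  boolean retainNPSbj;
  boolean markGappedVP;
  boolean collinizerRetainsPunctuation;

  // Assumed contract: consume the flag at args[i] if it is known and
  // return the index of the next unprocessed argument; otherwise return i.
  int setOptionFlag(String[] args, int i) {
    switch (args[i]) {
      case "-retainNPTmp":
        retainNPTmp = true;
        return i + 1;
      case "-retainNPSbj":
        retainNPSbj = true;
        return i + 1;
      case "-markGappedVP":
        markGappedVP = true;
        return i + 1;
      case "-collinizerRetainsPunctuation":
        collinizerRetainsPunctuation = true;
        return i + 1;
      default:
        return i;
    }
  }

  // Feeds every argument through setOptionFlag and collects the unknown ones.
  List<String> parseAll(String[] args) {
    List<String> unknown = new ArrayList<>();
    int i = 0;
    while (i < args.length) {
      int next = setOptionFlag(args, i);
      if (next == i) { // index did not advance, so the flag was not recognized
        unknown.add(args[i]);
        next = i + 1;
      }
      i = next;
    }
    return unknown;
  }

  public static void main(String[] args) {
    OptionFlagSketch p = new OptionFlagSketch();
    List<String> unknown = p.parseAll(new String[] {"-retainNPSbj", "-markGappedVP", "-other"});
    System.out.println(p.retainNPSbj + " " + p.markGappedVP + " " + unknown);
  }
}
```

Returning the index rather than a boolean lets a flag consume a variable number of following arguments, and lets the caller detect an unrecognized flag by checking whether the index moved.
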
setOptionFlag in interface TreebankLangParserParams
setOptionFlag in class AbstractTreebankParserParams
Parameters:
args - flag arguments (usually from the command line)
i - index at which to begin argument processing

public static void main(java.lang.String[] args)
Parameters:
args -