Professional Documents
Culture Documents
Parsing
Morphological Parsing (Word Parsing) MorphologicalParsing(WordParsing) WordsanditsgrammaticalMeaning SyntacticParsing SentenceanditsgrammaticalMeaning
TamilMorphology(aboutsuffixes)
Suffixes for Noun SuffixesforNoun Plural,Case,Postposition,clitics Suffixesforverb Tense,PNG,aspectual,models,clitics
TamilMorphophonemicRules(Sandhi)
Addition Addition
+ =
D l i Deletion
+ =
Substitution
+ =
TamilMorphotactics
Noun Root+PL+Case+Postposition+clitics
Computationalfromalism(RE,FSA)
Regularexpression(RE) Regularexpressionisthestandardnotationfor characterizingstrings(combinationofcharacters).Itis aformulainaspeciallanguageforspecifyingsimple a formula in a special language for specifying simple classesofstrings.Formallyitisanalgebraicnotation forcharacterizingstrings.Regularexpressionwas introducedbyKleene(1956).Astringisanysequence introduced by Kleene (1956) A string is any sequence ofcharacterslikeletters,numbers,spaces,tabs,and punctuationspacewhichisalsoacharacterbecauseit hasencodingvalue.Regularexpressionneedsapattern has encoding value Regular expression needs a pattern (searchtype)tosearchstrings. avaNpuththakam patiththaaN ; /puththakam/(book)
5 2 Finite state automaton (FSA) 5.2.Finitestateautomaton(FSA) Finitestateautomatonisamathematical deviceusedtoimplementregularexpressions. device used to implement regular expressions Finitestateautomataarethetheoretical foundationofagooddealofthe foundation of a good deal of the computationalwork.Anyfinitestate automatoncanbedescribedwiththeRegular automaton can be described with the Regular Expression.
Similaritiesbetweenthesuffixesandpartofthesuffixes
Rankingofthesuffixes
1. 1 2.
Similaritiesbetweensuffixesandpartoftheroots.
1 maraththai (treeAcc) 1.maraththai(tree Acc) Stem:marathth Stem:mara 2.vaaththai (duckAcc) Stem:vaathth Stem:vaa Stem : vaa Compareremainingsyllables
MoreExistenceofglides,sandhi,filler g , ,
Theru ai otti ee (Glide) ee (nearthestreet) avaNaippaRRiththaaN(Sandhi)
maraththiNai (Filler) (treeAcc)
v y y
Lackofvocabulary suNaami()
Stems
Input p 1.maNnNnai stem deletion (soilAcc)maNnNnNn lasttwocharacter
2.eNNai (IAcc) eNN(N)lastonecharacter 3.pallai (toothAcc)pall(l) lastonecharacter 4.ceyyaamal (withoutdoing)ceYY(Y) lastone character
Ambiguity
Input:
than
neythaaN ney+th+aaNney+ neythaaN ney + th + aaN ney +
()
()
(Wovecloth He)
(itisghee)
Ambiguity
avarkaLootu iNnaiya maaNaatu ceNRaaN. avarkaLootuiNnaiyamaaNaatuceNRaaN
Contextknowledgewillplayvitalrole.
THANKYOU