Content
AnglaBharti System
Problem Statement
Points to Consider for Testing
Features of AnglaBharti Testing Interface (ATI)
User Interface
Conclusion
Future Work
AnglaBharti System
It is a machine-aided translation system designed for translating English into Indian languages. It follows a rule-based approach: a pattern-directed method that uses context-free-grammar-like structures to parse English sentences. It has about 1750 rules and 54000 lexical words, organized into 46 to 58 paradigm types. A number of semantic tags are used to resolve sense ambiguity in the source language.
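The pattern-directed idea described above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the three-word lexicon, the POS tags, the romanized glosses, and the two reordering rules are hypothetical and are not taken from AnglaBharti's actual rule base or lexicon format.

```python
# Hypothetical toy lexicon: source word -> (POS tag, target gloss).
# These entries are invented for illustration only.
LEXICON = {
    "ram": ("N", "raam"),
    "eats": ("V", "khaataa hai"),
    "mango": ("N", "aam"),
}

# Hypothetical rules: a POS pattern on the source side, and the order in
# which the matched positions are emitted on the target side.
# The first rule reorders subject-verb-object into subject-object-verb.
RULES = [
    (("N", "V", "N"), (0, 2, 1)),
    (("N", "V"), (0, 1)),
]


def translate(sentence):
    """Return a reordered gloss string, or None if no rule or word matches."""
    words = sentence.lower().split()
    entries = [LEXICON.get(w) for w in words]
    if any(e is None for e in entries):
        return None  # unknown word: a real system would log this
    tags = tuple(pos for pos, _ in entries)
    for pattern, order in RULES:
        if tags == pattern:
            return " ".join(entries[i][1] for i in order)
    return None  # no pattern matched the tag sequence


print(translate("Ram eats mango"))
```

A real rule base of about 1750 rules would also need semantic tags and paradigm-driven morphology, which this sketch leaves out entirely.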
Problem Statement
Manual testing of a rule-based machine translation (RBMT) system is a lengthy and complex process. It also requires low-level knowledge of the system, takes much time, and is prone to human error.
Points to Consider for Testing
entries.
Correctness of the respective POS information of the source and object words in the lexical resources.
Correctness of the paradigm number assigned.
of word and group of words.
Proper generation of PLIL matching the structure of the input sentence.
Appropriate selection of the meaning of a word, out of the multiple meanings available in the lexicon resource, with respect to the position and characteristics of the word in the source sentence.
Proper handling of phrasals, multiwords, repetitive words and words of different language origin in the input sentence.
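The lexical checks listed above (POS information and paradigm number) lend themselves to automation. The following is a minimal sketch under stated assumptions: the `LexEntry` fields, the `ALLOWED_POS` tag set, and the `KNOWN_PARADIGMS` table are all hypothetical and do not reflect the real AnglaBharti lexicon layout or paradigm numbering.

```python
from dataclasses import dataclass

# Assumed tag set and paradigm table, invented for this example.
ALLOWED_POS = {"N", "V", "ADJ", "ADV", "PRON"}
KNOWN_PARADIGMS = {"N": {1, 2, 3}, "V": {10, 11}}


@dataclass
class LexEntry:
    """Hypothetical shape of one lexicon entry."""
    source: str
    source_pos: str
    target: str
    target_pos: str
    paradigm: int


def check_entry(entry):
    """Return a list of problem descriptions; an empty list means no problem found."""
    problems = []
    if entry.source_pos not in ALLOWED_POS:
        problems.append(f"unknown source POS: {entry.source_pos}")
    if entry.target_pos not in ALLOWED_POS:
        problems.append(f"unknown target POS: {entry.target_pos}")
    # A paradigm number is accepted only if it is listed for the target POS.
    if entry.paradigm not in KNOWN_PARADIGMS.get(entry.target_pos, set()):
        problems.append(
            f"paradigm {entry.paradigm} not valid for POS {entry.target_pos}"
        )
    return problems


good = LexEntry("boy", "N", "ladakaa", "N", 1)
bad = LexEntry("eat", "V", "khaa", "V", 3)
print(check_entry(good))
print(check_entry(bad))
```

Returning a list of problems rather than a single boolean lets a tester see every issue with an entry in one pass, which is closer to what a log pane would display.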
Lex Resources
  Modification of Lexicon
  Compilation of Lexicon
  Online Lex
  Display of Lexicon
PLIL (Pseudo lingua for Indian languages)
TempGen
  Noun pl
  Verb pl
  Adverb pl
  Adjective pl
  Pronoun pl
  Sadjnoun pl
Input/Output of Module
  Preprocessor
  Example Base
  Morph
  Rule Base
  Text Generator
Others
  Process Log
  Parse Tree
  Sentence log
ATI
Input
Output
Output Format
Log Pane
Main lex
It displays the root word of the lexicon.
It contains the grammatical information of the root word.
GEN.IN
Parse Tree
Conclusion
ATI has been tested on 3000 sentences for the AnglaUrdu and AnglaPunjabi systems. The results show that ATI is effective, efficient and user-friendly for testing. It saves the tester's time, provides a scientific platform for testing, and removes ambiguity in the testing method itself. The overhead on the AnglaBharati system during operation is negligible.
Future Work
The system can be further generalized to test any RBMT system for any language pair. It can also be extended to create linguistic resources such as dictionaries, synonym lists, terminology and multiword entries.
Thank You