
Testing of AnglaBharti System

Anil Kumar, Rahul Kumar Gautam

Content
AnglaBharti System
Problem Statement
Points Considered for Testing
Features of AnglaBharti Testing Interface (ATI)
User Interface
Conclusion
Future Work

AnglaBharti System
It is a machine-aided translation system designed for translating English into Indian languages.
It is based on the rule-based approach.
It uses a pattern-directed approach, with context-free-grammar-like structures, to parse English sentences.
It has about 1,750 rules and 54,000 lexical words, grouped into 46 to 58 paradigm types.
A number of semantic tags are used to resolve sense ambiguity in the source language.
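As a rough illustration of the pattern-directed idea, the sketch below tags the words of an English sentence, matches the tag sequence against a small rule table, and reorders the matched words into a verb-final order. The lexicon, tag names, rule format, and reordering are all hypothetical; they are not AnglaBharti's actual rules or data structures.

```python
# Toy pattern-directed matcher. The lexicon, tags, and rules are hypothetical.
LEXICON = {
    "ram": "NOUN",
    "eats": "VERB",
    "mango": "NOUN",
    "a": "DET",
}

# Each rule pairs a sequence of POS tags with the target-side order of the
# matched positions (here: subject, object, verb).
RULES = [
    (("NOUN", "VERB", "DET", "NOUN"), (0, 3, 1)),  # the article is dropped
    (("NOUN", "VERB", "NOUN"), (0, 2, 1)),
]


def tag(sentence):
    """Return (word, tag) pairs; unknown words get the tag 'UNK'."""
    words = sentence.lower().rstrip(".").split()
    return [(w, LEXICON.get(w, "UNK")) for w in words]


def apply_rules(sentence):
    """Return the reordered source words, or None if no rule matches."""
    tagged = tag(sentence)
    tags = tuple(t for _, t in tagged)
    for pattern, order in RULES:
        if tags == pattern:
            return [tagged[i][0] for i in order]
    return None
```

A real rule base would also carry the semantic tags and paradigm information mentioned above; this sketch only shows the matching step.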

Modules of AnglaBharti System


Preprocessor & Postprocessor
Example Base
Morph
Lexicon
Rule Base
Text Generator

Problem Statement
Manual testing of a rule-based machine translation (RBMT) system is a lengthy and complex process.
It requires low-level knowledge of the system.
It takes a lot of time.
It is prone to human error.

Points Considered for Testing


RBMT testing is usually done with respect to the following points:
Correctness of lexical words and their meaning entries.
Correctness of the POS information of the source and target (object) words in the lexical resources.
Correctness of the assigned paradigm number.
Formulation of the correct PLIL, matching the characteristics of a word or group of words.
Proper generation of PLIL matching the structure of the input sentence.
Appropriate selection of a word's meaning, out of the multiple meanings available in the lexicon, with respect to the position and characteristics of the word in the source sentence.
Proper handling of phrasals, multiwords, repetitive words, and words of different language origin in the input sentence.
Proper handling of nouns and pronouns.
Proper handling of transliteration for places, person names, etc.
Appropriate resolution of the context of words in the source sentence.
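The first three points (meaning entries, POS information, paradigm number) lend themselves to automatic checks. The sketch below shows one possible shape for such a check. The `LexEntry` fields, the POS tag set, and the paradigm upper bound of 58 are assumptions made for illustration; the actual lexicon format is not described here.

```python
from dataclasses import dataclass

# Assumed tag set and paradigm range; the real lexicon may differ.
VALID_POS = {"noun", "verb", "adjective", "adverb", "pronoun"}
MAX_PARADIGM = 58


@dataclass
class LexEntry:
    """Hypothetical lexicon entry with only the fields checked below."""
    word: str
    pos: str
    meaning: str
    paradigm: int


def check_entry(entry):
    """Return a list of problems found in one entry (empty if none)."""
    problems = []
    if not entry.meaning.strip():
        problems.append("missing meaning")
    if entry.pos not in VALID_POS:
        problems.append(f"unknown POS: {entry.pos}")
    if not 1 <= entry.paradigm <= MAX_PARADIGM:
        problems.append(f"paradigm out of range: {entry.paradigm}")
    return problems
```

Checks of this kind only catch structural errors; whether a meaning is the right one for a given context still needs a human reviewer or a reference translation.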

Features of AnglaBharti Testing Interface (ATI)


Lex Resources
PLIL
Input/Output of Module
Others

Lex Resources
Modification of Lexicon
Compilation of Lexicon
Online Lex
Display of Lexicon

PLIL
(Pseudo Lingua for Indian Languages)
TempGen
Noun pl
Verb pl
Adverb pl
Adjective pl
Pronoun pl
Sadjnoun pl

Input/Output of Module
Preprocessor
Example Base
Morph
Rule Base
Text Generator

Others
Process Log
Parse Tree
Sentence log

Sentence log format:
***                  // sentence separator
I am fine.           // input sentence
[output]             // output
4                    // number of translations
2020 Elapsed Time    // elapsed time in milliseconds
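A log with the fields listed above (separator, input sentence, output, number of translations, elapsed time) is easy to read back for batch analysis. The sketch below is one way to do it. It assumes each field sits on its own line, that the output is a single line, and that the two numeric fields start with an integer; the `LogRecord` type and the function name are hypothetical.

```python
from dataclasses import dataclass

SEPARATOR = "***"  # sentence separator used in the log


@dataclass
class LogRecord:
    source: str        # input sentence
    output: str        # translated output line
    translations: int  # number of translations
    elapsed_ms: int    # elapsed time in milliseconds


def parse_sentence_log(text):
    """Split the log on the separator and parse each four-line record."""
    records = []
    for chunk in text.split(SEPARATOR):
        lines = [ln.strip() for ln in chunk.splitlines() if ln.strip()]
        if len(lines) != 4:
            continue  # skip empty or malformed chunks
        source, output, count, elapsed = lines
        # Take the leading token so a trailing label such as
        # "2020 Elapsed Time" is accepted as well as a bare "2020".
        records.append(
            LogRecord(source, output, int(count.split()[0]), int(elapsed.split()[0]))
        )
    return records
```

With the records in hand, a tester could, for example, sum `elapsed_ms` over a test set or list the sentences that produced more than one translation.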

ATI User Interface
Main window areas: Input, Output, Output Format, Log Pane
Lex Resources: Online Lex, Display of Lexicon, Compile Lex
PLIL: TempGen, Noun pl, Verb pl, Pronoun pl, Adjective pl, Adverb pl, Sadjnoun pl
Input/Output of Module: Preprocessor, Example Base, Morph, Rule Base, Text Generator

Main lex
Displays the root words of the lexicon.
Contains the grammatical information of each root word.
Search for a root word in the lexicon.
Update a value in the lexicon.

GEN.IN

Parse Tree

Conclusion
ATI has been tested on 3,000 sentences for the AnglaUrdu and AnglaPunjabi systems. The results show that:
ATI is effective, efficient, and user-friendly for testing.
It saves the tester's time, provides a scientific platform for testing, and removes ambiguity in the testing method itself.
The overhead on the AnglaBharti system during operation is negligible.

Future Work
The system can be further generalized to test any RBMT system for any language pair.
The system can be extended to create linguistic resources such as dictionaries, synonym lists, terminology, and multiword lists.

Thank You
