📞 +91-7667918914 | ✉️ ijarcce@gmail.com
IJARCCE Logo
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 4, ISSUE 8, AUGUST 2015

Evaluation of Similarity metrics for translation retrieval in the Hindi-English Translation Memory

Shashi Pal Singh, Ajai Kumar, Hemant Darbari, Sukanya Chauhan, Nandita Srivastava, Priya Singh

DOI: 10.17148/IJARCCE.2015.4813

Abstract: This paper discusses the bigram model with 3-operation Edit Distance (Levenshtein Distance) String matching metrics for translation retrieval in Hindi-English Translation memory system. In this method we used the statistical language modeling (N-gram approach) to compute bigrams and then implemented the dynamic programming algorithm Levenshtein Distance to find the minimum number of edit operations required transforming one bigram to another and will act as a measure to provide extent for the matching of current input and the source in the TM. This measure will decide whether the translation retrieved correspondingly will be exact match or fuzzy match. Other string matching approaches are evaluated with Levenshtein Distance proving to be more effective comparatively.



Keywords: String Matching, Bigram modelling, Levenshtein Distance, Hindi-English Translation memory (TM).

How to Cite:

[1] Shashi Pal Singh, Ajai Kumar, Hemant Darbari, Sukanya Chauhan, Nandita Srivastava, Priya Singh, “Evaluation of Similarity metrics for translation retrieval in the Hindi-English Translation Memory,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2015.4813