Nepali is an under-resourced language in Natural Language Processing (NLP). As a native speaker, I feel a responsibility to take the initiative in making Nepali more widely used and better formalized online, so that it eventually counts as a resource-rich language. The best way to achieve this is to add all existing Nepali words to Wiktionary, which would serve as a foundation for others to pursue further NLP research on Nepali. This summer, I therefore propose to take the Nepali words and phrases available on Wiktionary, parse them, create structured dictionary data, and analyze them with an NLP++ analyzer. To work with these words and phrases, I will first download them and upload them to the ECL (Enterprise Control Language) cloud, then access them through NLP++ plugins and run the analysis locally in VisualText.
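To make the "parse them, create structured data" step concrete, here is a minimal Python sketch of what a structured dictionary record could look like. It is only an illustration: the proposed work would do this with NLP++ analyzers in VisualText and ECL, not Python. The sample wikitext is hand-written and simplified, the function names `extract_language_section` and `parse_entry` are hypothetical, and the record layout (`word`, `language`, `senses`) is an assumption rather than a format defined by the project.

```python
import json
import re

# Hand-written, simplified wikitext for illustration only; real Wiktionary
# entries contain templates and many more subsections.
SAMPLE_WIKITEXT = """\
==Hindi==
===Noun===
# [[water]]

==Nepali==
===Noun===
# [[water]]
"""


def extract_language_section(wikitext, language):
    """Return the text under the level-2 heading ==language==, or '' if absent."""
    pattern = rf"^=={re.escape(language)}==[ \t]*$(.*?)(?=^==[^=]|\Z)"
    match = re.search(pattern, wikitext, flags=re.MULTILINE | re.DOTALL)
    return match.group(1) if match else ""


def parse_entry(word, wikitext, language="Nepali"):
    """Build a structured record (assumed layout) for one word in one language."""
    section = extract_language_section(wikitext, language)
    senses_by_pos = {}
    current_pos = None
    for line in section.splitlines():
        # Level-3 headings such as ===Noun=== name the part of speech.
        heading = re.fullmatch(r"===\s*([^=]+?)\s*===", line.strip())
        if heading:
            current_pos = heading.group(1)
            senses_by_pos.setdefault(current_pos, [])
        elif line.startswith("# ") and current_pos is not None:
            # Definition line: reduce [[link]] and [[target|label]] to plain text.
            gloss = re.sub(r"\[\[(?:[^|\]]*\|)?([^\]]+)\]\]", r"\1", line[2:])
            senses_by_pos[current_pos].append(gloss.strip())
    return {"word": word, "language": language, "senses": senses_by_pos}


if __name__ == "__main__":
    record = parse_entry("पानी", SAMPLE_WIKITEXT)
    print(json.dumps(record, ensure_ascii=False, indent=2))
```

Records of this shape could then be serialized one per line before being uploaded for further analysis; the actual upload format would depend on what the ECL side expects.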
Nepali Language Enrichment: Leveraging Wiktionary for NLP
