Week 1 Day 5: Brainstorming about the future direction of the project

Research on available resources

Today, my mentor, David De Hilster, and I brainstormed about the future direction of our project. In the beginning, our short-term plan was to extract data from Wiktionary in the form of wikitext, parse it, and build a Nepali dictionary using NLP++. While doing background research, I found the following three existing Nepali dictionaries, which can be very helpful resources to start with:

We also discussed whether to use the English (romanized) version of Nepali words or the Nepali version itself as input data for building the dictionary. To use the Nepali version of wikitext, we wondered whether there is a Wikipedia written in Nepali. I looked for it and found that one is available at https://ne.wikipedia.org/wiki. I also found a list of the 1000 most common Nepali words: https://1000mostcommonwords.com/1000-most-common-nepali-words/. One possible future direction would be to use these available Nepali dictionaries, compare and match their words against Wiktionary, and, if a word is not listed in Wiktionary, add it to Wiktionary using NLP++.
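To make the compare-and-match idea concrete, here is a minimal Python sketch of how it might work. It is not part of our NLP++ pipeline. It assumes we already have two plain lists of words in memory, one from an existing dictionary and one of Wiktionary entry titles. The function names `normalize` and `find_missing_words` are hypothetical, and the Unicode normalization step is my own assumption about how to keep differently encoded Devanagari spellings from being counted as different words.

```python
import unicodedata


def normalize(word: str) -> str:
    """Trim whitespace and apply Unicode NFC normalization.

    Assumption: both sources may encode the same Devanagari word with
    different code point sequences, so we normalize before comparing.
    """
    return unicodedata.normalize("NFC", word.strip())


def find_missing_words(dictionary_words, wiktionary_titles):
    """Return dictionary words that have no matching Wiktionary title.

    The order of the dictionary list is preserved and duplicates are dropped.
    """
    known = {normalize(title) for title in wiktionary_titles}
    seen = set()
    missing = []
    for word in dictionary_words:
        candidate = normalize(word)
        if candidate and candidate not in known and candidate not in seen:
            seen.add(candidate)
            missing.append(candidate)
    return missing


if __name__ == "__main__":
    # Tiny hand-made sample lists, for illustration only.
    dictionary_words = ["पानी", "घर", " घर ", "किताब"]
    wiktionary_titles = ["पानी"]
    for word in find_missing_words(dictionary_words, wiktionary_titles):
        print(word)
```

The words this returns would be the candidates for new Wiktionary entries. Loading the real dictionary files and the Wiktionary titles is left out, since that depends on the format each source turns out to use.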
