Category Archives: Week 02
Week 2 Day 5: Planning for Next Week
Today, my mentor and I discussed possible ways to proceed with this project. So far, I have started with a small dataset and parsed the “1000 most common Nepali words” dataset. Now, my goal is to do background research on Wiktionary, get familiar with the format in which words are added to Wiktionary, and look up common Nepali words to check whether the…
Week 2 Day 4: Running NLP++ Analyzers on Actual Nepali Data
I continued parsing text extracted from the “1000 most common Nepali words” webpage. With my mentor's guidance, I created more NLP++ passes with rules to parse rows and columns and remove whitespace, used built-in libraries such as KBFuncs, and created other NLP++ passes such as KbInit, KbBuild, and KbDisplay to parse and display the output…
Week 2 Day 3: Training on NLP++
Today, I attended a meeting with David, my mentor, and Lucas, another of my mentor's interns, who is also working on Wiktionary data using NLP++. Lucas was running an NLP++ analyzer on WikiText obtained from Chinese Wiktionary, which was in the form of a header, dictionary, and synonyms. In this meeting, I learned about the following: …
Week 2 Day 2: Azure Account Setup and Pre-processing Training Data
Microsoft Azure Account Setup
To process large amounts of data from Wiktionary, I will need Azure to build and test our NLP++ parser and analyzer in the cloud. Therefore, I signed up for a free Azure account using the following link: https://azure.microsoft.com/en-us/features/azure-portal/ I also received a $200 credit through that account, which I will be using later while…
Week 2 Day 1: Resolve HPCC systems access issues and NLP++ Training
HPCC Systems Access Issues
I was having trouble accessing hpccsystems.com and the training documents available there, which I needed to complete the ECL training. Therefore, I reached out to the Help Desk, the HPCC Systems team, and the ECL team. I later learned that I was supposed to wait for approval after creating an account.
