Week 2 Day 3: Training on NLP++

Today, I attended a meeting with David, my mentor, and Lucas, another intern of my mentor who is also working on Wiktionary data using NLP++. Lucas was running an NLP++ analyzer on WikiText obtained from Chinese Wiktionary, which was in the form of header, dictionary, and synonyms. In this meeting, I learned about the following:Continue reading “Week 2 Day 3: Training on NLP++”

Week 2 Day 2: Azure Account Setup and Pre-processing Training Data

Microsoft Azure Account Setup To run big data from Wiktionary, I will need Azure to build and test our NLP++ parser and analyzer on Cloud. Therefore, I signed up for a free account on Azure using the following link: https://azure.microsoft.com/en-us/features/azure-portal/ I also received $200 credit through that account that I will be using later whileContinue reading “Week 2 Day 2: Azure Account Setup and Pre-processing Training Data”

Week 2 Day 1: Resolve HPCC systems access issues and NLP++ Training

HPCC Systems Access Issues I was having issues accessing hpccsystems.com or training documents available there that I needed to complete ECL training. Therefore, I reached out to the Help Desk, hpcc systems team, and the ECL team. I later came to know that I am supposed to wait for approval once I create an account.Continue reading “Week 2 Day 1: Resolve HPCC systems access issues and NLP++ Training”

Week 1 Day 5: Brainstorming about future direction of the project

Research on available resources Today, I and my mentor, David De Hilster brainstormed about future direction of our project. In the beginning our short-term plan was to extract the data from Wiktionary in the form of wikitext, parse them, and build a Nepali dictionary using NLP++. When doing background research, I found that the followingContinue reading “Week 1 Day 5: Brainstorming about future direction of the project”

Week 1: Day 3 & 4: ECL Training Cntd. and Cyber Defense Onboarding Security Training

Cyber Defense Onboarding Training Meanwhile, I started another training assigned to me by the Cyber Defense Awareness Team which is a part of the onboarding Program. The training name is: CDA-RSG Cyber Defense Onboarding Curriculum_Day 1_2022. I am going through its training documents and taking the survey at the end of it. I also completedContinue reading “Week 1: Day 3 & 4: ECL Training Cntd. and Cyber Defense Onboarding Security Training”

Nepali Language Enrichment: Leveraging Wiktionary for NLP

Nepali is an under-resourced language when it comes to its presence in the domain of Natural Language Processing (NLP). Nepali is my native language and I feel that it is my responsibility to take an initiative and work on making Nepali language popular, formal, and eventually make it counted as rich-resourced language on online platforms.Continue reading “Nepali Language Enrichment: Leveraging Wiktionary for NLP”

Week 1 Day 2: Training on ECL

The goal for this week is to complete training on ECL, download all the required software to access NLP++ plugins, and run it on local editor i.e. VisualText. Today, I completed the first three course menu under “Introduction to ECL (Part 1)”. By doing this, I could able to accomplish the following tasks: Downloaded ECLContinue reading “Week 1 Day 2: Training on ECL”

Week 1 Day 1: Account Setup, Planning and Training

Account Setup I started my first day by attending a virtual meeting with my manager and another intern who also joined today. I came to know an overview of another intern’s project and objectives. Based on the guidance of my manager, I called the Help Desk to set up my Gra access which is aContinue reading “Week 1 Day 1: Account Setup, Planning and Training”