Local word grouping: NLP

Overview of the task:

This is a project to find a particular type of words from any given Hindi text labeled with their parts of speech. It is a task of natural language processing (NLP).

It requires a high level of programming knowledge in Java (especially the string/text processing).

Description of the task:

Input: Parts of Speech (PoS) tagged text, one sentence per line.

Desired Output: Verbal words boundary marked with a special predefined label.

Example Input text:

वह\ किताब\ पढ़ेगा\ ।\PU

वह\ किताब\ पढ़ता\ है\ ।\PU

वह\ किताब\ पढ़\VM. रहा\ है\ ।\PU

Example Output text:

वह किताब [पढ़ेगा][url removed, login to view] ।\PU

वह किताब [पढ़ता है[url removed, login to view] ।\PU

वह किताब [पढ़ रहा है][url removed, login to view] ।\PU


वह\ किताब\ [पढ़ेगा\][url removed, login to view] ।\PU

वह\ किताब\ [पढ़ता\ है\][url removed, login to view] ।\PU

वह\ किताब\ [पढ़\VM. रहा\ है\][url removed, login to view] ।\PU

(Note: If you see junk text instead of the Hindi characters in the examples above, please see the attached file.)

Other Details:

There are a total of 192 tags that can be assigned as the boundary marker for the verb group.

More details will follow. A brief of the algorithm is as follows:

1. Start searching for verbal word from the right of the sentence boundary.

2. When a verbal word is found, match it with a template and store it.

3. Continue the search rightward looking for more verbal words and match it with the TAM template. Continue it till the last verbal word found in the sequence.

4. From among the templates matched, choose the longest verb sequence matched with the TAM template and mark the boundary of the verb sequence within square braces ‘[ ]’.

5. Assign the tag of the TAM template matched at the end of the square bracket, prefixed with /VG./.

Skills: Java, JSP

See more: local word grouping, word match algorithm, word find template, text search algorithm, template for algorithm, template algorithm, string searching in c, string searching algorithm, string searching, string search algorithm c, string search algorithm, string processing in c, string processing algorithm, string match algorithm, string match, string algorithm, start java programming, search string examples, searching java, searching a string in c

About the Employer:
( 2 reviews ) New Delhi, India

Project ID: #992469

Awarded to:


Hi, I think I can do this if you can provide some details in the PMB. Cheers!!

$520 USD in 30 days
(3 Reviews)

7 freelancers are bidding on average $546 for this job


Hello, Please view PMB. Ashwin

$750 USD in 15 days
(39 Reviews)

Hi, I can do it. Conatact me to discuss details.

$500 USD in 10 days
(7 Reviews)

Please see PMB

$500 USD in 5 days
(2 Reviews)

Hi, I'm a professional Java developer (SCJP 6) with experience in NLP and text processing. Contact me to discuss it further.

$500 USD in 10 days
(1 Review)

Hello! I am developing in Java, JSP since 2000, so I have a huge experience. I would be thrilled to do this project for You.

$400 USD in 15 days
(0 Reviews)

check my PM

$650 USD in 20 days
(0 Reviews)