Add prompts for POS tagging on Universal Dependencies dataset #754

aakanksha19 · 2022-04-27T20:41:51Z

Six prompts have been created to add part-of-speech tagging on the Universal Dependencies dataset to PromptSource, addressing this GitHub issue: bigscience-workshop/evaluation#24 . These prompts have been created from scratch since we could not find any POS tagging task references. We are using the following output format: the model must produce a sequence of word-tag pairs (e.g., the DET black ADJ sheep NOUN). Right now we are use edit distance as an initial metric, but later on we will look into implementing a more accurate metric that fits this setting.

In addition to these prompts, the universal dependencies dataset has also been added to Huggingface (under aakanksha/udpos) in a prompting-friendly format. To allow this dataset to be visible in promptsource, I have added my user name (aakanksha) to the list of included_users. If there is a better way to do this, please let me know and I can change this.

If there are any issues with the PR, please comment below and I will work on addressing them. Thanks a lot!

…w up in promptsource

awebson · 2022-04-27T21:56:37Z

Thanks for the PR! Your prompts do not indicate no gold target. Please use ||| in your jinja template to separate the input from the target.

Additionally, we will discuss the use of non-natural language prompts in our Eastern Time standup meeting tomorrow (Thursday). I don't think non-expert humans can perform the task of generating every word and its POS tag as your prompts described. Please join our meeting and discuss if you can!

jzf2101 · 2022-06-27T04:16:09Z

@aakanksha19 - there are merge conflicts, could you please resolve?

aakanksha19 added 2 commits April 27, 2022 16:32

Adding username to included_users to allow dataset added to HF to sho…

b827516

…w up in promptsource

Adding prompts for POS tagging on the Universal Dependencies dataset

e33759c

aakanksha19 changed the title ~~Adding prompts for POS tagging on Universal Dependencies dataset~~ Add prompts for POS tagging on Universal Dependencies dataset Apr 27, 2022

aakanksha19 changed the base branch from main to eval-hackathon April 27, 2022 20:45

Changing metric to edit distance

e9755b1

awebson self-assigned this Apr 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add prompts for POS tagging on Universal Dependencies dataset #754

Add prompts for POS tagging on Universal Dependencies dataset #754

aakanksha19 commented Apr 27, 2022 •

edited

Loading

awebson commented Apr 27, 2022

jzf2101 commented Jun 27, 2022

Add prompts for POS tagging on Universal Dependencies dataset #754

Are you sure you want to change the base?

Add prompts for POS tagging on Universal Dependencies dataset #754

Conversation

aakanksha19 commented Apr 27, 2022 • edited Loading

awebson commented Apr 27, 2022

jzf2101 commented Jun 27, 2022

aakanksha19 commented Apr 27, 2022 •

edited

Loading