Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create and evaluate algorithms to match Internships to Interests #765

Open
philipmjohnson opened this issue Aug 21, 2021 · 0 comments
Open

Comments

@philipmjohnson
Copy link
Collaborator

philipmjohnson commented Aug 21, 2021

Please do work for this task in a branch called issue-765.

We need to determine which Interests are associated with a given Internship. This can be complicated to do well.

One way to proceed is to create a standalone JSON file containing a few thousand listings, and another file with the current set of interest descriptions in RadGrad, and then build algorithm(s) to map internship listings to one or more RadGrad Interests.

We'll start with a default, "dumb" algorithm which just does some glorified regex matching, and then allow folks to try to beat it with NLP or other classification algorithms. We'll grade the results by running the algorithms over a test set which used an "oracle" (basically some humans) to establish the ground truth mapping.

This needs to work for both Computer Science and Computer Engineering internships and interests.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant