-
Notifications
You must be signed in to change notification settings - Fork 308
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
author urls in style name/id #1179
base: master
Are you sure you want to change the base?
Conversation
Issue #623 Now generates author pages with urls in form name/id for most people this looks like: people/d/david-chiang/david-chiang/ Matt Post has an ORCID in name_variants.yaml, so his page is: people/m/matt-post/0000-0002-1297-6794/ and then there is: people/y/yang-liu/yang-liu-edinburgh/ people/y/yang-liu/yang-liu-ict/ people/y/yang-liu/yang-liu-icsi/ people/y/yang-liu/yang-liu-umich/ I don't know how to make the old URLs people/m/matt-post/ resolve.
Thanks, I'll take a look soon! |
Some thoughts:
|
I definitely agree about dropping the first letter. I agree that name_variants.yaml should be split up into lots of files; it's really an author database now and not just name variants. |
Sorry that I'm behind on this—I will catch up next week! |
I thought this would be a good testbed for the previews. |
Oh, yes, good call! |
Build successful. You can preview it here: https://aclanthology.org/previews/author-url |
Some TODOs:
|
Some thoughts after perusing the build preview:
|
Strongly agree and since we already know that we want to implement it, we (that is probably you) should already start requesting the inclusion of ORCID into the datasets. |
This simplest thing technically would be to add an ORCID field to their Softconf profiles, and to force them to do this prior to submission (and possibly final copies, which would let us get data from NAACL and ACL). Probably we can't force all authors to do this, but we could force the submitting author to do it. Do you know if there are any downsides to ORCID? For example, maybe it's not available in China? This will likely require coordination between us, Softconf, and the ACL Exec. I'm not sure whether everyone can move fast enough, but I'll get on it. |
Do you know if there are any downsides to ORCID? For example, maybe
it's not available in China?
I don’t know any downsides; it is *the* standard and widely used in
China as well: https://info.orcid.org/orcid-in-china/
So I don’t think it is blocked there :-)
I don’t even know any competing identifier, so using ORCID should be a
no-brainer. It is used by every publisher I know.
This will likely require coordination between us, Softconf, and the
ACL Exec. I'm not sure whether everyone can move fast enough, but I'll
get on it.
:thumbsupemoji:
|
Hi all:
There used to be competing in-house and standards (e.g., Thompson
Reuter's own identifiers), but the publishers agreed unanimously that a
central authority and identifier would be the best way to solve this
problem (fortunately or not, it was before blockchain). We should adopt
and promote its use over other domain-specific identifiers.
- M
On Wed, Apr 7, 2021 at 11:56 PM Arne Köhn ***@***.***> wrote:
> Do you know if there are any downsides to ORCID? For example, maybe
> it's not available in China?
I don’t know any downsides; it is *the* standard and widely used in
China as well: https://info.orcid.org/orcid-in-china/
So I don’t think it is blocked there :-)
I don’t even know any competing identifier, so using ORCID should be a
no-brainer. It is used by every publisher I know.
> This will likely require coordination between us, Softconf, and the
> ACL Exec. I'm not sure whether everyone can move fast enough, but I'll
> get on it.
:thumbsupemoji:
—
You are receiving this because your review was requested.
Reply to this email directly, view it on GitHub
<#1179 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AABU726GEWRPBDKPJ2OVDKTTHR6CJANCNFSM4VR75SEA>
.
--
- M
|
+1000 to using ORCiD :) And just in case: names can change, also (especially) at ORCiD, and it's probably worth thinking about how to handle that. I'll start a separate discussion with ACL exec for this in general in which you're very welcome to participate. |
Okay, Softconf has added this to the Global profile. You can set yours by visiting the global profile page. Maybe a few of you can test this, as I did? Softconf is going to have this dumped in the DB file distributed with proceedings tarballs, so we will have it available for disambiguation purposes. Next steps:
|
Build successful. You can preview it here: https://preview.aclanthology.org/author-url |
Build successful. Some useful links:
This preview will be removed when the branch is merged. |
Build successful. Some useful links:
This preview will be removed when the branch is merged. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Any updates on this?
If I understand this correctly, the latest proposal was to:
- Have
/people/{NAME-SLUG}/{IDENTIFIER}
as the canonical URL for an author. - For ambiguous names, have
/people/{NAME-SLUG}
be a disambiguation page linking to the different authors. - For un-ambiguous names, have
/people/{NAME-SLUG}
be the canonical page. <-- I'm not sure about this one, wouldn't this cause problems if we introduce another author of the same name later?
What about names that we have disambiguated using our current system? A slug like /people/huy-nguyen/huy-nguyen-stanford
seems quite verbose.
@@ -90,7 +91,7 @@ def export_anthology(anthology, outdir, clean=False, dryrun=False): | |||
name = anthology.people.get_canonical_name(id_) | |||
log.debug("export_anthology: processing person '{}'".format(repr(name))) | |||
data = name.as_dict() | |||
data["slug"] = id_ | |||
data["slug"] = slugify(repr(name)) or "NONE" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is a PersonName.slug()
function in bin/anthology/people.py, I think all slug generation should be done in that class rather than here in the build script.
Whoops, shift-enter sends a comment ... Ideally, the location of a profile would not:
The ideal for me, therefore, would be Once we have the identifier first, this would not be as bad anymore:
because it would be Ideally, we would have an ORCID for every author. This will not happen (as we cannot get them for all old entries) but it would be good to push for it going forward. |
Jumping in here to add that if you would like ARR to help collect any information related to this effort, let us know (for example, we could insist that authors include their ORCID link in their OR profile, and then we would be able to export it when we export papers from commitment sites, which are not managed by ARR, but link to the same OR profile). |
Issue #623
Now generates author pages with urls in form name/id
for most people this looks like:
people/d/david-chiang/david-chiang/
Matt Post has an ORCID in name_variants.yaml, so his page is:
people/m/matt-post/0000-0002-1297-6794/
and then there is:
people/y/yang-liu/yang-liu-edinburgh/
people/y/yang-liu/yang-liu-ict/
people/y/yang-liu/yang-liu-icsi/
people/y/yang-liu/yang-liu-umich/
I don't know how to make the old URLs people/m/matt-post/ resolve.