ezid - Get, Add, Update, and delete ARKS via the EZID api
ezid [-S|--session=<httpie session>] [-b|--base=<base>] \
[-n|--dry-run] [-p|--print] [-h|--help]\
<command> [<args>]
where <command> is one of fq, anvl, args_to_anvl, array_to_anvl, login, get, mint, put, update, delete
Ezid is a script that simplifies the retrieval, creation and update of ezid ARKS. Ezid uses as set of functions to communicate with the ezid services. Ezid uses a few other tools to perform the functions. "COMMANDS" is summary of the commands that are available.
A good review of the format and organization of an ark: can be found in the ID Concepts description.
- -S|--session|--http_session=http_session
-
Set the HTTPIE session variable. This allows you to maintain multiple authentication setups for your EZID, so that you can go back and forth between users. This is equivalent to the httpie session variable, and are shared with that command. Sessions primarily keep the basic authentication parameters saved. By default the ezid session is used.
- -s |--base=base
-
Set the base that you want to use. The base is a combination of the scheme,NAAN, and shoulder components of the identifier. The default base is the EZID ark testing shoulder ark:/99999/fk4. This is a safe place to do your testing. If you are using complete arks in your commands, this is not required, but if you are using shorthands, for example only using the blades of the arks, then the fully qualified arks are created from this base.
- -n|--dry-run
-
Show what you would do do not really do it. Because some commands require multiple access to the server, this command does not always work properly :(.
- --print|http_print
-
Adjust httpie --print= argument. By default only the response body --print=b is shown.
- -h|--help
-
Shows the manpage for the program. The help pages are embedded in the script and require the functions,
pod2usage
andpod2text
to work properly.
There are a number of ezid commands that are used in manipulating the ARKS. There are metadata commands to edit and create the required inputs, and there are commands to retrieve, create, and update records as well.
There are some informational commands that do not access the CDL server.
ezid [--base=base] fq ark
will parse an ark, and either verify it, or expand it to a full ark if it is only a shoulder. See "FQ" for more information.
ezid anvl [--array] [--csv=items]
Will read anvl formatted data from stdin
and output it either as a bash array, or as a csv row. See "ANVL" for more information.
ezid args_to_anvl [key:value] [key2:anvl2]
will read passed key:value pairs from the commandline and output anvl format to stdout
. See "ARGS_TO_ANVL" for more information.
ezid array_to_anvl `declare -p bash_array`
will read the first parameter as serialized bash array, and parse that as anvl. See "ARRAY_TO_ANVL" for more information.
Next there are a set of commands that communicate with the CDL service. Note, that ezid uses httpie for its http communcation. This allows users to combine ezid with other httpie requests if required. Login infomation is stored using the standard httpie
methodology, see "GLOBAL OPTIONS" for httpid options.
ezid [options] get [--array] [--csv=list] ark(s)
retrieves ARKs from the ezid server. Can output in multiple formats. See "GET" for more information.
ezid [--session=http_session] login --auth=user[:password]
Allows users to set and save their login parameters for further updates. See "LOGIN" for more information.
ezid [global_options] mint [--proxy=proxy_server] [key:value] [key2:value2] ...
will mint a new ark in the specified --base
. See "MINT" for more information.
ezid [global_options] update ark [key:value] [key2:value2] ...
updates an existing ark with the passed key:value pairs. See "UPDATE" for more information.
ezid [global_options delete ark(s)
: Deletes ARKS if current status is _status:reserved
. See "DELETE" for more information.
ezid <--base=> fq [ark]
ezid fq
will output the fully qualified ark:
. This will combine the base
specifiation with the passed ark or ark:fragment, and guess at the fully qualified version. Note that the --base
is an c<ezid> parameter, and not an option of this function. Currently, this function does NOT use the parity check that is part of the ark: specification.
# verify an ark:
ezid --session=ucd-legacy login --auth=ucd-legacy
After which the session ucd-legacy
- --array
-
This will output a bash style associative array from a given ARK, where each key of the array is a key of the retrieved ANVL format. these can be
eval
ed for use later in a bash script. - --csv=columns
-
You can specify the columns that you would like to retrieve using this parameter. Somewhat following anvl conventions, the column names are : delimited. This is a convenient way to create a table from a list of ARKs.
- --ark
-
You can specify and ark to be associated with the input avnl data. This is a convenience primarily with the
--csv
parameter, so that you can easily include an ark in CSV file output. Note that if the ANVL input also specifies anark:
parameter, then that takes precedence.
ezid args_to_anvl [anvl parameters]
ezid args_to_anvl
reads passed parameters as set of items to include in an ANVL file, and output that ANVL file to stdout. In keeping with ANVL format, arguments are delinated into key:value pairs using the colon :
. Like normal ANVL, if you want a colon as part of your key, then that needs to be escaped. The actual values themselves will be verified and also escaped if they need to be.
# This is a simple example for two parameters
ezid args_to_anvl erc.who:Quinn erc.what:Eskimo
#This example shows that value escaping is a thing.
./ezid args_to_anvl erc.who:Quinn erc.what:Eskimo where:$'The\nfrozen\north'
ezid array_to_anvl "$(declare -p foo)"
ezid array_to_anvl
reads a bash associated array as the mand outputs ANVL formatted data from stdin
and reformats it as an ANVL file. This is typically a debug function, but could be used in script environment to mint values for example, where the output of this file is piped to the stdin for a ezid mint
command.
# This example converts a simple ANVL file to a bash array
declare -A foo;
foo[erc.who]=Quinn
foo[erc.what]=Eskimo
ezid array_to_anvl "$(declare -p foo)"
ezid options get [--array] [--csv=columns] [--header] ARK(s)
ezid get retrieves existing ARKS from the ezid server, and displays them either as anvl (default), csv or as a bash array for evaluation. The CSV format is most suitable for retrieving multiple arks.
- --csv=columns
-
You can specify the columns that you would like to retrieve using this parameter. Somewhat following anvl conventions, the column names are : delimited. This is a convenient way to create a table from a list of ARKs.
- --header
-
When specifing a --csv command, this will include a header on the first row of the output.
- --array
-
This will output a bash style associative array from a given ARK, where each key of the array is a key of the retrieved ANVL format. these can be
eval
ed for use later in a bash script. For example the commandeval \$(./ezid get --array ark:/87287/d7q30n); echo \${anvl[_target]}
ezid login --auth=USER[:PASS]
ezid login is a simple wrapper around the httpie --auth=USER:PASS command. This allows users to setup their basic authorization, which is then stored in the standard httpie session paratmeters. It is possible to maintain multiple users, via the ezid --session parameter to maintain different users, eg.
ezid --session=ucd-legacy --auth=ucd-legacy
After which the session ucd-legacy
, will be set as a new httpie session, with the saved authorization.
- -a|--auth=USER[:PASS]
-
You specify the basic authentication for the ezid.cdlib.org server. If you only specify the USER, then you are prompted for the password.
- --proxy=proxy_server
-
Setting this parameter will cause the mint function, to perform an immediate "UPDATE" where the target key value pair will be specified as
_target:{proxy_server}{ark}
, the concatentation of theezid mint --proxy=https://ark.foo.edu/ -verify erc.who:Quinn erc.what:Eskimo ./ezid mint --proxy=https://ark.foo.edu/ --verify erc.who:Quinn erc.what:Eskimo
responds with:
success: ark:/99999/fk4qf9zb42 _updated: 1555441448 _target: https://ark.foo.edu/ark:/99999/fk4qf9zb42 erc.who: Quinn _profile: erc _export: yes _owner: ucd-legacy _ownergroup: ucd-library _created: 1555441448 _status: reserved erc.what: Eskimo
- --verify
-
After creating the ark, will "GET" the ark to verify it was created. This will also output the record ANVL format, unless another option
--array
or--csv
is specified. - --csv=columns | =item --array
-
See "GET" for output options.
ezid [global_options] update --ark=ark [--verify] [--proxy=proxy_server] [key:value] [key:value] ...
ezid update
updates and existing ark by overwriting any of the passed key:value
pairs. An existing --ark
is required, otherwise the function works exactly as the "MINT" command. Please see "MINT" on usage.
- --ark=ark
-
Specify the ark to update.
- --csv=columns | --array | --verify
-
See "MINT" for output options.
ezid I<options> delete [ARK(s)]
ezid delete deletes existing ARKS from the ezid server, if their current status is reserved
.
Scripting with CSV files can be hard, if you are trying to parse complicated csv files. So a good method of scripting with csv files is to use another tool. For example, the csvtool is a nice tool for scripting on csv files. csvtool
allows you to call a script on each row of a csv file, passing the values as positional parameters. You can make a small wrapper script to translate these positional parameters to key:value pairs, and then run the tool over a csv file to create or update files.
For example, assume you have a csv file, in.csv
like this:
who,what,more
Quinn,The Eskimo,Bob Dylan
Sloopy,Hang on,The McCoys
You can create a small script file, mint.sh
that looks like this:
# /bin/bash
base=ark:/99999/fk4
proxy=https://digital.ucdavis.edu/
cols=success:_target:erc.who:erc.what
ezid --base=${base} mint --proxy=${proxy} --csv=${cols} erc.who:"$1" erc.what:"$2"
Then you can mint arks, and save the minted arks back to a csv file like this. The example shows how you might use ~csvtool~ to select only a set of columns. In practive, the little wrapper script can be a good place to add constants, or manipulate the columns before sending them off to ezid.
csvtool namedcol who,what in.csv |\
head | tail -n +2 | csvtool call ./mint.sh - | tee out.csv
With the following results:
ark:/99999/fk4qc17z06,https://digital.ucdavis.edu/ark:/99999/fk4qc17z06,Quinn,The Eskimo
ark:/99999/fk4km0j474,https://digital.ucdavis.edu/ark:/99999/fk4km0j474,Sloopy,Hang on
Quinn Hart <[email protected]>