-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
We should be able to convert between different ontological code vocabularies. #204
Comments
Most similar existing stage is vocabulary ID creation and assignment / tokenization:
|
Open question: How to download/store/access the ohdsi vocab remapping tables? |
I wonder if the pipeline's first step should consider that there might be And @mmcdermott , could you look at the file structure of what I have added so far to confirm if this complies with the framework? |
Thanks for the nudge @prenc ; I will try to take a look later today! Also, yes, we should assume that vocabulary fitting will occur later in the pipeline so we do not need to worry about that at this stage. |
Target: We should be able to take a MEDS dataset (with parent-code entries in the metadata) and run a script to map codes in one omop ontology space (e.g., ICD9) to another (e.g., ICD10) using standardized vocabulary mapping tables (e.g. OHDSI vocabulary concept relationship tables)
This will entail two steps (yet to be determined on how to localize into actual stages):
codes.parquet
metadata file, use the vocabulary relationships to remap in the parent code space into the target output space. Store the original code string and the updated code string in some pre-set format. (with codes not in the vocabulary conversion step omitted)codes.parquet
with the original and new code columns to perform a one-to-many mapping from the original shards to shards where the codes have been remapped (with codes not in the vocabulary conversion step omitted).The text was updated successfully, but these errors were encountered: