-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Li Shumin
committed
Oct 20, 2024
1 parent
327fce8
commit 1db702b
Showing
4 changed files
with
29 additions
and
1 deletion.
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
# PM3-Bench | ||
|
||
## Introduction | ||
The [ClinGen Evidence Repository](https://erepo.clinicalgenome.org/evrepo/) provides expert-curated assertions, they are written in plain English, posing a difficult challenge for automated evaluation of benchmarks. To address this, we created PM3-Bench, a comprehensive dataset for PM3 literature evidence extraction, based on the ClinGen Evidence Repository | ||
|
||
![](../images/PM3-bench.png) | ||
|
||
--- | ||
|
||
## Description | ||
We provide the `PM3-Bench.json` in this repo, which includes the following fields: | ||
| Column Name | Description | | ||
|---| --- | | ||
| ClinGen ID | The original ID in the ClinGen Evidence Repository | | ||
| Variant Name | The HGVS name of the variant (DNA change) | | ||
| Condition | The condition reported in ClinGen | | ||
| Criterion | The met ACMG criteria | | ||
| Raw Comment |The expert-submitted comment | | ||
| PMID | The PubMed ID of the literature evidence | | ||
| Number of Patients | Extracted number of patients based on the comments | | ||
| In trans Variants | List of in trans variants extracted from the comments, augmented in all possible formats, separated by space. "NA" means no in trans variant was mentioned in the raw comments | | ||
| labels | `eval`: variant-publication pairs for evaluations; `others`: truncated publication XML file in NCBI API, removed in evaluation; `fine-tune`: remaining samples, where non-empty comments used for fine-tuning| | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.