Commit

train disease-cell,gene,process,phenotype synthetic
bill-baumgartner committed Aug 29, 2024
1 parent 396b7ed commit 251bb4a
Showing 15 changed files with 42,041 additions and 12,004 deletions.
1,158 changes: 579 additions & 579 deletions .github/workflows/bert-models-cicd.yaml

Large diffs are not rendered by default.

8 changes: 4 additions & 4 deletions IMAGE_VERSIONS_PREDICT
@@ -15,7 +15,7 @@ BL_PROCESS_TO_ANATOMY=0.1
 BL_PROCESS_TO_COMPONENT_CELL=0.1
 BL_PROCESS_TO_CELL=0.1
 BL_DISEASE_TO_ANATOMY=0.3
-BL_DISEASE_TO_CELL=0.2
-BL_DISEASE_TO_GENE=0.1
-BL_DISEASE_TO_PHENOTYPE=0.1
-BL_DISEASE_TO_PROCESS=0.3
+BL_DISEASE_TO_CELL=0.3
+BL_DISEASE_TO_GENE=0.2
+BL_DISEASE_TO_PHENOTYPE=0.2
+BL_DISEASE_TO_PROCESS=0.4
8 changes: 4 additions & 4 deletions MODEL_VERSIONS
@@ -24,7 +24,7 @@ BL_PROCESS_TO_ANATOMY=0.1
 BL_PROCESS_TO_COMPONENT_CELL=0.1
 BL_PROCESS_TO_CELL=0.1
 BL_DISEASE_TO_ANATOMY=0.3
-BL_DISEASE_TO_CELL=0.2
-BL_DISEASE_TO_GENE=0.1
-BL_DISEASE_TO_PHENOTYPE=0.1
-BL_DISEASE_TO_PROCESS=0.3
+BL_DISEASE_TO_CELL=0.3
+BL_DISEASE_TO_GENE=0.2
+BL_DISEASE_TO_PHENOTYPE=0.2
+BL_DISEASE_TO_PROCESS=0.4
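
The version bumps shown for IMAGE_VERSIONS_PREDICT and MODEL_VERSIONS can be checked mechanically. The following is a minimal sketch, assuming both files are plain KEY=VALUE text as the hunks suggest. The helper names `parse_versions` and `changed_versions` are hypothetical and not part of this repository, and the before/after strings are copied from the hunk rather than read from disk.

```python
def parse_versions(text):
    """Parse KEY=VALUE lines into a dict, skipping blank or malformed lines."""
    versions = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        versions[key.strip()] = value.strip()
    return versions


def changed_versions(old, new):
    """Return {key: (old_value, new_value)} for keys whose value differs."""
    return {k: (old.get(k), v) for k, v in new.items() if old.get(k) != v}


# Values copied from the hunk above (illustrative input, not read from the repo).
OLD = """
BL_DISEASE_TO_ANATOMY=0.3
BL_DISEASE_TO_CELL=0.2
BL_DISEASE_TO_GENE=0.1
BL_DISEASE_TO_PHENOTYPE=0.1
BL_DISEASE_TO_PROCESS=0.3
"""

NEW = """
BL_DISEASE_TO_ANATOMY=0.3
BL_DISEASE_TO_CELL=0.3
BL_DISEASE_TO_GENE=0.2
BL_DISEASE_TO_PHENOTYPE=0.2
BL_DISEASE_TO_PROCESS=0.4
"""

bumps = changed_versions(parse_versions(OLD), parse_versions(NEW))
for key, (before, after) in sorted(bumps.items()):
    print(f"{key}: {before} -> {after}")
```

Running the same comparison on the real contents of both files would be one way to confirm that they stay in step, since this commit applies identical bumps to each.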
4 changes: 4 additions & 0 deletions data/bl_disease_to_cell/CHANGELOG.txt
@@ -1,3 +1,7 @@
+v0.3
+- trained on synthetic sentences created by GPT (disease-has_basis_in-cell.synthetic.bert.v1.random.tsv)
+- predicate is now has_basis_in
+
 v0.2
 - v0.1 had terrible performance; re-trained and observed expected performance

6,565 changes: 3,637 additions & 2,928 deletions data/bl_disease_to_cell/data.tsv

Large diffs are not rendered by default.

3,637 changes: 3,637 additions & 0 deletions data/bl_disease_to_cell/disease-has_basis_in-cell.synthetic.bert.v1.random.tsv

Large diffs are not rendered by default.

4 changes: 4 additions & 0 deletions data/bl_disease_to_gene/CHANGELOG.txt
@@ -1,2 +1,6 @@
+v0.2
+- trained on synthetic sentences created by GPT (disease-has_basis_in-gene.synthetic.bert.v1.random.tsv)
+- predicate is now has_basis_in
+
 v0.1
 - trained on synthetic sentences created by GPT (assoc-with-disease.gene.synthetic.bert.v1.tsv)
7,392 changes: 4,608 additions & 2,784 deletions data/bl_disease_to_gene/data.tsv

Large diffs are not rendered by default.

4,608 changes: 4,608 additions & 0 deletions data/bl_disease_to_gene/disease-has_basis_in-gene.synthetic.bert.v1.random.tsv

Large diffs are not rendered by default.

4 changes: 4 additions & 0 deletions data/bl_disease_to_phenotype/CHANGELOG.txt
@@ -1,2 +1,6 @@
+v0.2
+- trained on synthetic sentences created by GPT (disease-has_phenotype-phenotype.synthetic.bert.v1.random.tsv)
+- predicate is now has_phenotype
+
 v0.1
 - trained on synthetic sentences created by GPT (assoc-with-disease.phenotype.synthetic.bert.v1.random.tsv)
10,145 changes: 7,174 additions & 2,971 deletions data/bl_disease_to_phenotype/data.tsv

Large diffs are not rendered by default.

Large diffs are not rendered by default.

4 changes: 4 additions & 0 deletions data/bl_disease_to_process/CHANGELOG.txt
@@ -1,3 +1,7 @@
+v0.4
+- trained on synthetic sentences created by GPT (disease-has_basis_in-biological_process.synthetic.bert.v1.random.tsv)
+- predicate is now has_basis_in
+
 v0.3
 - decent performance - no change in the data. Not sure why.

8,034 changes: 5,300 additions & 2,734 deletions data/bl_disease_to_process/data.tsv

Large diffs are not rendered by default.

Large diffs are not rendered by default.
