Move plugin repos into this main repo #325

CarlinLiao · 2024-06-07T14:08:05Z

To close #323. Adds SPT graph plugins and includes them in the docker build and push Makefile recipes. I've verified that make build-docker-images recipe does not error and did a dry run make -n build-and-push-docker-images to verify that it's assembling the recipe as expected, but I haven't run it because I'm afraid of overwriting the images in Docker Hub, especially the plugin images that're in active use.

I looked into it and discovered that the submodule feature was added to git in v1.5.3, which released in 2007.

CarlinLiao · 2024-06-07T14:14:20Z

I had some consternation when it came to naming directories. The plugins' source code is in plugin/ but the Dockerfiles are in build/plugins/. This makes sense to me since plugin/ contains both all the plugins' source code as well as template(s) that are not themselves a plugin, while build/plugins/ contains all the Dockerfiles for the actual plugins that're meant to be built, but I can see how it can be confusing seeing as I've already confused myself while writing the Makefile recipes.

Similarly, I added the graph-generation subdirectory to both plugin/ and build/plugins/ under the assumption that at some point in the feature we want to support other plugin archetypes, even though this made writing the Makefile recipes marginally more complicated. (I even made an unpushed commit flattening the structure before reversing it.) However, I've structured the Docker Hub repos and Makefile recipes in a way such that it's assumed that the plugin names cannot clash with submodules or plugin names from other plugin archetypes, since graph-generation isn't included in the name of the Docker Hub repo but every repo is prepended with /nadeemlab/spt-.

CarlinLiao · 2024-06-07T20:38:23Z

Added a quick hot fix for a mostly unrelated issue that unnecessarily prepended "cuda-" to the version number for plugins that require CUDA to begin with (i.e., graph-transformer). I'm surprised this wasn't caught earlier given that we've been running the graph-transformer workflow to test it before, which worries me I'm doing something wrong this time.

jimmymathews · 2024-06-10T14:47:15Z

plugin/graph_processing/README.md

+Graph processing plugins are to be made available as Docker images, built from a Dockerfile following the template provided in `Dockerfile`.
+
+Each plugin should have the following commands available from anywhere in the Docker image:
+* `spt-plugin-print-graph-request-configuration`, which prints to `stdout` the configuration file intended to be used by this plugin to fetch graphs from an SPT instance to use for model training. An empty configuration file and a shell script to do this is provided in this repo, as well as the command needed to make this available in the template `Dockerfile`.


It looks like the Dockerfile uses an ADD instruction to place graph.config into the Docker image of the built plugin, e.g. the cg-gnn plugin. But this file is only a template (empty target_name). How are the contents of graph.config to be specified at runtime for different datasets using the same Docker image?

If the in-built graph.config is not used, and a separate graph.config is meant to be supplied at runtime, then there doesn't seem to be much use for this command: spt-plugin-print-graph-request-configuration.

I forgot to update it when it says "this repo" to "the template" since this was originally the README for this template only. The graph.config file is not used within the Docker images, it's a (suggested default) setting file for the user to use with SPT when extracting the graph files to run the image(s) on.

Ah, yes that is what I thought.
But then why is spt-plugin-print-graph-request-configuration used?

It's so you can get a graph.config file to use. It prints to stdout and then you copy it into your local directory and run spt graph extract with it.

But the file it emits is never directly useable, right? If the idea is to manually edit the emitted template just before doing some run, it seems simpler to use the source code version of graph.config directly from this repository. The main runtime usage of the Docker container, to do its calculation, doesn't use the spt-plugin-print-graph-request-configuration command, if I understand correctly.

I think a similar remark applies to spt-plugin-print-training-configuration. What is the rationale for incorporating some of these development-specific actions into the docker image? Is it to avoid the need to access git source code in addition to a Docker image pull? If this is the case, we can perhaps make these 2:

spt-plugin-print-graph-request-configuration

spt-plugin-print-training-configuration

into commands provided by the template image only, and not indicated as "required" by the plugins themselves (since they are not required for the main action, training/processing).

I put them in at your request a while back. I believe the rationale was to provide a way to get the recommended default settings without having to muck about with the Docker internals / source code.

The training variables cg-gnn wants are different from the training variables graph-transformer wants, and likewise the graph formats plugins want may be different from plugin to plugin (although in practice they're identical so far), so making it a template command would not work.

I see, yeah this seems like the best that can be done now.

CarlinLiao · 2024-06-11T20:09:49Z

Through testing I discovered that the graph-transformer:0.0.1 Docker image was bugged, so I've uploaded a new one using the new Makefile recipe to fix it.

The errors were disqualifying (it lacked necessary apt packages to run cv2 and was not updated to switch to relative references when the script was moved from app/ to /usr/bin/local broke references), so I'm puzzled how we were able to run and upload graph-transformer results beforehand.

CarlinLiao added 4 commits June 5, 2024 00:49

move plugins into this repo

a2ae189

move dockerfiles and add version files

50f055f

add build and push recipes

d90d6e7

add recipe to populate submodules

92a5de9

CarlinLiao added the refactor Code change preserving functionality label Jun 7, 2024

CarlinLiao requested a review from jimmymathews June 7, 2024 14:08

CarlinLiao self-assigned this Jun 7, 2024

hotfix workflow plugin image naming

3f85f44

CarlinLiao added 2 commits June 9, 2024 18:43

Merge branch 'main' into issue323

0cd0580

fix another typo

c1fb36f

jimmymathews reviewed Jun 10, 2024

View reviewed changes

jimmymathews and others added 7 commits June 10, 2024 11:57

Make some changes to plugin development process documentation

370370b

Syntax

4f72922

Expand names of plugins

8c14c4e

Attempt relative link

5d963d0

clarify template after move

430083c

hotfix graph-transformer build

52779d2

hotfix plugin push recipe

1f4831d

Merge branch 'main' into issue323

6e28e9b

jimmymathews merged commit bdaf0de into main Jun 21, 2024
1 check passed

jimmymathews deleted the issue323 branch September 20, 2024 22:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move plugin repos into this main repo #325

Move plugin repos into this main repo #325

CarlinLiao commented Jun 7, 2024

CarlinLiao commented Jun 7, 2024

CarlinLiao commented Jun 7, 2024

jimmymathews Jun 10, 2024

jimmymathews Jun 10, 2024

CarlinLiao Jun 10, 2024

jimmymathews Jun 10, 2024

CarlinLiao Jun 10, 2024

jimmymathews Jun 10, 2024

CarlinLiao Jun 10, 2024

jimmymathews Jun 21, 2024

CarlinLiao commented Jun 11, 2024

Move plugin repos into this main repo #325

Move plugin repos into this main repo #325

Conversation

CarlinLiao commented Jun 7, 2024

CarlinLiao commented Jun 7, 2024

CarlinLiao commented Jun 7, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

CarlinLiao commented Jun 11, 2024