Hi-GMAE: Hierarchical Graph Masked Autoencoders

1. Overview

This paper presents Hi-GMAE, a novel multi-scale GMAE framework designed to handle the hierarchical structures within graphs. Diverging from the standard graph neural network (GNN)used in GMAE models, Hi-GMAE modifies its encoder and decoder into hierarchical structures. This entails using GNN at the finer scales for detailed local graph analysis and employing a graph transformer at coarser scales to capture global information.

2. Environment Setup

conda create -n himae python=3.11
conda activate himae
conda install pytorch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 pytorch-cuda=11.8 -c pytorch -c nvidia
conda install pyg -c pyg
pip install ogb
pip install pygsp
pip install scipy
pip install tensorboardX
pip install matplotlib
pip install sortedcontainers
pip install pyg_lib torch_scatter torch_sparse torch_cluster torch_spline_conv -f https://data.pyg.org/whl/torch-2.1.0+cu118.html

3. Running Hi-GMAE

3-1. Running unsupervised graph classification:

conda activate himae
# Running Hi-GMAE tuned hyperparameters for PROTEINS.
sh ./scripts/protein.sh 
# Running Hi-GMAE tuned hyperparameters for COLLAB.
sh ./scripts/collab.sh 
# Running Hi-GMAE tuned hyperparameters for D&D.
sh ./scripts/dd.sh

3-2. Running transfer learning on molecular classification task:

conda activate himae
cd transfer_learning
# Pretraining Hi-GMAE on ZINC15.
python pretraining.py
# Finetuning Hi-GMAE on MoleculeNet datasets.
i.e. finetune on BACE
sh ./scripts/bace.sh

3-3. Running transfer learning on regression task:

conda activate himae
cd transfer_learning
i.e. finetune on QM9
sh ./scripts/qm9.sh

4. Experimental Settings

4-1. Unsupervised Learning

Parameter Settings. We use Adam optimizer with $\beta_1 = 0.9$, $\beta_2 = 0.999$, $\epsilon = 1e-8$. Additionally, we use PReLU as our nonlinear activation function. To minimize the introduction of excessive hyper-parameters, we choose to fix the hidden size as 512, coarsening method as JC, recovery epoch as one-quarter of the maximum epoch, and decay ratio as 1.0. For other hyper-parameter selections, we search the coarsening layer in the set ${2, 3}$, coarse ratio in the set ${0.1, 0.2,..., 0.5}$, and mask ratio in the set ${0.1, 0.2,..., 0.6}$.

Training Details. For the pre-training, we use the same encoder type as GraphMAE in the fine-grained layer and GT in the coarse-grained layer. In terms of decoder selection, we also choose the same type of decoder as GraphMAE. At each level, we only utilize a single-layer decoder. For the evaluation, we use a LIBSVM as the classifier with hyper-parameter chosen from {$10^{-3}, 10^{-2}, ..., 1, 10$}. We use 10-fold cross validation with 5 different seeds, reporting the average accuracy and variance across five random seeds as the evaluation metrics.

Dataset	PROTEINS	D&D	NCI	ENZYMES	Mutagencity	IMDB-B	IMDB-M	COLLAB	RDT-B
Mask Ratio	0.6	0.3	0.25	0.3	0.5	0.3	0.3	0.5	0.6
Encoder	GIN+GT	GIN+GT	GIN+GT	GIN+GT	GIN+GT	GIN+GT	GIN+GT	GIN+GT	GCN+GT
Decoder	GIN	GIN	GIN	GIN	GIN	GIN	GIN	GIN	GCN
Num layers	3	1	3	2	3	1	1	1	2
Learning Rate	0.00015	0.00015	0.0001	0.00015	0.00015	0.00015	0.00015	0.00015	0.006
Batch size	32	32	16	32	32	32	32	32	8
Pooling Layer	2	3	2	2	2	2	3	2	3
Pooling Ratio	0.1	0.5	0.2	0.1	0.4	0.3	0.25	0.4	0.2
Recovery Ratio	0.8	0.2	0.5	0.0	0.0	0.0	0.0	0.0	0.7

4-2. Transfer Learning for classification

Parameter Settings. In transfer learning, the CoFi-R strategy is not applied due to the significant time consumption associated with parameter tuning. For the pre-training, we fix the coarsening layer at 2, mask ratio at 0.25, learning rate at 0.001, batch size at 256, and embedding size at 300. We search the coarsening ratio in the set ${0.25, 0.5, 0.75}$. For the fine-tuning, we fix the coarsening layer and learning rate the same as pre-training, and dropout ratio at 0.5. Besides, we search the coarsening ratio from 0.1 to 0.9, and batch size in ${32, 64}$.

Training Details. In transfer learning, we adopt a five-layer GIN as the encoder in the fine-grained layer and a single-layer GT in the coarse-grained layer. For the decoder selection, we employ a single GIN layer at each level. We pre-train the model for 100 epochs. For evaluation, the downstream datasets are split into 80/10/10% for train/validation/test using scaffold-split. We report ROC-AUC scores using ten different random seeds.

Dataset	BBBP	Tox21	ToxCast	SIDER	ClinTox	MUV	HIV	BACE
Batch size	32	32	32	32	32	32	32	32
Pooling Rate	0.8	0.8	0.8	0.4	0.1	0.6	0.5	0.9

4-3. Transfer Learning for regression

Parameter Settings. In the pre-training phase, we use the same settings as in the classification task. For fine-tuning, we keep the coarsening layer and learning rate consistent with those used in pre-training and set the dropout rate to 0.5. Additionally, we search for the optimal coarsening ratio within the range of 0.1 to 0.9, and we set the batch size following the approach outlined in SimSGT.

Training Details. For the CEP and Malaria datasets, we pre-train the model using the GEOM dataset, while for the other datasets, we use ZINC15 for pre-training. The model is trained for 100 epochs in the pre-training phase. For evaluation, we split each downstream dataset into 80/10/10% for training, validation, and testing using scaffold splitting. RMSE and MAE scores are reported based on three different random seeds.

Dataset	CEP	Malaria	QM7	QM8	QM9
Batch size	32	32	32	32	256
Pooling Rate	0.5	0.1	0.3	0.3	0.1

5. Baselines

Infomax:https://github.com/snap-stanford/pretrain-gnns
ContextPred:https://github.com/snap-stanford/pretrain-gnns
AttrMasking:https://github.com/snap-stanford/pretrain-gnns
GCC:https://github.com/THUDM/GCC
GraphCL:https://github.com/Shen-Lab/GraphCL
SimGrace:https://github.com/junxia97/SimGRACE
JOAO:https://github.com/Shen-Lab/GraphCL_Automated
GraphLoG:https://github.com/DeepGraphLearning/GraphLoG
RGCL:https://github.com/lsh0520/rgcl
S2GAE:https://github.com/qiaoyu-tan/S2GAE
GraphMAE:https://github.com/THUDM/GraphMAE
GraphMAE2:https://github.com/thudm/graphmae2
Mole-BERT:https://github.com/junxia97/mole-bert

6. Datasets

Supported datasets:

TUDataset: NCI1, PROTEINS, D&D, IMDB-BINARY, IMDB-MULTI, COLLAB, REDDIT-BINARY
MoleculeNet: BBBP, Tox21, ToxCast, SIDER, ClinTox, MUV, HIV, BACE, Malaria, CEP
Quantum Machine: QM7,QM8,QM9

Unsupervised graph classification datasets mentioned above will be downloaded automatically using PyG's API when running the code.
Dataset for molecular property prediction can be found here. After downloading, unzip it and put it in transfer_learning/datasets

7. Acknowledgments

Hi-GMAE is built using PyG and GraphMAE.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
graph_coarsening		graph_coarsening
graphmae		graphmae
imgs		imgs
scripts		scripts
transfer_learning		transfer_learning
.DS_Store		.DS_Store
README.md		README.md
configs.yml		configs.yml
load_data.py		load_data.py
main_graph.py		main_graph.py
preprocess_data.py		preprocess_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hi-GMAE: Hierarchical Graph Masked Autoencoders

1. Overview

2. Environment Setup

3. Running Hi-GMAE

3-1. Running unsupervised graph classification:

3-2. Running transfer learning on molecular classification task:

3-3. Running transfer learning on regression task:

4. Experimental Settings

4-1. Unsupervised Learning

4-2. Transfer Learning for classification

4-3. Transfer Learning for regression

5. Baselines

6. Datasets

7. Acknowledgments

About

Releases

Packages

Contributors 2

Languages

LiuChuang0059/Hi-GMAE

Folders and files

Latest commit

History

Repository files navigation

Hi-GMAE: Hierarchical Graph Masked Autoencoders

1. Overview

2. Environment Setup

3. Running Hi-GMAE

3-1. Running unsupervised graph classification:

3-2. Running transfer learning on molecular classification task:

3-3. Running transfer learning on regression task:

4. Experimental Settings

4-1. Unsupervised Learning

4-2. Transfer Learning for classification

4-3. Transfer Learning for regression

5. Baselines

6. Datasets

7. Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages