Skip to content

LASER-UMASS/TacTok

Folders and files

NameName
Last commit message
Last commit date

Latest commit

05686a1 · Jan 9, 2024

History

26 Commits
Feb 25, 2021
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
May 26, 2019
Apr 20, 2022
Jan 9, 2024
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020
Oct 22, 2020

Repository files navigation

TacTok

The TacTok automated Coq proof script synthesis tool

The TacTok technique and its evaluation are described in TacTok: Semantics-Aware Proof Synthesis by Emily First, Yuriy Brun, and Arjun Guha, in Proceedings of the ACM on Programming Languages (PACMPL) Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA) issue, vol. 4, November 2020, pp. 149:1-149:26, doi: 10.1145/3428299.

We have made available a replication package for the results in the paper. It is a VirtualBox VM: here. This includes the result files, scripts for processing the results, pre-trained models, directions and code for running the pre-trained models, and training examples used in the paper. The VM does not support training models.

The following are the directions for installation and use of TacTok.

1. Installation

TacTok operates within the CoqGym learning environment and so modifies their code. The following are the dependencies and directions to install CoqGym:

Dependencies

Building Coq, SerAPI, CoqHammer, and the CoqGym Coq Projects

  1. Create an OPAM switch for OCaml 4.07.1+flambda: opam switch create 4.07.1+flambda && eval $(opam env)
  2. Upgrade the installed OPAM packages (optional): opam upgrade && eval $(opam env)
  3. Clone the repository
  4. Install Coq, SerAPI and CoqHammer: cd TacTok && source install.sh
  5. Build the Coq projects (can take a while): cd coq_projects && make && cd ..
  6. Create and activate the conda environment: conda env create -f coq_gym.yml && conda activate coq_gym

2. Extracting proofs from Coq projects

For any Coq project that compiles in Coq 8.9.1 that you want to use (and may not be in the CoqGym dataset), the following are the steps to extract the proofs from code:

  1. Copy the project into the coq_projects directory.
  2. For each *.meta file in the project, run python check_proofs.py --file /path/to/*.meta
    This generates a *.json file in ./data/ corresponding to each *.meta file. The proofs field of the JSON object is a list containing the proof names.
  3. For each *.meta file and each proof, run:
    python extract_proof.py --file /path/to/*.meta --proof $PROOF_NAME
    python extract_synthetic_proofs.py --file /path/to/*.meta --proof $PROOF_NAME
  4. Finally, run python postprocess.py

3. Using the CoqGym benchmark dataset

Download the CoqGym dataset

To download the CoqGym dataset, please refer to the CoqGym repo for the latest instructions.

Training Examples (proof steps)

  1. Proofs steps used in the paper are found in processed.tar.gz, which can be downloaded from the replication package link provided above. This should be copied into TacTok/
  2. To extract new proofs, run python extract_proof_steps.py.
  3. To generate new proof steps that have prev_tokens field, run python process_proof_steps.py. This will generate processed/proof_steps.

4. Training TacTok

To train, for example, the Tok model on all the proof steps, run python main.py --no_validation --exp_id tok

Model checkpoints will be saved in TacTok/runs/tok/checkpoints/. See options.py for command line options.

5. Evaluation

Now, you can evaluate a model you trained on the test set. For example, the Tok model that you trained can be run with python evaluate.py ours tok-results --path /path/to/tok_model/*.pth. If you used additional options in training, specify those same options for evaluating.

WeigthedGreedy and WeightedRandom don't require any training. To evaluate these two methods:

  1. Run WeightedGreedy python evaluate.py greedy greedy-results
  2. Run WeightedRandom python evaluate.py weighed random-results These commands run the models for the entire specified test set. You can specify file names, proof names, project/file indices, etc.