From 450bd3e6b17c0f777eb7303f030040838641deb1 Mon Sep 17 00:00:00 2001 From: Serguei Mokhov Date: Mon, 3 Jun 2024 15:43:38 -0400 Subject: [PATCH] [manual][HTML] publish interim 7.2-dev-03 --- doc/web/index.html | 102 +++++++++++++++++++++++++++------------------ 1 file changed, 62 insertions(+), 40 deletions(-) diff --git a/doc/web/index.html b/doc/web/index.html index d5bd9c7..5a2f9b8 100644 --- a/doc/web/index.html +++ b/doc/web/index.html @@ -24,7 +24,7 @@

Speed: The GCS ENCS Cluster


Concordia University
Montreal, Quebec, Canada
rt-ex-hpc~AT~encs.concordia.ca

-
Version 7.2-dev-02
+
Version 7.2-dev-03

The group acknowledges the initial manual version VI produced by Dr. Scott Bunnell while with us as well as Dr. Tariq Daradkeh for his instructional support of the users and contribution of examples.
@@ -162,7 +162,9 @@

1.2
  • Gillian Roper, Senior Systems Administrator, HPC, AITS
  • -
  • Carlos Alarcón Meza, Systems Administrator, HPC and Networking, AITS
  • +
  • Carlos Alarcón Meza, Systems Administrator, HPC and Networking, AITS +
  • +
  • Farah Salhany, IT Instructional Specialist, AITS
  • We receive support from the rest of AITS teams, such as NAG, SAG, FIS, and DOG.
    https://www.concordia.ca/ginacody/aits.html

    @@ -268,8 +270,8 @@

    1.5

    1.6 Available Software

    We have a great number of open-source software available and installed on “Speed” – various Python, CUDA versions, C++/Java compilers, OpenGL, OpenFOAM, OpenCV, TensorFlow, -OpenMPI, OpenISS, MARF [24], etc. There are also a number of commercial packages, subject to -licensing contributions, available, such as MATLAB [1323], Abaqus [1], Ansys, Fluent [2], +OpenMPI, OpenISS, MARF [26], etc. There are also a number of commercial packages, subject to +licensing contributions, available, such as MATLAB [1325], Abaqus [1], Ansys, Fluent [2], etc.

    To see the packages available, run ls -al /encs/pkg/ on speed.encs. In particular, there are over 2200 programs available in /encs/bin and /encs/pkg under Scientific Linux 7 (EL7). We are @@ -2103,13 +2105,26 @@

    3.3 tracking. In 34th British Machine Vision Conference (BMVC), Aberdeen, UK, November 2023. https://arxiv.org/abs/2309.05829 and https://github.com/goutamyg/MVT +
  • +
  • +
  • Farshad Rezaei and Marius Paraschivoiu. Computational challenges of simulating vertical axis + wind turbine on the roof-top corner of a building. Progress in Canadian Mechanical + Engineering, 6, 1–6 2023. http://hdl.handle.net/11143/20861 +
  • Belkacem Belabes and Marius Paraschivoiu. CFD modeling of vertical-axis wind turbine wake interaction. Transactions of the Canadian Society for Mechanical Engineering, pages 1–10, 2023. https://doi.org/10.1139/tcsme-2022-0149
  • +
  • Farshad Rezaei and Marius Paraschivoiu. Placing a small-scale vertical axis wind turbine on + roof-top corner of a building. In Proceedings of the CSME International Congress, June 2022. + https://doi.org/10.7939/r3-j7v7-m909 +
  • Belkacem Belabes and Marius Paraschivoiu. CFD study of the aerodynamic performance of a vertical axis wind turbine in the wake of another turbine. In Proceedings of the CSME International Congress, 2022. https://doi.org/10.7939/r3-rker-1746 + + +
  • Belkacem Belabes and Marius Paraschivoiu. Numerical study of the effect of turbulence intensity on VAWT performance. Energy, 233:121139, 2021. https://doi.org/10.1016/j.energy.2021.121139 @@ -2119,33 +2134,33 @@

    3.3 https://doi.org/10.1177/0278364920913945

  • -

    The work “Haotao Lai. An OpenISS framework specialization for deep learning-based +

    The work “Haotao Lai. An OpenISS framework specialization for deep learning-based person re-identification. Master’s thesis, Department of Computer Science and Software Engineering, Concordia University, Montreal, Canada, August 2019. https://spectrum.library.concordia.ca/id/eprint/985788/” using TensorFlow and Keras on OpenISS adjusted to run on Speed based on the repositories:

    - - - -

    and theirs forks by the team. +

    and theirs forks by the team.

  • -

    +

    A History

    -

    +

    A.1 Acknowledgments

    • The first 6 (to 6.5) versions of this manual and early UGE job script samples, Singularity testing and user support were produced/done by Dr. Scott Bunnell during his time at Concordia as a part of the NAG/HPC group. We thank him for his contributions. + + +
    • The HTML version with devcontainer support was contributed by Anh H Nguyen.
    • @@ -2153,18 +2168,15 @@

      A.1 2.15.4.0 other tasks. We have a continued collaboration on HPC/scheduling research.

    - - - -

    +

    A.2 Migration from UGE to SLURM

    -

    For long term users who started off with Grid Engine here are some resources to make a transition +

    For long term users who started off with Grid Engine here are some resources to make a transition and mapping to the job submission process.

    -

    +

    A.3 Phases

    -

    Brief summary of Speed evolution phases. -

    +

    Brief summary of Speed evolution phases. +

    A.3.1 Phase 4
    -

    Phase 4 had 7 SuperMicro servers with 4x A100 80GB GPUs each added, dubbed as “SPEED2”. We +

    Phase 4 had 7 SuperMicro servers with 4x A100 80GB GPUs each added, dubbed as “SPEED2”. We also moved from Grid Engine to SLURM. -

    +

    A.3.2 Phase 3
    -

    Phase 3 had 4 vidpro nodes added from Dr. Amer totalling 6x P6 and 6x V100 GPUs +

    Phase 3 had 4 vidpro nodes added from Dr. Amer totalling 6x P6 and 6x V100 GPUs added. -

    +

    A.3.3 Phase 2
    -

    Phase 2 saw 6x NVIDIA Tesla P6 added and 8x more compute nodes. The P6s replaced 4x of FirePro +

    Phase 2 saw 6x NVIDIA Tesla P6 added and 8x more compute nodes. The P6s replaced 4x of FirePro S7150. -

    +

    A.3.4 Phase 1
    -

    Phase 1 of Speed was of the following configuration: +

    Phase 1 of Speed was of the following configuration:

    • Sixteen, 32-core nodes, each with 512 GB of memory and approximately 1 TB of @@ -2528,13 +2540,13 @@
      It is possible that your job is pending, because the job requested resources that are not available within Speed. To verify why job id 1234 is not running, execute ‘sacct -j 1234’. A summary of the reasons is available via the squeue command. -

      +

      C Sister Facilities

      -

      Below is a list of resources and facilities similar to Speed at various capacities. Depending on your +

      Below is a list of resources and facilities similar to Speed at various capacities. Depending on your research group and needs, they might be available to you. They are not managed by HPC/NAG of AITS, so contact their respective representatives.

      @@ -2553,7 +2565,7 @@

      C
    • -

      There are various Lambda Labs other GPU servers and like computers acquired by individual +

      There are various Lambda Labs other GPU servers and like computers acquired by individual researchers; if you are member of their research group, contact them directly. These resources are not managed by us.

        @@ -2709,11 +2721,21 @@

        C http://aosabook.org/en/bash.html.

        - [23]   Rob Schreiber. MATLAB. Scholarpedia, 2(6):2929, 2007. + [23]   Farshad Rezaei and Marius Paraschivoiu. Placing a small-scale vertical axis wind turbine on + roof-top corner of a building. In Proceedings of the CSME International Congress, June 2022. + https://doi.org/10.7939/r3-j7v7-m909. +

        +

        + [24]   Farshad Rezaei and Marius Paraschivoiu. Computational challenges of simulating vertical axis + wind turbine on the roof-top corner of a building. Progress in Canadian Mechanical Engineering, + 6, 1–6 2023. http://hdl.handle.net/11143/20861. +

        +

        + [25]   Rob Schreiber. MATLAB. Scholarpedia, 2(6):2929, 2007. http://www.scholarpedia.org/article/MATLAB.

        - [24]   The MARF Research and Development Group. The Modular Audio Recognition + [26]   The MARF Research and Development Group. The Modular Audio Recognition Framework and its Applications. [online], 2002–2014. http://marf.sf.net and http://arxiv.org/abs/0905.1235, last viewed May 2015.