Knowledge Base Resources

Contributed by cyberinfrastructure professionals (researchers, research computing facilitators, research software engineers and HPC system administrators), these resources are shared through the ConnectCI community platform. Add resources you find helpful!

Implementing Markov Processes with Julia

Markov Decision Processes in Julia

The following link provides an easy way to implement Markov Decision Processes (MDPs) in the Julia programming language. MDPs are a mathematical framework for sequential decision making in stochastic environments where the actor has some level of control. For example, at a low level, MDPs can be used to control an inverted pendulum, while in higher-level decision making they can decide when to take evasive action in air traffic management. MDPs can also be extended to the partially observable domain to form Partially Observable Markov Decision Processes (POMDPs). The link contains a wealth of information showing how to implement basic MDP and POMDP models and apply well-known online and offline solvers.
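To make the idea of solving an MDP concrete, here is a minimal sketch of value iteration, one classic offline solution method. It is written in plain Python rather than Julia and does not use the linked package. The two-state model, its probabilities and rewards, and every name in it (STATES, ACTIONS, TRANSITIONS, q_value, value_iteration) are hypothetical and chosen only for illustration.

```python
# Hypothetical two-state, two-action MDP, invented for illustration only.
STATES = ["low", "high"]
ACTIONS = ["wait", "work"]

# TRANSITIONS[state][action] is a list of (probability, next_state, reward).
TRANSITIONS = {
    "low": {
        "wait": [(1.0, "low", 0.0)],
        "work": [(0.7, "high", 1.0), (0.3, "low", -1.0)],
    },
    "high": {
        "wait": [(0.9, "high", 2.0), (0.1, "low", 0.0)],
        "work": [(1.0, "high", 1.0)],
    },
}


def q_value(state, action, values, gamma):
    """Expected discounted return of taking `action` in `state`."""
    return sum(
        prob * (reward + gamma * values[next_state])
        for prob, next_state, reward in TRANSITIONS[state][action]
    )


def value_iteration(gamma=0.9, tol=1e-8, max_iter=10_000):
    """Repeat the Bellman optimality update until the values stop changing."""
    values = {s: 0.0 for s in STATES}
    for _ in range(max_iter):
        new_values = {
            s: max(q_value(s, a, values, gamma) for a in ACTIONS)
            for s in STATES
        }
        delta = max(abs(new_values[s] - values[s]) for s in STATES)
        values = new_values
        if delta < tol:
            break
    # The greedy policy picks the best action under the converged values.
    policy = {
        s: max(ACTIONS, key=lambda a: q_value(s, a, values, gamma))
        for s in STATES
    }
    return values, policy


values, policy = value_iteration()
print(policy)
```

In this toy model the greedy policy is to work in the low state and wait in the high state. A POMDP solver would additionally maintain a belief over states, which this sketch does not attempt.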

ai machine-learning julia

Type: tool

Oak Ridge Leadership Computing Facility (OLCF) Training Events and Archive

Upcoming training events and archives of training materials detailing general HPC best practices as well as how to use OLCF resources and services.

training

Type: learning

CMake Tutorials

CMake is an open-source, cross-platform tool for managing the build process of software. This tutorial takes you through how to use CMake, starting from the very basics, with example projects.

training compiling

Type: learning

Thrust resources

Thrust is a C++ parallel algorithms library for CUDA that provides high-level primitives such as sort and reduce, so you do not have to write GPU kernels by hand. The Thrust tutorial is great for beginners, and the documentation is helpful for anyone using Thrust.

parallelization gpu resources

Type: learning

Discover Data Science

Discover Data Science connects prospective students with educational opportunities in a new and fast-growing field: data science.

data-analysis workforce-development

Type: website

Women in HPC

https://womeninhpc.org

Through collaboration and networking, WHPC strives to bring together women in HPC and technical computing while encouraging women to engage in outreach activities and improve the visibility of inspirational role models.

community-outreach

Type: website

Docker - Containerized, reproducible workflows

Docker Documentation

Docker packages a task and its dependencies into a container, which behaves like a lightweight, scalable alternative to a virtual machine. This is very useful when moving work across computing environments, as it helps ensure reproducibility.

documentation cloud-computing deep-learning

Type: tool

iOS CoreML + SwiftUI Image Classification Model

Document Tutorial

This tutorial teaches, step by step, how to create an image classification model using Core ML in Xcode and integrate it into an iOS app that uses the iPhone camera to scan objects and classify them with the model.

ai machine-learning

Type: documentation

CUDA Toolkit Documentation

If you are working with GPUs in HPC, the NVIDIA CUDA Toolkit is essential. Its documentation, including programming guides and API references, is available at the linked website.

documentation c c++ fortran python

Type: documentation

SciPy Lecture Notes

https://lectures.scientific-python.org/

Comprehensive tutorials and lecture notes covering various aspects of scientific computing with Python and SciPy.

visualization data-analysis machine-learning python

Type: learning

ACCESS Getting Started Quick-Guide

Getting Started Quick-Guide

A step-by-step guide to getting your first allocation of ACCESS computing and storage resources.

ACCESS-account ACCESS-credits allocations-proposal

Type: website

Active inference textbook

Active Inference: The Free Energy Principle in Mind, Brain, and Behavior

This textbook is the first comprehensive treatment of active inference, an integrative perspective on brain, cognition, and behavior used across multiple disciplines, including computational neuroscience, machine learning, artificial intelligence, and robotics. It was published in 2022 and is open access at this time. It should be useful to those who want to understand how the free energy principle is applied to the normative behavior of living organisms and who want to widen their knowledge of sequential decision making under uncertainty.

ai machine-learning neural-networks

Type: learning

EasyBuild Documentation

EasyBuild is a software installation framework that allows administrators to easily build and install software on high-performance computing (HPC) systems. It supports a wide range of software packages, toolchains, and compilers. Supported software is found in the EasyConfigs repository, one of several repositories in the EasyBuild project.

easybuild

Type: documentation

Retirement of R GIS packages rgdal, rgeos, and maptools in 2023

The R GIS packages "rgdal", "rgeos", and "maptools" are set to be archived and no longer supported by the end of 2023. Many other R GIS packages are built on top of them, including "sp" and "raster". The recommended replacement for "sp" is "sf", and the recommended replacement for "raster" is "terra". Below are links to published articles about this transition, along with links to the documentation for the recommended new packages, "sf" and "terra".

Type: documentation

Understanding LLM Fine-tuning

The Ultimate Guide to LLM Fine Tuning: Best Practices & Tools

With the recent rise of LLMs, many businesses are looking for ways to adopt these models and fine-tune them on specific data sets to improve accuracy. Once fine-tuned, these models can be well suited to the specific needs of a company. This site explains when, how, and why models should be fine-tuned, and goes over various strategies for LLM fine-tuning.

big-data training

Type: learning

Research Security Operations Center at IU

Research Security Operations Center

The NSF-funded ResearchSOC helps make scientific computing resilient to cyberattacks and capable of supporting trustworthy, productive research through operational cybersecurity services, training, and information sharing necessary to a community as unique and variable as research and education (R&E). ResearchSOC is a service offering from Indiana University's OmniSOC.

cybersecurity

Type: website

Open-Source Server Virtualization Platform

Proxmox Virtual Environment - Installation

Proxmox Virtual Environment is open-source software for hyper-converged infrastructure. It is a hosted hypervisor that can run operating systems, including Linux and Windows, on x64 hardware.

software-installation

Type: learning

Working with Python on HPC Clusters

This tutorial series and documentation cover using Python on HPC clusters. The specific steps are based on the HOPPER cluster at George Mason University in Fairfax, VA. They should be implementable on most HPC clusters that have the Slurm scheduler, the Environment Modules system for managing packages, and Open OnDemand for web-based GUI access to cluster resources.

pytorch batch-jobs job-submission scheduling slurm modules scripting conda python

Type: documentation

Benchmarking with a cross-platform open-source flow solver, PyFR

What is PyFR and how does it solve fluid flow problems? PyFR is an open-source Computational Fluid Dynamics (CFD) solver that is based on Python and employs the high-order Flux Reconstruction technique. It is designed to solve fluid flow problems on streaming architectures, making it suitable for complex fluid dynamics simulations.

How does PyFR achieve scalability on clusters with CPUs and GPUs? PyFR achieves scalability through distributed-memory parallelism using the Message Passing Interface (MPI). It implements persistent, non-blocking MPI requests using point-to-point (P2P) communication and organizes kernel calls so that local computations proceed while ghost states are exchanged. This design allows PyFR to operate efficiently on clusters with heterogeneous architectures that combine CPUs and GPUs.

Why is PyFR valuable for benchmarking clusters? PyFR's performance has been recognized by its selection as a finalist for the ACM Gordon Bell Prize for High-Performance Computing. It demonstrates strong scaling on unstructured grids by making effective use of low-latency inter-GPU communication. PyFR has been successfully benchmarked with up to 18,000 NVIDIA K20X GPUs on Titan, showcasing its efficiency in handling large-scale simulations.

finite-element-analysis benchmarking parallelization github fluid-dynamics openmpi c++ cuda mpi

Type: tool

Official Python Documentation

Python 3.11.5 Documentation

The official documentation for Python 3.11.5. Python comes with a lot of features built into the language and its standard library, so the documentation is worth consulting as you code.

documentation python

Type: documentation

Introduction to Linux CLI for Researchers

Intro Linux Tutorial for researchers

This video gives researchers and students who have recently received allocations on high-performance computing resources a basic introduction to Linux commands. It covers a few of the most fundamental commands for navigating the system and getting started. If you find this video helpful or would like me to continue this series, let me know!

bash ssh research-facilitation training

Type: learning

Machine Learning in Astrophysics

Machine learning is becoming increasingly important in fields with large data sets, such as astrophysics. astroML is a Python module for machine learning and data mining built on NumPy, SciPy, scikit-learn, Matplotlib, and Astropy, providing a range of statistical and machine learning routines for analyzing astronomical data in Python. In particular, it has loaders for many open astronomical datasets, with examples of how to visualize such large and complicated datasets.

plotting big-data image-processing machine-learning astrophysics

Type: documentation

Bridges-2 Home Page

Bridges 2 Home Page

Landing Page for Bridges-2 information

matlab

Type: website

Why 'N How: Martinos Center for Biomedical Imaging

Why 'N How: Martinos

The Why & How seminar series is designed to introduce research assistants, graduate students, and postdoctoral and clinical fellows – really, anyone who is interested – to the many tools used in medical imaging. These include software tools and most of the major imaging modalities wielded by investigators (MRI, PET, EEG, MEG, optical, TMS and others). As the name of the series suggests, the talks cover both the reasons researchers might need a particular tool and the nuts and bolts of how to apply it. You can watch videos of the overviews below.

image-processing

Type: learning

AI/ML TechLab - Accelerating AI/ML Workflows on a Composable Cyberinfrastructure

This technology lab contains a set of sessions to help a new user start an AI project on the ACES cluster, a composable accelerator testbed at Texas A&M University. You will learn how to create and activate a virtual environment, manipulate and visualize data with Pandas and Matplotlib, use Scikit-learn for linear regression and classification applications, and use PyTorch to create and train a simple image classification model with deep neural networks (DNNs).

ACES documentation TAMU ai visualization deep-learning machine-learning neural-networks login authentication composable-systems gpu nvidia slurm bash modules vim anaconda conda programming python scikit-learn

Type: documentation
