1. Before we get started
I assume you have:
- knowledge of how to use ssh to connect to remote servers, via the terminal on Mac or via PuTTY on Windows
- knowledge of at least one of vim, emacs, or nano, for modifying files on the command line
- an account for UConn’s HPC
2. What’s special about parallelizing on a cluster
When we run jobs only on our local machine, we are, in computer science
terminology, using cores on a single node. Such parallelization
can be done easily via packages like parallel on Unix systems, and
doParallel on Windows. The parallel package, however, when used on a
cluster the same way as on a local machine, does not cross the node
barrier. No matter how many cores you request, your code will still
run only on cores within a single node.
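
To make the single-node point concrete, here is a minimal sketch in R, assuming the parallel package is available (it ships with base R). The function slow_square and the worker count of 2 are hypothetical choices for illustration only. The sketch starts a local cluster, runs a small job, and asks each worker which machine it is running on.

```r
library(parallel)

# Number of cores visible on the current node (can be NA on some platforms).
n_cores <- detectCores()
if (is.na(n_cores)) n_cores <- 1L

# Hypothetical workload for illustration: square a number.
slow_square <- function(x) {
  x^2
}

# Passing a plain worker count to makeCluster() starts every worker
# on the local machine, i.e. on the current node only.
cl <- makeCluster(min(2L, n_cores))
results <- parLapply(cl, 1:8, slow_square)

# Ask each worker for the name of the machine it runs on.
worker_hosts <- unlist(clusterCall(cl, function() Sys.info()[["nodename"]]))
stopCluster(cl)

# All workers report the same host name.
print(unique(worker_hosts))
```

Run this way, every worker reports the same node name, whether the script is started on a laptop or inside a cluster job that was allocated several nodes.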