- The first piece is Bright Cluster Manager, which is used to deploy and manage the clustered infrastructure. It provides all cluster software, including the operating system, GPU drivers and libraries, InfiniBand drivers and libraries, MPI middleware, the Slurm scheduler, and so on.
- The second piece is Bright ML, which includes the DL library dependencies for the base operating system; DL frameworks including PyTorch, Theano, TensorFlow, Horovod, Keras, DIGITS, CNTK, and MXNet; and DL libraries including cuDNN, NCCL, and the CUDA toolkit.
- The third piece is the Data Science Provisioning Portal, which was developed by Dell. The portal was created to abstract the complexity of the DL ecosystem by providing a single pane of glass that gives users an interface to get started with their models. The portal includes a spawner for JupyterHub and integrates with:
  - Resource managers and schedulers (Slurm)
  - LDAP for user management
  - DL framework environments (TensorFlow, Keras, MXNet, PyTorch, etc.) from Bright’s module environment, with Python 2, Python 3, and R kernel support
  - TensorBoard
  - Terminal CLI environments
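
As a rough illustration of what a user might do after getting a notebook or terminal session with a framework environment loaded, the sketch below reports which DL frameworks are visible to the current Python interpreter. It is not part of the portal or of Bright ML. The function name `available_frameworks` and the mapping `FRAMEWORK_MODULES` are hypothetical, and the import names are assumptions about how each framework is packaged for Python.

```python
import importlib.util

# Assumed Python import names for some of the frameworks listed above.
FRAMEWORK_MODULES = {
    "PyTorch": "torch",
    "TensorFlow": "tensorflow",
    "Keras": "keras",
    "MXNet": "mxnet",
    "Horovod": "horovod",
}


def available_frameworks(modules=FRAMEWORK_MODULES):
    """Return {framework name: bool} by locating each package without importing it."""
    status = {}
    for name, module in modules.items():
        try:
            status[name] = importlib.util.find_spec(module) is not None
        except (ImportError, ValueError):
            # Treat a broken or half-initialized package as unavailable.
            status[name] = False
    return status


if __name__ == "__main__":
    for name, found in available_frameworks().items():
        print(f"{name:<12} {'available' if found else 'not found'}")
```

Running this in a kernel or terminal before and after loading a framework environment should show the difference in what the interpreter can find, assuming the environment works by extending the Python search path.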