Glossary of Cloud Computing Terms
bucket In this context, the term “bucket” denotes any general cloud-based object storage system. A bucket usually has a “policy” that determines who can access the bucket and what actions they can perform (list, read, copy, etc.) on the bucket and its contents. Buckets have many features that make them distinct from file-based / directory storage systems.
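As a rough illustration of what a bucket policy looks like, the sketch below builds a read-only policy in the JSON policy format used by AWS S3. The bucket name `example-bucket` and the account ID `123456789012` are hypothetical placeholders, not values from GeoLab, and the snippet only constructs and prints the document; it does not attach it to any bucket.

```python
import json

# Hypothetical names, used only for illustration.
BUCKET = "example-bucket"
ACCOUNT_ARN = "arn:aws:iam::123456789012:root"

# A read-only policy: the named account may list the bucket
# and read its objects, and nothing else.
policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "ReadOnlyAccess",
            "Effect": "Allow",
            "Principal": {"AWS": ACCOUNT_ARN},
            "Action": ["s3:ListBucket", "s3:GetObject"],
            "Resource": [
                f"arn:aws:s3:::{BUCKET}",    # the bucket itself (for listing)
                f"arn:aws:s3:::{BUCKET}/*",  # every object in the bucket (for reading)
            ],
        }
    ],
}

policy_json = json.dumps(policy, indent=2)
print(policy_json)
```

The "who" in the definition above corresponds to `Principal`, and the permitted actions correspond to `Action`.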
binder A free service for deploying reproducible compute environments.
conda A package and environment management system that can be used with any programming language, including Python, R, Java, JavaScript, FORTRAN, etc.
container
CPU
Dask
Docker Docker is a widely used platform that allows developers to bundle applications and their dependencies into portable, isolated units called containers.
EFS The Amazon Elastic File System (EFS) is an AWS service that offers scalable, durable file storage. This is the default storage type associated with everyone’s home directory in GeoLab.
ephemeral
git A widely used version control system that many open-source projects rely on to manage and share their code base. See Using Git.
GitHub A web-based platform that enables software developers to collaborate remotely on software development using the git command line tool. GitHub is a huge resource for open-source collaboration where users can store, manage, and contribute to not only their own projects, but any public-facing project on the platform. See Using Git.
GPU
instance
image
Jupyter Book Jupyter Book is a command-line tool that allows users to create publication-quality PDFs or web-based interactive books from Markdown documents and Jupyter Notebooks.
Jupyter Notebook A Jupyter Notebook is a human-readable document that allows users to create and share reproducible workflows. The notebook may contain some combination of code (typically Python), equations, images, or narrative text that illustrate the data processing steps that were taken.
JupyterHub Developed by the open-source Jupyter community, JupyterHub allows you to deploy an application that provides remote data science environments to different users.
kernel
S3 The Amazon Simple Storage Service (S3) is the AWS version of object storage we most commonly use. It is usually a more cost-effective option than EFS.
server