Colab User Guide

You can use BigDL in Google Colab without installing anything on your local machine.

1. Open a Colab Notebook

BigDL includes a collection of notebooks that can be opened and run directly in Colab. Click ‘Run in Google Colab’ to open a notebook in Colab, then click the “run” triangle to the left of each cell to run that cell. When you run the first cell, a pop-up may appear saying ‘Warning: This notebook was not authored by Google’; click ‘Run Anyway’ to dismiss the warning and continue.

2. Notebook Setup

The first few cells of the notebook contain the code necessary to set up BigDL and other libraries.

Install Java 8

Run the following commands in Google Colab to install JDK 1.8.

# Install jdk8
!apt-get install openjdk-8-jdk-headless -qq > /dev/null
# Set JAVA_HOME so that PySpark can run in the Colab environment.
import os
os.environ["JAVA_HOME"] = "/usr/lib/jvm/java-8-openjdk-amd64"
!update-alternatives --set java /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java
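
To confirm that the Java 8 setup took effect, a check along the following lines may help. This is a minimal sketch and not part of the original notebook: the helper names `parse_java_version`, `is_java_8`, and `check_java` are hypothetical, and the sketch assumes that `java -version` prints a line containing `version "..."`, as OpenJDK builds normally do.

```python
import os
import re
import shutil
import subprocess


def parse_java_version(version_output):
    """Return the quoted version string from `java -version` output, or None."""
    match = re.search(r'version "([^"]+)"', version_output)
    return match.group(1) if match else None


def is_java_8(version):
    """Java 8 reports its version in the legacy 1.8.x scheme."""
    return version is not None and version.startswith("1.8")


def check_java():
    """Return the version of the `java` found on PATH, or None if absent."""
    java_path = shutil.which("java")
    if java_path is None:
        return None
    # `java -version` writes its report to stderr, so read both streams.
    result = subprocess.run(
        [java_path, "-version"], capture_output=True, text=True
    )
    return parse_java_version(result.stderr + result.stdout)


version = check_java()
print("JAVA_HOME:", os.environ.get("JAVA_HOME"))
print("java version:", version)
print("Java 8 active:", is_java_8(version))
```

If the last line prints `False`, the `update-alternatives` step above may not have selected the Java 8 binary, and it is worth re-running that cell.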

Install Conda

Run the code below to install conda on Colab.

# Install Miniconda
!chmod +x
!./ -b -f -p /usr/local

# Update Conda
!conda install --channel defaults conda python=3.7 --yes
!conda update --channel defaults --all --yes

# Append to the sys.path
import sys
_ = (sys.path


Install BigDL

Install the latest pre-release version.

# Install latest pre-release version of BigDL
# Installing BigDL from pip will automatically install all BigDL modules and their dependencies.
!pip install --pre --upgrade bigdl
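
After the install finishes, it can be useful to confirm which version pip actually resolved. The sketch below is an illustration rather than part of the original notebook: `installed_version` is a hypothetical helper, and it assumes the kernel runs Python 3.8 or later, where `importlib.metadata` is in the standard library.

```python
from importlib import metadata


def installed_version(package_name):
    """Return the installed version of a distribution, or None if missing."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


# "bigdl" is the distribution name used in the pip command above.
print("bigdl:", installed_version("bigdl") or "not installed")
```

If the package was installed into a different environment than the one the kernel uses, this reports "not installed" even though pip succeeded.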

Install Python Dependencies

Because the Colab Python environment ships with some built-in Python libraries, you should check that their versions are compatible with your application. You may refer to the compatibility documentation to find the Python library versions that BigDL supports.
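
One way to perform this check is to compare the installed versions against the versions you need. The sketch below is only an illustration under stated assumptions: the package names and version pins in `REQUIRED` are hypothetical placeholders, not versions confirmed by BigDL, and `get_installed` and `find_mismatches` are helper names introduced here. It also assumes Python 3.8 or later for `importlib.metadata`.

```python
from importlib import metadata

# Hypothetical pins for illustration only; replace them with the versions
# listed in the BigDL compatibility documentation for your release.
REQUIRED = {"numpy": "1.21.6", "pandas": "1.3.5"}


def get_installed(package_name):
    """Return the installed version of a distribution, or None if missing."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(required, lookup=get_installed):
    """Map each package whose installed version differs to (found, wanted)."""
    mismatches = {}
    for name, wanted in required.items():
        found = lookup(name)
        if found != wanted:
            mismatches[name] = (found, wanted)
    return mismatches


for name, (found, wanted) in find_mismatches(REQUIRED).items():
    print(f"{name}: installed {found}, expected {wanted}")
    print(f"  suggested fix: !pip install {name}=={wanted}")
```

The comparison is an exact string match, which is the strictest possible check. If a range of versions is acceptable for your application, a looser comparison would be more appropriate.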