Setting Up Databricks Community Edition

Module: Getting Started with Databricks Community Edition

In this tutorial, you'll learn how to register for the free Databricks Community Edition, set up your first cluster, create a workspace, and write your first notebook using Spark and Python.


1. What is Databricks?

Databricks is a cloud-based platform built on Apache Spark that supports big data analytics and machine learning. It allows you to write code using Python, Scala, SQL, and R in an interactive environment.


2. Registering for Databricks Community Edition

Step 1: Visit the Registration Page

Go to https://community.cloud.databricks.com. Click on "Sign Up".

Step 2: Fill in Your Details
  • Full Name
  • Email Address (can be personal or professional)
  • Company Name (use "Student" or "Self")
  • Job Title and Country
Step 3: Verify Email and Set Password

You will receive a verification link in your email. Click it and set a password for your Databricks account.

Step 4: Log In to the Databricks Portal

After logging in, you will land in the Databricks workspace home screen.


3. Setting Up Your First Cluster

Step 1: Click on "Clusters" from the Sidebar

Navigate to the sidebar and click Clusters. Then click the "Create Cluster" button.

Step 2: Configure Cluster Settings
  • Cluster Name: MyFirstCluster
  • Cluster Mode: Single Node
  • Databricks Runtime Version: Use the default or latest

Click Create Cluster. It will take a few minutes to start.


4. Creating a Project Workspace and Notebook

Step 1: Go to the Workspace Tab

Click on Workspace → Your Username → Click the dropdown icon → Select CreateNotebook

Step 2: Fill Notebook Details
  • Notebook Name: FirstNotebook
  • Default Language: Python
  • Attached to: Select your running cluster
Step 3: Write Your First Spark Code

# Sample Python + Spark code
data = [("Alice", 28), ("Bob", 32)]
df = spark.createDataFrame(data, ["Name", "Age"])
df.show()
    

Click "▶ Run" to execute the cell and see the output.


5. Learning Outcome

By completing this tutorial, you have successfully:

  • Created a Databricks Community Edition account
  • Launched and configured your first Spark cluster
  • Created a notebook and executed code using Python and Spark
  • Learned basic navigation in the Databricks platform
Next Module: Explore how to load and visualize data in a Databricks Notebook.