Setting Up Databricks Community Edition
Module: Getting Started with Databricks Community Edition
In this tutorial, you'll learn how to register for the free Databricks Community Edition, set up your first cluster, create a workspace, and write your first notebook using Spark and Python.
1. What is Databricks?
Databricks is a cloud-based platform built on Apache Spark that supports big data analytics and machine learning. It allows you to write code using Python, Scala, SQL, and R in an interactive environment.
2. Registering for Databricks Community Edition
Step 1: Visit the Registration Page
Go to https://community.cloud.databricks.com. Click on "Sign Up".
Step 2: Fill in Your Details
- Full Name
- Email Address (can be personal or professional)
- Company Name (use "Student" or "Self")
- Job Title and Country
Step 3: Verify Email and Set Password
You will receive a verification link in your email. Click it and set a password for your Databricks account.
Step 4: Log In to the Databricks Portal
After logging in, you will land in the Databricks workspace home screen.
3. Setting Up Your First Cluster
Step 1: Click on "Clusters" from the Sidebar
Navigate to the sidebar and click Clusters. Then click the "Create Cluster" button.
Step 2: Configure Cluster Settings
- Cluster Name: MyFirstCluster
- Cluster Mode: Single Node
- Databricks Runtime Version: Use the default or latest
Click Create Cluster. It will take a few minutes to start.
4. Creating a Project Workspace and Notebook
Step 1: Go to the Workspace Tab
Click on Workspace → Your Username → Click the dropdown icon → Select Create → Notebook
Step 2: Fill Notebook Details
- Notebook Name: FirstNotebook
- Default Language: Python
- Attached to: Select your running cluster
Step 3: Write Your First Spark Code
# Sample Python + Spark code
data = [("Alice", 28), ("Bob", 32)]
df = spark.createDataFrame(data, ["Name", "Age"])
df.show()
Click "▶ Run" to execute the cell and see the output.
5. Learning Outcome
By completing this tutorial, you have successfully:
- Created a Databricks Community Edition account
- Launched and configured your first Spark cluster
- Created a notebook and executed code using Python and Spark
- Learned basic navigation in the Databricks platform