DATABRICKS lessons

databricks-L1 · DE Pro · Domain 1A — Python & SQL Development

Python and SQL Development on Databricks

Prove you can develop on Databricks, not just describe it. Ship a Databricks Asset Bundle with managed dependencies, a Pandas UDF and an iterator UDF used for the right reasons, and schema-first tests. The gate passes from working artifacts, never from definitions.

Not submitted
RV

Your mentor

Rao Venkataraman

active
I'm Rao — I run the data-engineering bar here. I won't grade prose. Build the bundle, pick the right UDF for the workload, and let the schema test prove it. Ask me anything while you work; I'll point at the gate, not give you the answer.

The deliverable · 8 gates

Databricks Asset Bundle (databricks.yml) with targets
Dependency / environment specification
Pandas (vectorized) UDF with a justified use
Iterator UDF with a justified use
Schema-first test using assertSchemaEqual / assertDataFrameEqual
Naming reconciliation note (legacy ↔ current terms)
Validation + test evidence
Short design explanation

Your work · artifact bundle

0 files
Empty workspace. Add files, upload a folder, or load a demo bundle to start.

Submit the bundle to the kernel. It grades the artifact gate-by-gate — the mentor never gives a verdict.

Kernel evaluation

PASS = ∏ γᵢ over 8 gates · any failed factor ⇒ FAIL
Recognition Filter
Naming Reconciliation
Bundle Structure
Dependency Management
Pandas UDF
Iterator UDF
Testing Discipline
Evidence Completeness

World output

Developer Workbench

Locked until every gate passes. Nothing downstream can attach to an unverified layer.