Missing Partition Pruning in Delta Lake Table Queries
Databases
Cloud Provider
AWS
Service Name
Databricks
Inefficiency Type
Inefficient Configuration

When Delta Lake tables are partitioned by specific columns — such as date, region, or tenant identifier — the query engine can use partition pruning to limit data scans to only the relevant subset of files. However, when queries against these partitioned tables omit filter predicates on partition columns, the engine is forced to perform a full table scan across all partitions. This means the cluster reads every data file in the table regardless of how much data the query actually needs, directly inflating both execution time and Databricks Unit (DBU) consumption.

This pattern is especially common in several scenarios: legacy SQL queries written before tables were partitioned, dynamically generated queries from applications or BI tools that do not incorporate partition column awareness, and ad-hoc exploratory queries by analysts unfamiliar with the table's partitioning strategy. On large time-series datasets, the difference can be dramatic — a query that should scan only a few gigabytes of recent data may instead process terabytes across the entire table history. Because Databricks bills DBUs per second, a query that runs significantly longer due to scanning unnecessary data consumes proportionally more DBUs, compounding the waste across both the Databricks platform charges and the underlying cloud infrastructure costs.

This inefficiency is distinct from tables that lack partitioning entirely. Here, the partitioning infrastructure exists and is correctly configured, but queries fail to leverage it — making the investment in partitioning effectively wasted while still incurring full-scan costs.
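The contrast can be sketched with two illustrative queries (table name `events` and partition column `event_date` are hypothetical), plus a crude lint-style check that flags queries whose WHERE clause never touches a partition column:

```python
# Anti-pattern: no predicate on the partition column, so the engine
# must scan files from every partition of the table.
full_scan = """
SELECT user_id, COUNT(*) AS clicks
FROM events
WHERE action = 'click'
GROUP BY user_id
"""

# Fixed: a filter on the partition column lets Delta prune the scan
# down to only the partitions that match.
pruned = """
SELECT user_id, COUNT(*) AS clicks
FROM events
WHERE event_date >= '2024-01-01'
  AND action = 'click'
GROUP BY user_id
"""

PARTITION_COLUMNS = {"event_date"}  # hypothetical partitioning scheme

def references_partition_column(sql: str, partition_cols=PARTITION_COLUMNS) -> bool:
    """Crude heuristic: does the WHERE clause mention any partition column?

    A real check would parse the SQL; string matching is enough to
    illustrate the idea of auditing generated or legacy queries.
    """
    parts = sql.lower().split("where", 1)
    if len(parts) < 2:
        return False
    return any(col.lower() in parts[1] for col in partition_cols)
```

A check like this could run over a query log or a BI tool's generated SQL to surface full-scan candidates before they hit a large table.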

Inefficient Use of Photon Engine in Azure Databricks
Compute
Cloud Provider
Azure
Service Name
Databricks
Inefficiency Type
Suboptimal Configuration

Photon is optimized for SQL workloads, delivering significant speedups through vectorized execution and native C++ performance. However, Photon only accelerates workloads that use compatible operations and data patterns. If a workload includes unsupported functions, unoptimized joins, or falls back to interpreted execution, Photon may be silently bypassed — even on a Photon-enabled cluster. In this case, users are billed at a premium DBU rate while receiving no meaningful speed or efficiency gain. This inefficiency typically arises when teams enable Photon globally without validating workload compatibility or updating their pipelines to follow Photon best practices. The result is higher costs with no corresponding benefit — a classic case of configuration drift outpacing optimization discipline.
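One lightweight way to detect silent fallback is to inspect a query's physical plan: operators executed by Photon carry a `Photon` prefix (e.g. `PhotonProject`), while operators that fell back to the JVM engine keep their standard Spark names. A minimal sketch of that heuristic follows; the plan strings are illustrative, not real `EXPLAIN` output:

```python
import re

def photon_coverage(plan: str) -> float:
    """Rough heuristic: fraction of physical-plan operators whose name
    starts with 'Photon'. Low coverage on a Photon-enabled cluster
    suggests the workload is paying premium DBUs without the speedup."""
    operators = re.findall(r"\b([A-Z][A-Za-z]+)\(", plan)
    if not operators:
        return 0.0
    photon_ops = [op for op in operators if op.startswith("Photon")]
    return len(photon_ops) / len(operators)

# Illustrative plan fragments (hypothetical, simplified):
mostly_photon = "PhotonGroupingAgg(keys=[x]) PhotonProject(x) PhotonScan(delta)"
fallback_heavy = "HashAggregate(keys=[x]) Project(x) PhotonScan(delta)"
```

Running a coverage check like this across representative queries, before enabling Photon fleet-wide, is one way to validate that the premium rate actually buys vectorized execution.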

Lack of Functional Cost Attribution in Databricks Workloads
Other
Cloud Provider
Databricks
Service Name
Databricks
Inefficiency Type
Visibility Gap

Databricks cost optimization begins with visibility. Unlike traditional IaaS services, Databricks operates as an orchestration layer spanning compute, storage, and execution — but its billing data often lacks granularity by workload, job, or team. This creates a visibility gap: costs fluctuate without clear root causes, ownership is unclear, and optimization efforts stall due to lack of actionable insight. When costs are not attributed functionally — for example, to orchestration (query/job DBUs), compute (cloud VMs), storage, or data transfer — it becomes difficult to pinpoint what’s driving spend or where improvements can be made. As a result, inefficiencies persist not due to a single misconfiguration, but because the system lacks the structure to surface them.
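The functional attribution described above can be sketched as a simple roll-up of billing records by workload and cost category. The record shape here is hypothetical; in practice the rows might come from a source such as Databricks system billing tables or a cloud provider's cost export:

```python
from collections import defaultdict

# Hypothetical billing records; real data would carry tags or custom
# attributes identifying the owning workload, job, or team.
records = [
    {"workload": "etl_daily", "category": "orchestration", "cost": 120.0},
    {"workload": "etl_daily", "category": "compute",       "cost": 340.0},
    {"workload": "bi_dash",   "category": "compute",       "cost": 90.0},
    {"workload": "bi_dash",   "category": "storage",       "cost": 15.0},
]

def attribute_costs(records):
    """Roll spend up by (workload, functional category) so each line of
    the bill has an owner and a cost driver."""
    totals = defaultdict(float)
    for r in records:
        totals[(r["workload"], r["category"])] += r["cost"]
    return dict(totals)
```

Even this minimal structure answers the questions the entry raises: which workload drives spend, and whether the driver is orchestration DBUs, compute, storage, or transfer.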
