AWS CloudTrail enables event logging across AWS services, but when multiple trails are configured to log overlapping events, especially data events, the result can be redundant charges and unnecessary storage or ingestion costs. This commonly occurs in decentralized environments where teams create trails independently, unaware of existing coverage or shared logging destinations.

Each trail that records data events is billed on a per-event basis, even if the same activity is already logged by another trail. Additional costs may also arise from delivering duplicate logs to separate S3 buckets or CloudWatch log groups. While separate trails may be justified for audit, compliance, or operational segmentation, unintentional duplication increases both cost and operational complexity without adding value.
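A quick way to surface this kind of overlap is to enumerate every trail and its event selectors and flag any that record data events. A minimal boto3 sketch (the overlap check here is deliberately coarse; real selectors may scope data events to specific resources, so flagged trails still need manual review):

```python
import boto3

# List trails in the current region and report which ones log data events,
# so overlapping data-event coverage can be spotted and consolidated.
cloudtrail = boto3.client("cloudtrail")

# Exclude shadow trails: get_event_selectors must be called in a trail's home region.
trails = cloudtrail.describe_trails(includeShadowTrails=False)["trailList"]
data_event_trails = []

for trail in trails:
    name = trail["Name"]
    selectors = cloudtrail.get_event_selectors(TrailName=trail["TrailARN"])
    # Classic event selectors: a non-empty DataResources list means data-event logging.
    for sel in selectors.get("EventSelectors", []):
        if sel.get("DataResources"):
            data_event_trails.append((name, sel["DataResources"]))
    # Advanced event selectors: eventCategory = Data also bills per data event.
    for sel in selectors.get("AdvancedEventSelectors", []):
        fields = {f["Field"]: f.get("Equals") for f in sel.get("FieldSelectors", [])}
        if fields.get("eventCategory") == ["Data"]:
            data_event_trails.append((name, fields))

for name, detail in data_event_trails:
    print(f"{name}: logs data events -> {detail}")

if len({name for name, _ in data_event_trails}) > 1:
    print("Multiple trails record data events; check for overlapping coverage.")
```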
Engineers often enable verbose logging (e.g., debug- or trace-level) during development or troubleshooting, then forget to disable it after deployment. This results in elevated log ingestion rates, and therefore costs, even when the detailed logs are no longer needed. Because CloudWatch Logs charges per GB ingested, persistent debug logging in production environments can create silent but material cost increases, particularly for high-throughput services.

In environments with multiple teams or loosely governed log group policies, this issue can go undetected for long periods. Identifying and deactivating unnecessary debug-level logging is a low-risk, high-leverage optimization.
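Forgotten debug logging can often be found by ranking log groups by the IncomingBytes metric and spot-checking the heaviest ones for DEBUG entries. A rough boto3 sketch, assuming the application writes a plain-text "DEBUG" marker into its log lines:

```python
import boto3
from datetime import datetime, timedelta, timezone

logs = boto3.client("logs")
cloudwatch = boto3.client("cloudwatch")

end = datetime.now(timezone.utc)
start = end - timedelta(days=7)

# Rank log groups by bytes ingested over the last week, then spot-check the
# heaviest ones for DEBUG-level entries left over from troubleshooting.
paginator = logs.get_paginator("describe_log_groups")
ingestion = []
for page in paginator.paginate():
    for group in page["logGroups"]:
        name = group["logGroupName"]
        stats = cloudwatch.get_metric_statistics(
            Namespace="AWS/Logs",
            MetricName="IncomingBytes",
            Dimensions=[{"Name": "LogGroupName", "Value": name}],
            StartTime=start,
            EndTime=end,
            Period=7 * 24 * 3600,  # one datapoint covering the whole week
            Statistics=["Sum"],
        )
        total = sum(dp["Sum"] for dp in stats["Datapoints"])
        ingestion.append((total, name))

for total, name in sorted(ingestion, reverse=True)[:10]:
    sample = logs.filter_log_events(logGroupName=name, filterPattern="DEBUG", limit=5)
    if sample["events"]:
        print(f"{name}: {total / 1e9:.2f} GB ingested/week, DEBUG entries present")
```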
In Azure Databricks environments that rely on Private Link for secure networking, it is common to route traffic through multi-tiered network architectures. These often include multiple VNets, Private Link endpoints, or VNet peering across subscriptions between data sources (e.g., ADLS) and the Databricks compute plane. While these architectures may be designed for isolation or compliance, they frequently introduce redundant routing paths that add cost without improving performance: each additional hop can incur duplicated Private Link ingress and egress charges. Without regular review, this creates persistent and unrecognized network inefficiencies tied to Databricks usage.
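One practical check is to inventory Private Link endpoints and group them by the resource they target, so duplicate endpoints to the same data source (e.g., one ADLS account reached from several VNets) stand out. A sketch using the azure-mgmt-network SDK; the subscription ID is a placeholder, and peered subscriptions would each need their own pass:

```python
from collections import defaultdict

from azure.identity import DefaultAzureCredential
from azure.mgmt.network import NetworkManagementClient

# Inventory Private Link endpoints in one subscription and group them by the
# resource they target, so duplicate paths to the same data source stand out.
subscription_id = "<subscription-id>"  # placeholder
client = NetworkManagementClient(DefaultAzureCredential(), subscription_id)

endpoints_by_target = defaultdict(list)
for pe in client.private_endpoints.list_by_subscription():
    for conn in pe.private_link_service_connections or []:
        endpoints_by_target[conn.private_link_service_id].append(pe.name)

for target, endpoints in endpoints_by_target.items():
    if len(endpoints) > 1:
        print(f"{target} is reached via {len(endpoints)} private endpoints: {endpoints}")
```

Multiple endpoints to one target are not automatically wrong (isolation or compliance may require them), but each one found this way is worth justifying explicitly.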
Databricks cost optimization begins with visibility. Unlike traditional IaaS services, Databricks operates as an orchestration layer spanning compute, storage, and execution — but its billing data often lacks granularity by workload, job, or team. This creates a visibility gap: costs fluctuate without clear root causes, ownership is unclear, and optimization efforts stall due to lack of actionable insight. When costs are not attributed functionally — for example, to orchestration (query/job DBUs), compute (cloud VMs), storage, or data transfer — it becomes difficult to pinpoint what’s driving spend or where improvements can be made. As a result, inefficiencies persist not due to a single misconfiguration, but because the system lacks the structure to surface them.
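One way to start closing this gap is to attribute DBU usage by SKU and tag from the system.billing.usage system table, where system tables are enabled on the workspace. A sketch using the databricks-sql-connector; the connection details and the "team" tag key are placeholders for illustration, not a prescribed convention:

```python
from databricks import sql  # databricks-sql-connector

# Attribute the last 30 days of DBU usage by SKU and a cost-allocation tag,
# assuming system tables are enabled. The "team" tag key is hypothetical.
query = """
SELECT
  sku_name,
  custom_tags['team'] AS team,   -- hypothetical cost-allocation tag key
  SUM(usage_quantity) AS dbus,
  usage_unit
FROM system.billing.usage
WHERE usage_date >= current_date() - INTERVAL 30 DAYS
GROUP BY sku_name, custom_tags['team'], usage_unit
ORDER BY dbus DESC
"""

with sql.connect(
    server_hostname="<workspace-host>",    # placeholder
    http_path="<warehouse-http-path>",     # placeholder
    access_token="<personal-access-token>",
) as conn:
    with conn.cursor() as cursor:
        cursor.execute(query)
        for sku, team, dbus, unit in cursor.fetchall():
            print(f"{sku} / {team or 'untagged'}: {dbus:.1f} {unit}")
```

Rows landing in the "untagged" bucket are themselves a useful signal: they show exactly how much spend has no owner and where tagging enforcement should start.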
CloudWatch log groups often persist long after their usefulness has expired. In some cases, they are associated with applications or resources that are no longer active. In other cases, the systems may still be running, but the log data is no longer being reviewed, analyzed, or used by any team. Regardless of the reason, retaining logs that no one is monitoring or using results in unnecessary storage costs. If log data is not needed for operational visibility, debugging, compliance, or auditing purposes, it should either be deleted or managed with a shorter retention policy.
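Candidates are usually easy to enumerate: log groups with no retention policy, or with no recent log activity. A boto3 sketch, where the 90-day staleness threshold and 30-day retention are illustrative values:

```python
import boto3
from datetime import datetime, timedelta, timezone

logs = boto3.client("logs")
cutoff_ms = int((datetime.now(timezone.utc) - timedelta(days=90)).timestamp() * 1000)

# Flag log groups that retain logs forever or have received no recent events.
paginator = logs.get_paginator("describe_log_groups")
for page in paginator.paginate():
    for group in page["logGroups"]:
        name = group["logGroupName"]
        stored_gb = group.get("storedBytes", 0) / 1e9

        if "retentionInDays" not in group:
            print(f"{name}: retention never expires, {stored_gb:.2f} GB stored")
            # Uncomment to apply a retention policy instead of keeping logs forever:
            # logs.put_retention_policy(logGroupName=name, retentionInDays=30)

        # Check recent activity via the most recently updated stream. Note that
        # lastEventTimestamp is eventually consistent and may lag slightly.
        streams = logs.describe_log_streams(
            logGroupName=name, orderBy="LastEventTime", descending=True, limit=1
        )["logStreams"]
        last = streams[0].get("lastEventTimestamp") if streams else None
        if last is None or last < cutoff_ms:
            print(f"{name}: no events in 90+ days, candidate for deletion")
```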