Inefficiencies

Inefficient Processor Selection in EC2 Instances

Compute

Cloud Provider

AWS

Service Name

AWS EC2

Inefficiency Type

Suboptimal Instance Family Selection

Many organizations default to Intel-based EC2 instances due to familiarity or assumptions about workload compatibility. However, AWS offers AMD and Graviton-based alternatives that often deliver significantly better price-performance for general-purpose and compute-optimized workloads.

By not testing workloads across available architectures, teams may continue paying a premium for Intel instances even when no specific performance or compatibility benefit exists. Over time, this results in unnecessary compute spend across development, staging, and even production environments.

Learn more

Underutilized Snowflake Warehouse

Compute

Cloud Provider

Snowflake

Service Name

Snowflake Virtual Warehouse

Inefficiency Type

Underutilized Resource

Underutilized Snowflake warehouses occur when a workload is assigned a larger warehouse size than necessary. For example, a workload that could efficiently execute on a Medium (M) warehouse may be running on a Large (L) or Extra Large (XL) warehouse.This leads to unnecessary credit consumption without a proportional benefit to performance. Underutilization is often driven by early provisioning decisions that were not later reassessed, or by a desire for marginal speed improvements that do not justify the increased operational cost.

Learn more

Inefficient Execution of Repeated Queries

Compute

Cloud Provider

Snowflake

Service Name

Snowflake Query Processing

Inefficiency Type

Inefficient Query Pattern

Inefficient execution of repeated queries occurs when common query patterns are frequently executed without optimization. Even if individual executions are successful, repeated inefficiencies compound overall compute consumption and credit costs.

By analyzing Snowflake's parameterized query metrics, organizations can identify top repeated queries and optimize them for better performance, resource usage, and cost-efficiency.

Learn more

Inefficient Pipeline Refresh Scheduling

Other

Cloud Provider

Snowflake

Service Name

Snowflake Tasks and Pipelines

Inefficiency Type

Inefficient Scheduling

Inefficient pipeline refresh scheduling occurs when data refresh operations are executed more frequently, or with more compute resources, than the actual downstream business usage requires.

Without aligning refresh frequency and resource allocation to true data consumption patterns (e.g., report access rates in Tableau or Sigma), organizations can waste substantial Snowflake credits maintaining underutilized or rarely accessed data assets.

Learn more

Missing or Inefficient Use of Materialized Views

Other

Cloud Provider

Snowflake

Service Name

Snowflake Materialized Views

Inefficiency Type

Inefficient Resource Usage

Inefficiency arises when MVs are either underused or misused.

When high-cost, repetitive queries are not backed by MVs, workloads consume unnecessary compute resources.
When MVs exist but are rarely queried, their background refresh and storage costs accumulate without offsetting savings.

Proper evaluation of workload patterns and strategic use of MVs is critical to achieve a net cost benefit.

Learn more

Suboptimal Query Routing

Other

Cloud Provider

Snowflake

Service Name

Snowflake Query Processing

Inefficiency Type

Suboptimal Query Routing and Warehouse Utilization

Organizations may experience unnecessary Snowflake spend due to inefficient query-to-warehouse routing, lack of dynamic warehouse scaling, or failure to consolidate workloads during low-usage periods. Third-party platforms offer solutions to address these inefficiencies:

Sundeck enables highly customizable, SQL-based control over the query lifecycle through user-defined rules (Flows, Hooks, Conditions). Cost optimization techniques include adaptive warehouse routing, instant warehouse suspension, and off-peak consolidation. However, it requires users to maintain optimization logic manually.
Keebo offers a fully automated AI-driven approach, dynamically tuning warehouse size, clustering, and memory configurations without requiring manual query intervention. It prioritizes minimal operational effort with continuous background optimization.

Choosing between these solutions depends heavily on the organization's internal capabilities and desired balance between control and automation.

Learn more

Excessive Auto-Clustering Costs from High-Churn Tables

Other

Cloud Provider

Snowflake

Service Name

Snowflake Automatic Clustering Service

Inefficiency Type

Inefficient Configuration

Excessive Auto-Clustering costs occur when tables experience frequent and large-scale modifications ("high churn"), causing Snowflake to constantly recluster data. This leads to significant and often hidden compute consumption for maintenance tasks, especially when table structures or loading patterns are not optimized. Poor clustering key choices, unordered data loads, or frequent full-table replacements are common drivers of unnecessary Auto-Clustering activity.

Learn more

Retention of Unused Data in Snowflake Table

Storage

Cloud Provider

Snowflake

Service Name

Snowflake Tables

Inefficiency Type

Excessive Data Retention

Retention of stale data occurs when old, no longer needed records are preserved within active Snowflake tables. Without lifecycle policies or regular purging, tables accumulate outdated data.

Because Snowflake’s compute charges are tied to how much data is scanned, retaining large volumes of inactive or irrelevant data can drive up both storage and query execution costs unnecessarily.

Learn more

Inefficient Snowpipe Usage Due to Small File Ingestion

Other

Cloud Provider

Snowflake

Service Name

Snowflake Snowpipe

Inefficiency Type

Inefficient Data Ingestion

Ingesting a large number of small files (e.g., files smaller than 10 MB) using Snowpipe can lead to disproportionately high costs due to the per-file overhead charges. Each file, regardless of its size, incurs the same overhead fee, making the ingestion of numerous small files less cost-effective. Additionally, small files can increase the load on Snowflake's metadata and ingestion infrastructure, potentially impacting performance.

Learn more

Excessive Snapshot Storage from High-Churn Snowflake Tables

Storage

Cloud Provider

Snowflake

Service Name

Snowflake Snapshots

Inefficiency Type

Inefficient Storage Usage

Snowflake automatically maintains previous versions of data when tables are modified or deleted. For tables with high churn—meaning frequent INSERT, UPDATE, DELETE, or MERGE operations—this can cause a significant buildup of historical snapshot data, even if the active data size remains small.

This hidden accumulation leads to elevated storage costs, particularly when Time Travel retention periods are long and data change rates are high. Often, teams are unaware of how much snapshot data is being stored behind the scenes.

Learn more

There are no inefficiency matches the current filters.