Executive summary
Size the commitment to evidenced usage, pick the right committed use discount, and price egress before you sign, and the Google Cloud bill follows. Google Cloud sells consumption wrapped in two discount mechanics, committed use discounts and an enterprise spend commitment, and most overspend comes from a commitment set above real usage or a discount type chosen for the wrong reason. This guide works through how Google prices, how committed use discounts differ, how to size the enterprise commitment, where BigQuery and egress hide cost, and the renewal calendar that puts the buyer in front.
Read this as a working sequence rather than a reference. The consumption levers and the contract levers reinforce one another, so a team that measures and forecasts before the commitment window arrives at the table with a lower, defensible baseline and a stronger case for the rate. The order is the point: measure, forecast, structure, then negotiate.
Our advisors negotiate cloud agreements on the buyer side only. Across more than 500 enterprise engagements, the buyers we advise have negotiated over $2.4 billion in software and cloud contracts at an average saving near 38 percent, and our audit defence work averages a 72 percent reduction against the initial claim. The figures below summarize that record and the public mechanics that frame a Google Cloud deal.
1. How Google Cloud prices, and where the discount sits
Google Cloud publishes a price list, and few enterprises pay it on volume. The published rate is the anchor, and the real number depends on which discount mechanics you apply and how accurately you size them. Three layers sit on top of list pricing, and they do not stack the way buyers assume.
The first layer is the sustained use discount, which Google applies automatically to certain Compute Engine machine types the longer they run in a month, with no commitment. The second layer is the committed use discount, where you trade a one-year or three-year commitment for a deeper published reduction. The third layer is the enterprise spend commitment, a negotiated multi-year contract where you commit to a total dollar figure in exchange for terms that sit on top of the published discounts.
Read the proposal the way it was built
The account team is measured on committed consumption and growth, not on your efficiency, so the first proposal tends to anchor a large three-year commitment against an optimistic growth forecast. Settle the structural terms first, the commitment size, the discount mechanics, egress treatment, and support, and treat the headline discount as the last move. A commitment sized to a forecast you cannot meet is the most expensive mistake in a Google Cloud contract, because the shortfall is yours to absorb.
2. Committed use discounts: resource-based against flexible
The committed use discount is the central lever, and Google sells it in two shapes that behave differently. A resource-based CUD commits to a specific quantity of vCPU and memory in a single region for a deeper discount. A spend-based, or flexible, CUD commits to an hourly dollar amount across eligible Compute Engine families and regions for a smaller discount and far more freedom to move workloads.
Google publishes resource-based commitment discounts of up to about 37 percent for a one-year term and up to about 55 percent for a three-year term on general-purpose machine types, with higher bands on memory-optimized families. Google publishes flexible commitment discounts near 28 percent for one year and near 46 percent for three years. The gap between the two is the price of portability, so the choice is a forecast question before it is a discount question.
Commitment term is the second axis, and it deserves the same scrutiny as the discount type. A three-year term carries the deeper published reduction, but it also locks your position for three years against a platform that changes machine families and regional pricing across that span. Where the workload is mature and stable, the three-year term usually pays. Where the architecture is still moving, a one-year term keeps the deeper reduction within reach next year while protecting you from a stranded commitment this year. Stagger commitments across renewal dates rather than landing them all on one cliff, so a single forecast miss does not strand the whole estate at once.
| CUD type | What it commits | Published discount, indicative | Where it wins |
|---|---|---|---|
| Resource-based | vCPU and memory in one region | Up to about 37% (1 yr), up to about 55% (3 yr) | Stable, predictable workloads in a fixed region |
| Flexible (spend-based) | Hourly spend across eligible families | About 28% (1 yr), about 46% (3 yr) | Workloads that shift family, size, or region |
The mistake we see most often is a large resource-based commitment placed on a workload that later moves region or changes machine family, leaving the commitment stranded against capacity the team no longer uses. Where the architecture is still settling, the smaller flexible discount is usually the cheaper position once the risk of a stranded commitment is priced in.
Insider note. Resource-based and flexible CUDs can be combined, with a flexible commitment covering the moving parts and a resource-based commitment on the stable core. Confirm the current published discount bands for your exact machine families and regions in writing, because the percentages vary by family and change over time.
3. Sustained use discounts and the automatic layer
Before any commitment, Google applies sustained use discounts automatically to eligible Compute Engine machine types, increasing the reduction as a resource runs for more of the month. Google publishes sustained use reductions reaching up to about 30 percent on qualifying general-purpose instances at full-month usage. This layer needs no negotiation, but it changes the baseline you commit against.
The point for a buyer is that sustained use discounts already lower the effective rate on steady workloads, so a committed use discount is only worth the incremental saving above what sustained use already delivers. Model the commitment against the rate you actually pay after sustained use, not against undiscounted list, or the apparent saving from a CUD will be overstated.
4. The enterprise spend commitment and how to size it
Above the per-resource discounts sits the enterprise spend commitment, a negotiated contract where you commit to a total consumption figure over a multi-year term in exchange for contract terms and, where agreed, additional discounting. This is the document where the multi-year money is won or lost, because the committed figure becomes your floor whether or not you consume it.
Size the commitment to a baseline you can defend with twelve months of consumption history, then add growth you have evidence for, not growth the account team has forecast for you. A commitment set to an aspirational number transfers the risk of the forecast onto your budget, because any shortfall against the commitment is still payable.
Build the baseline from the trailing twelve months of billed consumption, then separate it into the steady core and the variable top. Commit against the steady core, where your confidence is highest, and keep the variable top on flexible discounts or on-demand. A buyer who commits only to what the history proves, and leaves room to add commitment mid-term as confidence grows, captures most of the discount while carrying little of the forecast risk. The account team will press for a single large figure because it books the growth early, so the discipline is to hold the committed number to the evidence.
Marketplace spend as a drawdown lever
Google publishes that eligible Google Cloud Marketplace purchases draw down an enterprise spend commitment. Routing qualifying third-party software through Marketplace can satisfy part of the committed figure while consolidating billing under one agreement, which gives a buyer a second way to meet the commitment without over-buying raw infrastructure. Eligibility varies by product, so confirm in writing which Marketplace purchases count before you set the commitment size.
Insider note. The commitment shortfall clause is the term that matters most. Negotiate how an under-consumption is treated, whether unused commitment can carry forward, and whether Marketplace and committed use discount spend both count toward the figure. These mechanics decide your real exposure far more than the headline discount does.
Sizing a Google Cloud commitment or comparing one-year against three-year terms? Our advisors model the baseline and the drawdown with you.
Cloud Contract Negotiation5. BigQuery editions and the on-demand trap
BigQuery is where analytics cost concentrates, and it prices compute and storage separately, the same split that governs the rest of the platform. Compute runs either on-demand, billed per tebibyte scanned, or through editions, billed per slot-hour with autoscaling. Storage is billed per gibibyte per month with an active tier and a cheaper long-term tier.
Google publishes on-demand BigQuery compute near $6.25 per tebibyte scanned in US regions, with the first tebibyte each month free. On-demand is simple and unpredictable, because a single unbounded query can scan far more than expected. Editions move you to a slot model, where Google publishes Standard, Enterprise, and Enterprise Plus at rising per-slot-hour rates, and one-year or three-year slot commitments lower the rate further.
| Option | How it bills | Published rate, indicative |
|---|---|---|
| On-demand | Per tebibyte scanned | About $6.25 per TiB, first 1 TiB free monthly |
| Standard edition | Per slot-hour, autoscaling | About $0.04 per slot-hour |
| Enterprise edition | Per slot-hour, plus commitments | About $0.06 per slot-hour on-demand |
| Enterprise Plus edition | Per slot-hour, plus commitments | About $0.10 per slot-hour on-demand |
The decision between on-demand and editions is a usage-pattern question. Spiky, low-volume querying often costs less on-demand. Steady, heavy querying almost always costs less on an edition with slot commitments, because the per-slot rate falls with the one-year or three-year commitment. Measure your scan volume and concurrency for a full month before you choose, because the wrong model can double the analytics bill.
Storage and the long-term tier
Google publishes BigQuery active storage near $0.02 per gibibyte per month and long-term storage near $0.01 per gibibyte per month, with the long-term rate applied automatically to data not modified for 90 days, and the first 10 gibibytes each month free. Partitioning and clustering reduce the data scanned per query, which lowers on-demand cost directly, so table design is a cost control, not only a performance choice.
One reliable saving sits between the two compute models. A baseline of steady querying can run on an edition with a slot commitment, while unpredictable spikes overflow to on-demand or to autoscaling slots above the commitment. This split captures the committed rate on the predictable base without paying a commitment for capacity that sits idle between spikes. Set a query cost control, such as a maximum bytes-billed limit, so a single unbounded scan cannot run up a surprise on the on-demand portion. Measured monthly, the mix of committed slots and on-demand overflow is the lever that keeps the analytics bill flat while query volume grows.
6. Network egress, the cost that hides until deployment
Egress is the line that surprises buyers, because it is invisible in a list-price comparison and large in a data-heavy architecture. Egress is data leaving Google Cloud to the internet or moving across regions, and Google publishes internet egress rates that commonly fall between about 8 and 12 US cents per gigabyte depending on destination and monthly volume, with reductions at high volume tiers.
The architectures that generate the most egress are content delivery, data replication across regions, and analytics that move large result sets out of the platform. None of these show up when you compare compute rates, which is why egress belongs in the model at design time and in the contract at negotiation time. A committed-spend agreement is the place to negotiate egress treatment, because egress is rarely discounted by default.
The practical defence is to make egress visible before it is committed. Tag traffic by service and destination so the monthly egress total breaks down by cause, then attack the largest cause first. Edge caching removes repeat internet egress on content that does not change between requests. Co-locating services that talk to each other in the same region removes the cross-region charge entirely. Processing data in place and exporting only the result removes the analytics export cost. None of these require a contract change, and together they shrink the egress figure you then take into the negotiation, which improves both the rate and the credibility of your forecast.
| Pattern | Why it bills | The control |
|---|---|---|
| Internet egress | Data served to external users | Cache at the edge, negotiate a committed rate |
| Cross-region traffic | Replication and multi-region designs | Co-locate chatty services, model before deploying |
| Analytics export | Large result sets leaving the platform | Process in place, export only what is needed |
Insider note. Egress is one of the few costs that a buyer can sometimes get committed into a multi-year agreement at a negotiated rate. Bring a measured egress forecast to the commitment discussion, because an account team that wants the commitment has room to discount egress that it will not offer on a standard order.
7. Vertex AI and committed throughput
Vertex AI pricing has become a board-level line as model usage scales. Google publishes per-unit list rates for Vertex AI models, charged per thousand characters or per token depending on the model, alongside training and prediction compute. The figures move as models change, so the discipline is to measure token or character volume per workload and price it against the current published rate rather than a remembered one.
For steady, high-volume inference, Google offers provisioned throughput, where you reserve capacity for predictable performance and pricing instead of paying purely per call. The trade mirrors the rest of the platform: a commitment buys a better effective rate and removes variability, but a commitment sized above real usage becomes a floor you pay regardless. Size provisioned throughput to measured demand, and keep on-demand for the spiky remainder.
8. Support tiers and the percentage-of-spend question
Google Cloud sells support in tiers, and the higher tiers are priced as a percentage of monthly spend above a published minimum, which means support cost grows with consumption unless it is negotiated. Google publishes Enhanced Support starting near $500 per month plus a percentage of monthly spend, and Premium Support starting near $12,500 per month or a percentage of monthly spend, whichever is higher.
The percentage-of-spend structure is the part to watch, because a growing estate quietly grows the support bill in step. In a committed-spend negotiation, the support percentage and its minimum are negotiable terms, not fixed list items, so they belong in the same conversation as the discount and the egress rate rather than treated as an afterthought.
Match the support tier to the workloads that actually need it rather than buying the top tier across the whole estate. A production environment with a tight recovery requirement justifies a higher tier and a faster response commitment. A development or test environment rarely does. Where an organisation runs several projects under one billing account, the support percentage applies to the combined spend, so splitting non-critical workloads or negotiating a blended rate can remove cost that the default structure would otherwise grow automatically with consumption.
Insider note. Confirm whether the support percentage applies to gross consumption or to consumption net of committed use discounts. The difference is material on a large estate, and the answer should be written into the agreement rather than left to the billing default.
9. The negotiation calendar and the lever order
Google Cloud negotiations reward a calendar, not a meeting. The levers land when you have a consumption baseline, a forecast you can defend, and time before a quarter end concentrates the account team's flexibility. Alphabet reports on a calendar fiscal year ending December 31, so the December quarter and the year end carry the most weight, with the other quarter ends adding pressure. The illustrative index below shows where buyer-side preparation changes the outcome. It is an illustrative index with the prepared position set to 100, not a market benchmark.
Relative negotiating position by preparation stage, illustrative index (prepared = 100)
Preparation, not pressure, moves a Google Cloud outcome. Illustrative index, not a quote.
Begin about 90 days before a commitment term ends. Establish the consumption baseline, model one-year against three-year scenarios for both committed use discounts and the enterprise spend commitment, and settle the structural terms before the headline discount. Never lead with your own renewal deadline, because the deadline is the account team's strongest lever if you reveal it first.
| Clause | What to verify |
|---|---|
| Commitment size | Set to evidenced baseline plus defensible growth, with the shortfall treatment defined |
| CUD type | Resource-based or flexible chosen against the forecast, bands named per family and region |
| Marketplace drawdown | Which Marketplace purchases count toward the committed spend, confirmed in writing |
| Egress | A negotiated egress rate or cap, with a measured forecast attached |
| BigQuery model | On-demand or edition with slot commitments, matched to measured scan volume |
| Support | Percentage and minimum negotiated, gross or net of CUD spend defined |
Our recommendation: size the commitment to evidenced usage rather than a vendor forecast, choose the committed use discount type from the workload's real volatility, model BigQuery and egress before they appear on a bill, and settle support, egress, and the shortfall clause before you discuss the headline percentage. Treat the published rate as an anchor and the commitment mechanics as the place the multi-year money is won.
Sources: Google Cloud public pricing for Compute Engine committed use discounts, sustained use discounts, BigQuery editions and storage, network egress, Vertex AI, and Cloud support tiers, as available at the time of review. Outcome ranges are Atonement Licensing advisory figures, indicative and deal-specific, not a quote.
Related reading: Cloud Licensing hub, Cloud Contract Negotiation, Azure MACC Negotiation Guide, and AWS EDP Negotiation Playbook.