Google's Liquid Cooling

296 points | by giuliomagnifico 12 hours ago

120 comments

ykl 8 hours ago
I once saw an interview with the SVP who oversees Azure datacenter buildout or something like that and a thing that stuck with me was that he said his job got a lot easier when he realized he was no longer in the computer business, he was now in the industrial cooling business.
Reading this article immediately made me think back to that.
jonathaneunice 11 hours ago
It’s very odd when mainframes (S/3x0, Cray, yadda yadda) have been extensively water-cooled for over 50 years, and super-dense HPC data centers have used liquid cooling for at least 20, to hear Google-scale data center design compared to PC hobbyist rigs. Selective amnesia + laughably off-target point of comparison.
[-]
- liquidgecka 9 hours ago
  [bri3d pointed out that I missed an element of this. There is a transfer between rack level and machine level coolant which makes this far less novel than I had initially understood. See their direct comment to this]
  I posed this further down in a reply-to-a-reply but I should call it out a little closer to the top: The innovation here is not “we are using water for cooling”. The innovation here is that they are direct cooling the servers with chillers that are outside of the facility. Most mainframes will use water cooling to get the heat from the core out to the edges where traditional where it can be picked up by the typical heatsink/cooling fans. Even home PCs do this by moving the heat to a reservoir that can be more effectively cooled.
  What Google is doing is using the huge chillers that would normally be cooling the air in the facility to cool water which is directly pumped into every server. The return water is then cooled in the chiller tower. This eliminates ANY air based transfer besides the chiller tower. This is one being done a server or a rack.. its being done on the whole data center all at once.
  I am super curious how they handle things like chiller maintenance or pump failures. I am sure they have redundancy but the system for that has to be super impressive because it can’t be offline long before you experience hardware failure!
  [Edit: It was pointed out in another comment that AWS is doing this as well and honestly their pictures make it way clearer what is happening: https://www.aboutamazon.com/news/aws/aws-liquid-cooling-data...]
  [-]
  - bri3d 8 hours ago
    I don't think this comment is accurate based on the article, although you cite personal experience elsewhere so maybe your project wasn't the one that's documented here?
    > What Google is doing is using the huge chillers that would normally be cooling the air in the facility to cool water which is directly pumped into every server.
    From the article:
    > CDUs exchange heat between coolant liquid and the facility-level water supply.
    Also, I know from attaching them at some point that plenty of mainframes used this exact same approach (water to water exchange with facility water), not water to air to water like you describe in this comment and others, so I think you may have just not had experience there? https://www.electronics-cooling.com/2005/08/liquid-cooling-i... contains a diagram in Figure 1 of this exact CDU architecture, which it claims was in use in mainframes dating back to 1965 (!).
    I also don't think "This eliminates ANY air based transfer besides the chiller tower." is strictly true; looking at the photo of the sled in the article, there are fans. The TPUs are cooled by the liquid loop but the ancillaries are still air cooled. This is typical for water cooling systems in my experience; while I wouldn't be surprised to be wrong (it sure would be more efficient, I'd think!), I've never seen a water cooling system which successfully works without forced air, because there are just too many ancillary components of varying shapes to successfully design a PCB-waterblock combination which does not also demand forced air cooling.
    [-]
    - liquidgecka 8 hours ago
      > > CDUs exchange heat between coolant liquid and the facility-level water supply.
      Oh interesting I missed that when I went through in the first pass. (I think I space bared to pass the image and managed to skip the entire paragraph in between the two images so that’s on me.
      I was running off an informal discussion I had with a hardware ops person several years ago where he mentioned a push to unify cooling and eliminate thermal transfer points since they were one of the major elements of inefficiency in modern cooling solutions. By missing that as I browsed through it I think I leaned too heavily on my assumptions without realizing it!
      Also, not all chips can be liquid cooled so there will always be an element of air cooling so the fans and stuff are still there for the “everything else” cases and I doubt anybody will really eliminate that effectively. The comment you quoted was mostly directed towards the idea that Cray-1 had liquid cooling, it did, but it transferred to air outside of the server which was an extremely common model for most older mainframe setups. It was rare for the heat to be kept liquid along the whole path.
      [-]
      - matt-p 5 hours ago
        The CDUs are essentially just passive water to water heat exchangers with some fancy electronics attached. You want to run a different chemical mix outside to the chillers as you do on the internal loop, it also helps regulate flow/pressure and leak detection with auto cutoff is all fairly essential.
        Running direct on facility water would made day to day operations and maintenance a total pain.
      - avar 5 hours ago
        > not all chips can be > liquid cooled.
        Why not? It's just a heatsink except with water running through cavities within it, instead of a fan sitting on top of the heatsink.
        [-]
        bri3d 4 hours ago
        One of the biggest problems with water cooling, especially on boards that weren’t designed for it, can be passive components which don’t usually have a heatsink and therefore don’t offer a good surface for a water block, but end up in a thermal design which requires airflow - resistors and FETs are common culprits here. Commodity assemblies are also a big problem, with SFPs being a huge pain point in designs I’ve seen.
        The problem is often exacerbated on PCBs designed for air cooling where the clearance between water cooled and air cooled components is not high enough to fit a water block. Usually the solution when design allows is to segment these components into a separate air cooled portion of the design, which is what Google look to have done on these TPU sleds (the last ~third of the assembly looks like it’s actively air cooled by the usual array of rackmount fans).
    - matt-p 5 hours ago
      It's interesting because I've never seen mainframes do water to water (though I'm sure that was possible).
      The only ones I've ever seen do water to compressor (then gas to the outdoor condenser, obviously)
      [-]
      - bri3d 5 hours ago
        Most ES/9000 series and earlier stuff had water to water SKUs. I remember seeing an installation with one that exchanged heat into a facility loop which went through another water to water exchanger to a pond-fountain chiller system.
        Starting with S/390 G4 they did a weird thing where the internals were cooled by refrigeration but the standard SKUs actually had the condenser in the bottom of the cabinet and they required raised floor cooling.
        They brought water to air back with the later zSeries, but the standard SKUs mimicked the S/390 strategy with raised floor. I guess you could buy a z196 or a ec12 with a water to water cabinet but I too have never seen one.
  - nitwit005 9 hours ago
    This was before I was born, so I'm hardly an expert, but I've heard of feeding IBM mainframes chilled water. A quick check of wikipedia found some mention of the idea: https://en.wikipedia.org/wiki/IBM_3090
    [-]
    - ChuckMcM 9 hours ago
      When our mainframe in 1978 sprung a leak in its water cooling jacket it took down the main east/west node on IBMs internal network at the time. :-). But that was definitely a different chilling mechanism than the types Google uses.
    - jauntywundrkind 7 hours ago
      Having to pre chill water (via a refrigeration cycle) is radically less efficient than being able to collect and then disperse heat. It generates considerably more heat ahead of time, to deliver the chilled water. This mode of gathering the heat and sending it out, dealing with the heat after it is produced rather than in advance, should be much more energy efficient.
      I don't know what surprises me about it so much, but having these rack-sized CDU heat-exchangers was quite a surprise, quite novel to me. Having a relatively small closed loop versus one big loop that has to go outside seems like a very big tradeoff, with a somewhat material and space intensive demand (a rack with 6x CDUs), but the fine grained control does seem obviously sweet to have. I wish there were a little more justification for the use of heat exchangers!
      The way water is distributed within the server is also pretty amazing, with each server having it's own "bus bar" of water, and each chip having it's own active electro-mechanical valve to control it's specific water flow. The TPUv3 design where cooling happens serially, each chip in sequence getting hotter and hotter water seems common-ish, where-as with TPUv4 there's a fully parallel and controllable design.
      Also the switch from lidded chips to bare chips, with a cold plate that comes down to just above, channeling water is one of those very detailed fine-grained optimizations that is just so sweet.
  - ChuckMcM 9 hours ago
    Much of the Google use of liquid chillers was protected behind NDAs as part of its "hidden advantage" with respect to the rest of the world. It was the secret behind really low PUE numbers.
    [-]
    - throwaway2037 2 hours ago
      Do we know if other hyperscalers also use liquid chillers to achieve very low PUE values? I think I saw photos from xAI's new data center and there was liquid cooling.
  - ambicapter 9 hours ago
    So every time they plug in a server they also plug in water lines?
    [-]
    - liquidgecka 9 hours ago
      [I am not a current Google Employee so my understanding of this is based on externally written articles and “leap of faith” guestimation]
      Yes. A supply and return line along with power. Though if I had to guess how its setup this would be done with some super slick “it just works” kind of mount that lets them just slide the case in and lock it in place. When I was there almost all hardware replacement was made downright trivial so it could just be more or less slide in place and walk away.
      [-]
      - scrlk 8 hours ago
        You can see the male quick disconnect fittings for the liquid cooling at each corner of the server in this photo:
        https://substackcdn.com/image/fetch/$s_!8aMm!,f_auto,q_auto:...
        Looks like the power connector is in the centre. I'm not sure if backplane connectors are covered up by orange plugs?
        [-]
        Romario77 7 hours ago
        it's not the center one, it's the side ones. Center seems to be a power supply.
      - michaelt 8 hours ago
        Interestingly, entire supercomputers have been decommissioned [1] due to faulty quick disconnects causing water spray.
        So you can get a single, blind-mating connector combining power, data and water - but you might not want to :)
        [1] https://gsaauctions.gov/auctions/preview/282996
      - nielsbot 9 hours ago
        Maybe similar to a gasoline hose breakaway
        https://www.opwglobal.com/products/us/retail-fueling-product...
      - semi-extrinsic 8 hours ago
        https://amphenol-industrial.com/products/liquid-cooling-syst...
        [-]
        tesseract 8 hours ago
        CPC is in this market too https://www.cpcworldwide.com/Liquid-Cooling/Products/Blind-M...
        Non-spill fluid quick disconnects are established tech in industries like medical, chemical processing, beverage dispensing, and hydraulic power, so there are plenty of design concepts to draw on.
    - jayd16 8 hours ago
      Maybe we can declutter things if they get PWoE(power and water over ethernet) or just a USB-W standard.
      [-]
      - Nition 8 hours ago
        It worked for MONIAC.
      - taneq 6 hours ago
        Great, now I’ll have to figure out if my USB Hi-Flow cable is slowing down my USB Full-Flow drink bottle refilling.
        [-]
        toast0 5 hours ago
        It's not. Full flow is less than hi-flow... Your bottle might slow down other fills on the bus though :p
    - ajb 9 hours ago
      I remember reading somewhere that they don't operate at the level of servers; if one dies they leave it in place until they're ready to replace the whole rack. Don't know if that's true now, though.
      It does sound like connections do involve water lines though. As they are isolating different water circuits, in theory they could have a dry connection between heat exchanger plates, or one made through thermal paste. It doesn't sound like they're doing that though.
      [-]
      - liquidgecka 8 hours ago
        It has not been true for a LONG time. That was part of Google early “compute unit” strategy that involved things like sealed containers and such. Turns out that’s not super efficient or useful because you leave large swaths of hardware idle.
        In my day we had software that would “drain” a machine and release it to hardware ops to swap the hardware on. This could be a drive, memory, CPU or a motherboard. If it was even slightly complicated they would ship it to Mountain View for diagnostic and repair. But every machine was expected to be cycled to get it working as fast as possible.
        We did a disk upgrade on a whole datacenter that involved switching from 1TB to 2TB disks or something like that (I am dating myself) and total downtime was so important they hired temporary workers to work nights to get the swap done as quickly as possible. If I remember correctly that was part of the “holy cow gmail is out of space!” chaos though, so added urgency.
        [-]
        throwaway2037 an hour ago
        > part of the “holy cow gmail is out of space!” chaos
        This sounds like an interesting story. Can you share more details.
      - cavisne 8 hours ago
        Definitely not true for these workloads. TPUs are interconnected, one dying makes the whole cluster significantly less useful.
    - jedberg 9 hours ago
      Looks like it. New server means power, internet, and water.
      [-]
      - fudgy73 9 hours ago
        just like humans.
    - Hilift 7 hours ago
      And a 12V battery.
- legulere 10 hours ago
  It's not so surprising when considering googles history coming from inexpensive commodity hardware. It's pretty similar to how it took decades for x86 servers and operating systems to gain mainframe functionality like virtualisation.
  https://blog.codinghorror.com/building-a-computer-the-google...
- spankalee 10 hours ago
  From the article:
  > Liquid cooling is a familiar concept to PC enthusiasts, and has a long history in enterprise compute as well.
  And the trend in data centers was to move towards more passive cooling at the individual servers and hotter operating temperatures for a while. This is interesting because it reverses that trend a lot, and possibly because of the per-row cooling.
  [-]
  - dekhn 10 hours ago
    We've basically been watching Google gradually re-discover all the tricks of supercomputing (and other high performance areas) over the past 10+ years. For a long time, websearch and ads were the two main drivers of Google's datacenter architecture, along with services like storage and jobs like mapreduce. I would describe the approach as "horizontal scaling with statistical multiplexing for load balancing".
    Those style of jobs worked well but as Google has realized it has more high performance computing with unique workload characteristics that are mission-critical (https://cloud.google.com/blog/topics/systems/the-fifth-epoch...) their infrastructure has had to undergo a lot of evolution to adapt to that.
    Google PR has always been full of "look we discovered something important and new and everybody should do it", often for things that were effectively solved using that approach a long time ago. MapReduce is a great example of that- Google certainly didn't invent the concepts of Map or Reduce, or even the idea of using those for doing high throughput computing (and the shuffle phase of MapReduce is more "interesting" from a high performance computing perspective than mapping or reducing anyway).
    [-]
    - liquidgecka 9 hours ago
      As somebody that worked on Google data centers after coming from a high performance computing world I can categorically say that Google is not “re-learning” old technology. In the early days (when I was there) they focused heavily on moving from thinking of computers to thinking of compute units. This is where containers and self contained data centers came from. This was actually a joke inside of Google because it failed but was copied by all the other vendors for years after Google had given up on it. They then moved to stop thinking about cooling as something that happens within a server case to something that happens to a whole facility. This was the first major leap forward where they moved from cooling the facility and pushing conditioned air in to cooling the air immediately behind the server.
      Liquid cooling at Google scale is different than mainframes as well. Mainframes needed to move heat from the core out to the edges of the server where traditional data center cooling would transfer it away to be conditioned. Google liquid cooling is moving the heat completely outside of the building while it’s still liquid. That’s never been done before as far as I am aware. Not at this scale at least.
      [-]
      - mattofak 9 hours ago
        It's possible it never made it into production; but when I was helping to commission a 4 rack "supercomputer" circa 2010 we used APC's in-row cooling (which did glycol exchange to the outside but still maintains the hot/cold aisle) and I distinctly remember reading a whitepaper about racks with built in water cooling and the problems with pressure loss, dripless connectors, and corrosion. I no longer recall if the direct cooling loop exited the building or just cycled in the rack to an adjacent secondary heat exchanger. (And I don't remember if it was an APC whitepaper or some other integrator.)
        There's also all the fun experiments with dunking the whole server into oil, but I'll give you that again I've only seen setups described with secondary cooling loops - probably because of corrosion and wanting to avoid contaminants.
        [-]
        bri3d 8 hours ago
        The parent poster is just either extremely confidently wrong or talking about a very different project from the one in the linked article - here's an article from 2005 with Figure 1 dating from (according to the article) 1965 (!!) showing the same CDU architecture shown in the chipsandcheese article: https://www.electronics-cooling.com/2005/08/liquid-cooling-i...
        I do think Google must be doing something right, as their quoted PUE numbers are very strong, but nothing about what's in the linked chipsandcheese article seems groundbreaking at all architecturally, just strong micro-optimization. The article talks a lot about good plate/thermal interface design, good water flow management, use of active flow control valves, and a ton of iteration at scale to find the optimal CDU-to-hardware ratio, but at the end of the day it's the same exact thing in the diagram from 1965.
      - zer00eyz 9 hours ago
        > cooling is moving the heat completely outside of the building while it’s still liquid.
        We have been doing this for decades, it's how refrigerants work.
        The part that is new is not having an air-interface in the middle of the cycle.
        Water isn't the only material being looked at, mostly because high pressure PtC (Push to Connect) fittings, and monitoring/sensor hardware has evolved. If a coolant is more expensive but leaks don't destroy equipment, and can be quickly isolated then it becomes a cost/accounting question.
        [-]
        liquidgecka 9 hours ago
        > The part that is new is not having an air-interface in the middle of the cycle.
        I wasn’t clear when I was writing but this was the point I was trying to make. Heat from the chip is transferred in the same medium all the way from the chip to the exterior chiller without intermediate transfers to a new medium.
        marcosdumay 8 hours ago
        The claim is that Google has larger pipes that go all the way out of the building. While mainframes have short pipes that go only to a heat exchanger on the end of the hack.
        IMO, it's not a big difference. There are probably many details more noteworthy than this. And yeah, mainframes are that way because the vendor only creates them up to the hack-level, while Google has the "vendor" design the entire datacenter. Supercomputers have had single-vendor datacenters for decades too, and have been using large pipes for a while too.
        cyberax 2 hours ago
        Glycol is cheap and safe, but it has lower specific heat capacity and higher viscosity. So that's why water is still being used.
        The next step is probably evaporative cooling, with liquid coolant ("freon") pumped to individual racks.
        [-]
        jabl an hour ago
        Not sure what Google specifically is using here, but PG25 (25% propylene glycol, 75% water) id somewhat common in data center applications. The glycol takes care of avoiding microbial growth.
      - jonathaneunice 9 hours ago
        "From the core to the edges of the server"—what does that even mean?
        Unless Google has discovered a way to directly transfer heat to the aethereal plane, nothing they’re doing is new. Mainframes were moving chip and module heat entirely outside the building decades ago. Immersion cooling? Chip, module, board, rack, line, and facility-level work? Rear-door and hybrid strategies? Integrated thermal management sensors and controls? Done. Done. Done. Done. Richard Chu, Roger Schmidt, and company were executing all these strategies at scale long before Google even existed.
    - dmayle 9 hours ago
      As to MapReduce, I think you're fundamentally mistaken. You can talk about map and reduce in the lambda calculus sense of the term, but in terms of high performance distributed calculations, MapReduce was definitely invented at Google (by Jeff Dean and Sanjay Ghemawat in 2004).
      [-]
      - jonathaneunice 9 hours ago
        Not quite. Google brilliantly rebranded the work of John McCarthy, C.A.R. Hoare, Guy Steele, _et al_ from 1960 ff. e.g. https://dl.acm.org/doi/10.1145/367177.367199
        Dean, Ghemawat, and Google at large deserve credit not for inventing map and reduce—those were already canonical in programming languages and parallel algorithm theory—but for reframing them in the early 2000s against the reality of extraordinarily large, scale-out distributed networks.
        Earlier takes on these primitives had been about generalizing symbolic computation or squeezing algorithms into environments of extreme resource scarcity. The 2004 MapReduce paper was also about scarcity—but scarcity redefined, at the scale of global workloads and thousands of commodity machines. That reframing was the true innovation.
      - dekhn 8 hours ago
        CERN was doing the equivalent of MapReduce before Google existed.
        [-]
        wbl 8 hours ago
        Got a link? HEP data is really about triggers and widely distributed screening that's application specific AFAIK.
        [-]
        dekhn 8 hours ago
        My main reference is the head of computing at CERN, who explained this to me. He gave some early examples of ROOT (https://en.wikipedia.org/wiki/ROOT) using parallel processing of the ROOT equivalent of SSTables.
    - Sesse__ 9 hours ago
      > MapReduce is a great example of that- Google certainly didn't invent the concepts of Map or Reduce, or even the idea of using those for doing high throughput computing (and the shuffle phase of MapReduce is more "interesting" from a high performance computing perspective than mapping or reducing anyway).
      The “Map” in MapReduce does not originally stand for the map operation, it comes from the concept of “a map” (or, I guess, a multimap). MapReduce descends from “the ripper”, an older system that mostly did per-element processing, but wasn't very robust or flexible. I believe the map operation was called “Filter()” at the time, and reduce also was called something else. Eventually things were cleaned up and renamed into Map() and Reduce() (and much more complexity was added, such as combiners), in a sort of backnaming.
      It may be tangential, but it's not like the MapReduce authors started with “aha, we can use functional programming here”; it's more like the concept fell out. The fundamental contribution of MapReduce is not to invent lambda calculus, but to show that with enough violence (and you should know there was a lot of violence in there!), you can actually make a robust distributed system that appears simple to the users.
      [-]
      - dekhn 9 hours ago
        I believe Map in MapReduce stood for "map" function, not a multimap- I've never heard or read otherwise (maps can operate over lists of items, not just map types). That's consistent both with the original mapreduce paper: """Our abstraction is inspired by the map and reduce primitives present in Lisp and many other functional languages. We realized that most of our computations involved applying a map operation to each logical “record” in our input in order to compute a set of intermediate key/value pairs, and then applying a reduce operation to all the values that shared the same key, in order to combine the derived data appropriately"""
        and with the internal usage of the program (I only started in 2008, but spoke to Jeff extensively about the history of MR as part of Google's early infra) where the map function can be fed with recordio (list containers) or sstable (map containers).
        As for the ripper, if you have any links to that (rather than internal google lore), I'd love to hear about it. Jeff described the early infrastructure as being very brittle.
        [-]
        Sesse__ 8 hours ago
        > and with the internal usage of the program (I only started in 2008, but spoke to Jeff extensively about the history of MR as part of Google's early infra) where the map function can be fed with recordio (list containers) or sstable (map containers).
        I worked on the MapReduce team for a while (coincidentally, around 2008), together with Marián Dvorský, who wrote up a great little history of this. I don't think it was ever made public, though.
        > As for the ripper, if you have any links to that (rather than internal google lore), I'd love to hear about it. Jeff described the early infrastructure as being very brittle.
        I believe it's all internal, unfortunately.
    - Aurornis 7 hours ago
      > We've basically been watching Google gradually re-discover all the tricks of supercomputing (and other high performance areas)
      Google isn't claiming to have invented water cooling. This article recaps their talk at Hot Chips where they showed some details of their implementation.
      Data center cooling is also a different beast than supercomputer cooling. Datacenter cooling operates at a larger scale and has different needs for maintenance operations like maintenance cycles.
      There are also some interesting notes in there about new things Google is doing, like the direct-die cooling.
      Water cooling is a big field. Data center operations is a big field. There is interesting content in the article.
    - Spooky23 6 hours ago
      I think that's a little unfair.
      Building out an idea for a bespoke supercomputer and making an iteration of that idea that applies to globally scaled workloads is a different thing. They are building computing factories, as is Amazon, MS, Facebook, etc.
      That said, PR is PR, and companies always crow about something for their own reasons.
  - echelon 10 hours ago
    This abundance of marketing (not necessarily this blog post) is happening because of all the environmental chatter about AI and data centers recently.
    Google wants you to know it recycles its water. It's free points.
    Edit: to clarify, normal social media is being flooded with stories about AI energy and water usage. Google isn't greenwashing, they're simply showing how things work and getting good press for something they already do.
    [-]
    - Legend2440 10 hours ago
      The environmental impact of water usage seems way overblown to me.
      Last year, U.S. data centers consumed 17 billion gallons of water. Which sounds like a lot, but the US as a whole uses 300 billion gallons of water every day. Water is not a scarce resource in much of the country.
      [-]
      - bee_rider an hour ago
        It seems like a very minor problem next to the huge super obvious one, electricity use.
      - Guvante 9 hours ago
        To be clear all talk of water usage has been focusing on local usage which isn't well represented by national numbers.
        [-]
        abdullahkhalids 7 hours ago
        Yeah. There are metrics for how water stressed a community is. We know where data centers have been built, and we know how these water metrics have changed over time.
        The correct metric is something like, what's the probability that the launch of data center in a location results in nearby communities to drop significantly in these water metrics.
      - foota 10 hours ago
        Personally, I agree. That said, I think it might be worth considering the impact of water usage in local areas that are relatively water constrained.
    - mlyle 10 hours ago
      Google uses plenty of water in cooling towers and chillers. Sure, water cooling loops to the server may reduce the amount a little bit compared to fans, but this isn't "recycling" in any normal sense.
      It's mostly a play for density.
    - quickthrowman 8 hours ago
      Google almost certainly uses evaporative cooling towers, it’s almost twice as efficient as air cooled chillers by themselves.
- bluedino 5 hours ago
  > super-dense HPC data centers have used liquid cooling for at least 20
  Hasn't this just been for things like rack doors and such?
  In the last ~two generations of servers it seems like now there's finally DLC (direct liquid cooling) into the actual servers themselves (similar to the article). Intel kind of forced that one on us, with their high-end SKUs. This has been a pain becuase it doesn't fit into legacy datacenters as easily as the door/rack-based systems.
  I won't say which server vendor it is, but I've put in more than one service ticket for leaking coolant bags (they hang them on the racks).
- jeffbee 11 hours ago
  Hyper-scale data centers normally need not be concerned with power density, and their designers might avoid density because of the problems it causes. Arguably modern HPC clusters that are still concerned about density are probably misguided. But when it comes to ML workloads, putting everything physically close together starts to bring benefits in terms of interconnectivity.
  [-]
  - jonathaneunice 10 hours ago
    LOLWUT? Hyperscalers and HPC data centers have been very concerned about power and thermal density for decades IME. If you're just running web sites or VM farms, sure keep racks cooler and more power efficient. But for those that deeply care about performance, distance drives latency. That drives a huge demand to "pack things closely" and that drives thermal density up, up, up. "Hot enough to roast a pig" was a fintech data center meme of 20 years go, back at 20kW+ racks. Today you're not really cooking until you get north of 30kW.
    [-]
    - jeffbee 9 hours ago
      Name a hyper-scale data center where the size and shape suggests that power density ever entered the conversation.
- coliveira 9 hours ago
  This is typical for Google: they will reinvent things and say they're the first to do it. Google is typically concerned with costs; there is probably solutions out there for these problems, but they don't want to pay for it. It is cheaper, at their scale, to reinvent things and get it done internally, and then claim they're first.
  [-]
  - jsnell 9 hours ago
    Why make up such an absurd grievance to get fake-outraged about? Nowhere in the article does it say that Google claims to have invented liquid cooling. In fact, nowhere does it say they claim any part of this is a new invention.
    But the point of this kind of paper is typically not what is new, it's what combination of previously known and novel techniques have been found to work well at massive scale over a timespan of years.
pjdesno 7 hours ago
In theory data center cooling is simple - CPUs run at 60-70C, while the outside ambient is usually 30C or less, so heat should just "flow downhill" with a bit of help from fans, pumps, etc.
The problem with using air cooling to get it there is that the humans who run the data center have to enter and breathe the same air that's used to cool the computers, and if your working fluid gets too hot it's quite unhealthy for them. (we run our hot aisles at 100F, which is a bit toasty, and every third rack is a heat exchanger running off the chilled water lines from the outside evaporative cooler, modulo a heat exchanger to keep the bird shit out)
We're not going to be able to pump much heat into the outside world unless our working fluid is a decent amount hotter than ambient, so when it gets reasonably warm outside we need to put chillers (water-to-water AC units) in the loop, which consume energy to basically concentrate that heat into a higher-temperature exhaust. When it's really hot outside they consume quite a bit of energy.
If the entire data center was liquid cooled we could have coolant coming from the racks at a much higher temperature, and we'd be able to dump the heat outside on the hottest days without needing any chillers in the loop. As it is we have some liquid cooled racks running off the same chilled water lines as the in-aisle heat exchangers, but the coolant temp is limited by the temperature of our hot aisles, which are quite hot enough already, thank you.
[-]
- fanf2 7 hours ago
  15 years ago IBM installed a supercomputer at ETH Zurich that used 60C hot water as its coolant, with a heat exchanger to the building’s hot water system (which is typically somewhat less than 60C) https://en.m.wikipedia.org/wiki/Aquasar
michaelt 11 hours ago
> TPU chips are hooked up in series in the loop, which naturally means some chips will get hotter liquid that has already passed other chips in the loop. Cooling capacity is budgeted based on the requirements of the last chip in each loop.
Of course, it's worth noting that if you've got four chips, each putting out 250W of power, and a pump pushing 1 litres of water per minute through them, water at the outlet must be 14°C hotter than water at the inlet, because of the specific heat capacity of water. That's true whether the water flows through the chips in series, or in parallel.
[-]
- foota 11 hours ago
  Hm... but in the case when the chips are in serial, the heat transfer from the last chip will be less than when the chips are in parallel, because the rate of heating is proportional to the difference in temperature, and the water starts at a lower temperature for the parallel case for this last theoretical chip.
  [-]
  - chickenbig 9 hours ago
    In steady state the power put into the chip is removed by the water (neglecting heat transfer away from the cooling system). The increased water temperature on entering into the cooling block is offset by a correspondingly higher chip temperature.
  - fraserphysics 10 hours ago
    One way to characterize the cost of cooling is entropy production. As you say, cooling is proportional to difference in temperature. However, entropy production is also proportional to temperature difference. It's not my field, but it looks like an interesting challenge to optimize competing objectives.
  - 0x457 9 hours ago
    Yes, but water is constantly moving in a loop. It's not like you use water to cool chip #1, and then it moves to chip #2, it's constantly moving, so temperature delta isn't that much.
    [-]
    - smachiz 8 hours ago
      in their first serial design, that's exactly what it was doing.
- friendzis 10 hours ago
  While there is some truth to your comment, it has no practical engineering relevance. Since energy transfer rate is proportional to temp difference, therefore you compute the flow rate required, which is going to be different if the chips are in series or in parallel.
- JonChesterfield 2 hours ago
  The equations all work out OK though since one can go substantially above a litre a minute at pressures which are still viable. The standard 18W thing for desktops is about ten times that.
- idiotsecant 10 hours ago
  It just means in series that some of your chips get overcooled in order to achieve the required cooling on the hottest chip. You need to run more water for the same effect.
  [-]
  - k7sune 10 hours ago
    I can imagine a setup where multiple series of slower cooler water converging into a faster warmer stream, and the water will extract an equal amount of heat away from all the chips whether upstream or downstream.
hnburnsy 9 hours ago
Kind of related from B1M...
>The Paris Olympic Pool is Heated by the Internet
>https://www.youtube.com/watch?v=2gWudPtN6z4&t=4s
BoppreH 11 hours ago
I see frequent mentions of AI wasting water. Is this one such setup, perhaps with the CDU using the facility's water supply for evaporative cooling?
[-]
- bri3d 10 hours ago
  The CDU is inside the datacenter and strictly liquid to liquid exchange. It transfers heat from the rack block's coolant to the facility coolant. The facility then provides outdoor heat exchange for the facility coolant, which is sometimes accomplished using open-loop evaporative cooling (spraying down the cooling towers). All datacenters have some form of facility cooling, whether there's a CDU and local water cooling or not, so it's not particularly relevant.
  The whole AI-water conversation is sort of tiring, since water just moves to more or less efficient parts or locations in the water cycle - I think a "total runtime energy consumption" metric would be much more useful if it were possible to accurately price in water-related externalities (ie - is a massive amount of energy spent moving water because a datacenter evaporates it? or is it no big deal?). And the whole thing really just shows how inefficient and inaccurately priced the market for water is, especially in the US where water rights, price, and the actual utility of water in a given location are often shockingly uncorrelated.
  [-]
  - bjackman 8 hours ago
    Even without AI the whole water conservation topic is kinda silly.
    The margin at which it makes sense to save water varies wildly by location, but the cultural dominance of the western USA infiltrates everything.
    Here in Zürich, my office briefly installed water saving taps. This is a city of less than half a million where the government maintains 1,200 fountains that spew drinkable water 24/7. But someone at the California HQ said saving water is important and someone at the Swiss subsidiary said yes sir we'll look into it right away sir.
    [-]
    - acyou 4 hours ago
      It's on the same level as people using incandescent light bulbs. Well we clear 160k Euros after taxes and have public medical care, and electricity is 10c/kWh here, so why does it matter what bulbs we use?
      We live in an area surrounded by grass fed cows, so what does it matter if we throw away 3/4 of our steak?
      Without regard to how plentiful resources are in our particular area, being needlessly wasteful is in bad taste more than anything. It's a lack of appreciation of the value of what we have.
      For water specifically - it is generally speaking the most valuable resource available, we just don't appreciate it because we happen to have a lot of it.
      [-]
      - MindSpunk 44 minutes ago
        While I'm not saying waste when things are plentiful in general is okay, I think water is a unique case that can be treated differently.
        Comparing to energy costs isn't the same because using the energy for the incandescent bulb consumes that energy permanently. The gas/coal/fuel can't be un-burned. Although solar changes this as the marginal cost of that energy is free.
        Comparing to food is similar. Once the food is wasted it is gone.
        Water is typically not destroyed, it's just moved around in the water cycle. Water consumption in a region is dictated by the throughput the water cycle replenishes the reservoirs you're pulling from. "Waste" with water is highly geographic, and it's pretty reasonable to take exception to California projecting their problems to geographic regions that they aren't important.
  - maartin0 10 hours ago
    Slightly old but interesting https://blog.equinix.com/blog/2024/09/19/how-data-centers-us...
- sleepydog 10 hours ago
  AWS had a similar article a couple months ago:
  https://www.aboutamazon.com/news/aws/aws-liquid-cooling-data...
  In either case I cannot find out how they dump the heat from the output water before recycling it. That's a problem I find far more interesting.
- Lerc 10 hours ago
  I have encountered a lot of references to AI using water, but with scant details. Is it using water in the same way a car uses a road? The road remains largely unchanged?
  The implication is clear that it is a waste, but I feel like if they had the data so support that, it wouldn't be left for the reader to infer.
  I can see two models where you could say water is consumed. Either talking about drinkable water rendered undrinkable, or turning water into something else where it is not practically recaptured. Tuning it into steam, sequestering it in some sludge etc.
  Are these things happening? If it is happening, is it bad? Why?
  I'd love to see answers on this, because I have seen the figures used like a kudgel without specifying what the numbers actually refer to. It's frustrating as hell.
  [-]
  - tony_cannistra 9 hours ago
    This article will help you. https://www.construction-physics.com/p/how-does-the-us-use-w...
    > ...actual water consumed by data centers is around 66 million gallons per day. By 2028, that’s estimated to rise by two to four times. This is a large amount of water when compared to the amount of water homes use, but it's not particularly large when compared to other large-scale industrial uses. 66 million gallons per day is about 6% of the water used by US golf courses, and it's about 3% of the water used to grow cotton in 2023.
  - jeffbee 10 hours ago
    The water is "used" in the sense that it evaporates. At a global average rate of 1 liter per kilowatt-hour of energy, Google claims.
    [-]
    - Lerc 6 hours ago
      What does that mean for environmental impact? Hydroelectric power uses water in the sense that the "used" water is at a lower altitude. What happens once that is done, is what matters.
      Depending on what global average means, it seems like that's quite a lot of cycling of evaporation unless they are releasing steam at 800C
      Looking up overall water usage, the US uses 27.4Billion gallons a day residential and 18.2 Billion gallons industrial. It surprised me that industrial was lower, but I guess the US manufactures less these days.
      If the 1kwh per litre were accurate then judging by this calc https://www.google.com/search?q=(27.4billion+gallons+*365)*+... 37 857 903.3 terawatt hours
      0.1% of The residential water use of the US would be enough to cool the entire Electicity output of the world (about 30,000TWh)
      (of course, with these things it's easy to slip an order of magnitude(or several) so best check my numbers)
      [-]
      - jeffbee 5 hours ago
        It means different things in different places, depending on where the water was sourced and the counterfactual of what would otherwise have happened to it. For example if you source surface water that evaporates, and the local rainshed feeds that watershed, then it's virtually closed-loop. If you're pumping fossil water out of an aquifer, and it rains over the ocean, then it's the opposite.
        [-]
        jabl 33 minutes ago
        I suppose in some desert-ish climate the extra moisture given to the air doesn't rain down nearby but moves with the wind somewhere far away where it rains. So there might be an effect of using locally precious water even if it's a closed loop on a continent level.
    - aspenmayer 7 hours ago
      Does this evaporated water stay in the loop and condense out later? Is an extra liter on the front end needed in the line until the first liter falls out of the back end?
  - quickthrowman 8 hours ago
    > I have encountered a lot of references to AI using water, but with scant details.
    Here you go: https://en.m.wikipedia.org/wiki/Cooling_tower
    Data centers use evaporative cooling towers which evaporate water to reject heat to the atmosphere.
- esaym 3 hours ago
  Yes see: https://www.austinchronicle.com/news/2025-07-25/texas-is-sti...
- jeffbee 10 hours ago
  The reason you see frequent mentions of AI wasting water is there is a massive influence operation that seeks to destroy knowledge, science, and technology in the United States and return most of us to a state of bare subsistence, working menial jobs or reduced to literal slavery. There is no subjective measure by which the water used by AI is even slightly concerning.
  [-]
  - bryan_w 20 minutes ago
    Ayyo chill. Don't let on that you know their plans. People aren't ready to know about the operations yet, so you'll come across as off balanced
  - wredcoll 10 hours ago
    I looked very hard but I don't see a way to subscribe to your newsletter?
  - stripe_away 10 hours ago
    > there is a massive influence operation that seeks to destroy knowledge, science, and technology in the United States
    Agreed. Started with big tobacco by discrediting the connection to lung cancer, playbook copied by many and weaponized by Russia.
    > There is no subjective measure by which the water used by AI is even slightly concerning.
    Does not follow from your first point. The water has to be sourced from somewhere, and debates over water rights are as old as civilization. For one recent example, see i.e. https://www.texaspolicy.com/legewaterrights/
    You are probably correct that the AI does not damage the water, but unless there are guarantees that the water is rapidly returned "undamaged" to the source, there are many reasons to be concerned about who is sourcing water from where.
m463 10 hours ago
I wonder what the economics of water cooling really is.
Is it because chips are getting more expensive, so it is more economical to run them faster by liquid cooling them?
Or is it data center footprint is more expensive, so denser liquid cooling makes more sense?
Or is it that wiring distances (1ft = 1nanosecond) make dense computing faster and more efficient?
[-]
- MurkyLabs 10 hours ago
  It's a mixture of both 2 and 3. The chips are getting hotter because they're compacting more stuff in a small space and throwing more power into them. At the same time, powering all those fans that cool the computers takes a lot of power (when you have racks and racks those small fans add up quickly) and that heat is then blown into hot isles that need to then circulate the heat to A/C units. With liquid cooling they're able to save costs due to lower electricity usage and having direct liquid to liquid cool as apposed to chip->air->AC->liquid. ServeTheHome did a write up on it last year, https://www.servethehome.com/estimating-the-power-consumptio...
  [-]
  - mikepurvis 9 hours ago
    I've never done DC ops, but I bet fan failure is a factor too— basically there'd be a benefit to centralizing all the cooling for N racks in 2-3 large redundant pumps rather than having each node bringing its own battalion of fans that are all going to individually fail in a bell curve centered on 30k hours of operation, with each failure knocking out the system and requiring hands-on maintenance.
- summerlight 10 hours ago
  Not sure about classical computing demands, but I think wiring distances definitely matter for TPU-like memory heavy computation.
- xadhominemx 7 hours ago
  It’s because the chips are networked with very high bitrates and need to be physically densely packed together.
- moffkalast 10 hours ago
  It's more of a testament to inefficiency, with rising TDP year after year as losses get larger with smaller nm processes. It's so atrocious, even in the consumer sector Nvidia can't even design a connector that doesn't melt during normal usage because their power draw has become beyond absurd.
  People don't really complain about crappy shovels during a gold rush though unfortunately, they're just happy they got one before they ran out. They have no incentive to innovate in efficiency while the performance line keeps going up.
ianpenney 8 hours ago
Between 2006 and 2012 I was in DCs. I didn’t live there but I had to visit at stupid o’clock too often.
It’s not a hospitable place.
I would have loved it if cooling was quieter and less extreme.
The reason interfaces are on the back is that’s the intake side. So, bring a sweater.
Or just go to the other side of the rack to warm your hands back up.
tyingq 6 hours ago
OVH has been doing liquid cooling for quite a while, and is now using immersion also. Somewhat surprising to see at that dirt cheap end of the market.
nwmcsween 8 hours ago
What I don't understand is why not use what I've seen in industrial processes? Why not a low boiling point liquid and a heat exchanger? Or even better power a turbine like biogas plants do
miohtama 9 hours ago
How much moat Google has in the AI race due to its in-house expertise in data centers and in-house IPR in TPUs? Can any company on Earth match them?
[-]
- owebmaster 8 hours ago
  Not enough to prevent open source models making it all useless, I think.
betaby 11 hours ago
Relevant.
OVH has been using liquid cooling for many years:
https://blog.ovhcloud.com/new-hybrid-immersion-liquid-coolin...
https://www.youtube.com/watch?v=S_DHJcQpumI
[-]
- bri3d 11 hours ago
  Your linked articles are about immersion cooling, which is "liquid cooling," I suppose, but a very different thing. Do OVH actually use immersion cooling in production? This seems like a "labs" product that hasn't been fully baked yet.
  OVH _do_ definitely use traditional water-loop/water-block liquid cooling like Google are presenting, described here: https://blog.ovhcloud.com/understanding-ovhclouds-data-centr... and visually in https://www.youtube.com/watch?v=RFzirpvTiOo , but while it looks very similar architecturally their setup seems almost comically inefficient compared to Google's according to their PUE disclosures.
- jeffbee 11 hours ago
  And yet their claimed PUE is 1.26 which is pretty bad. One way to characterize that overall PUE figure is they waste 3x as much on overhead as Google (claimed 1.09 global PUE) or Meta (1.08).
hnburnsy 9 hours ago
>"Google extensively validates components with leak testing, uses alerting systems to discover problems like leaks, and takes preventative measures like scheduled maintenance and filtration. They also have a clear set of protocols to respond to alerts and issues, allowing a large workforce to respond to problems in a consistent fashion. It’s a far cry from the ad-hoc measures enthusiasts take to maintain their water cooling setups."
How about the author compare it to MS, Meta, or AWS instead of Joe Blow buying parts online? I would hope that Google had extensive validation and clear protocols. [Roll-Eyes]
[-]
- MBCook 7 hours ago
  My guess would be the average reader is probably far more familiar with people water cooling their own machines than the intricacies of how various large companies are doing water cooling (or not) in their data centers.
adolph 6 hours ago
Seeing in "installation & commissioning" mention of "Leak testing" reminds me of the PowerMac G5 liquid cooling leaks:
https://www.xlr8yourmac.com/systems/G5_CoolantLeak_Repair/G5...