I hear your argument, but short of major algorithmic breakthroughs I am not conv...

reppap · 2026-01-20T23:10:32 1768950632

People will want more GPUs but will they be able to fund them? At what point do the venture capital and loans run out? People will not keep pouring hundreds of billions into this if the returns don't start coming.

gadflyinyoureye · 2026-01-21T01:35:00 1768959300

Money will be interesting the next few years.

There is a real chance that the Japanese carry trade will close soon, with the BoJ seeing rates move up to 4%. This means liquidity will drain from the US markets back into Japan. On the US side there is going to be a lot of inflation between money printing, refund checks, amortization changes and a possible war footing. Who knows?

coryrc · 2026-01-20T19:08:07 1768936087

Not that locked out: https://www.cnbc.com/2025/12/31/160-million-export-controlle...

agentcoops · 2026-01-21T18:37:43 1769020663

Yeah, that's the bull case for sure. Chinese firms might not accept training setbacks even given CCP regulations that they dogfood X homegrown chip.

tracker1 · 2026-01-20T23:15:03 1768950903

Doesn't even necessarily need to be CUDA compatible... there's OpenCL and Vulkan as well, and likely China will throw enough resources at the problem to bring various libraries into closer alignment to ease of use/development.

I do think China is still 3-5 years from being really competitive, but still even if they hit 40-50% of NVidia, depending on pricing and energy costs, it could still make significant inroads with legal pressure/bans, etc.

bigyabai · 2026-01-21T01:56:22 1768960582

> there's OpenCL and Vulkan as well

OpenCL is chronically undermaintained & undersupported, and Vulkan only covers a small subset of what CUDA does so far. Neither has the full support of the tech industry (though both are supported by Nvidia, ironically).

It feels like nobody in the industry wants to beat Nvidia badly enough, yet. Apple and AMD are trying to supplement raster hardware with inference silicon; both of them are afraid to implement a holistic compute architecture à la CUDA. Intel is reinventing the wheel with OneAPI, Microsoft is doing the same with ONNX, Google ships generic software and withholds their bespoke hardware, and Meta is asleep at the wheel. All of them hate each other, none of them trust Khronos anymore, and the value of a CUDA replacement has ballooned to the point that greed might be their only motivator.

I've wanted a proper, industry-spanning CUDA competitor since high school. I'm beginning to realize it probably won't happen within my lifetime.

zozbot234 · 2026-01-21T02:56:38 1768964198

The modern successor to OpenCL is SYCL and there's been some limited convergence with Vulkan Compute (they're still based on distinct programming models and even SPIR-V varieties under the hood, but the distance is narrowing somewhat).

pjmlp · 2026-01-22T16:01:34 1769097694

Which is basically Intel for practical purposes.

robmay · 2026-01-21T13:04:24 1769000664

Lemurian Labs is working on this https://www.lemurianlabs.com/

Balinares · 2026-01-21T07:45:08 1768981508

Ask Claude, HN tells me that it can implement the things that you ask.

laughing_man · 2026-01-21T05:39:28 1768973968

I suspect major algorithmic breakthroughs would accelerate the demand for GPUs instead of making it fall off, since the cost to apply LLMs would go down.

MaxBarraclough · 2026-01-21T19:58:31 1769025511

Sounds like the Jevons paradox. From https://en.wiktionary.org/wiki/Jevons_paradox :

> The proposition that technological progress that increases the efficiency with which a resource is used tends to increase (rather than decrease) the rate of consumption of that resource.

See also Wikipedia: https://en.wikipedia.org/wiki/Jevons_paradox
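
A rough numeric illustration of the definition quoted above, as a sketch only: it assumes a constant-elasticity demand curve, and the function name and all numbers are made up for illustration. When demand for the service is elastic enough (elasticity above 1), doubling efficiency raises total resource use instead of lowering it.

```python
def resource_use(efficiency: float, elasticity: float, base_demand: float = 100.0) -> float:
    """Resource consumed under an assumed constant-elasticity demand curve.

    Cost per unit of service is taken as 1 / efficiency.
    Service demanded is base_demand * cost ** (-elasticity).
    Resource consumed is service demanded divided by efficiency.
    """
    cost = 1.0 / efficiency
    service = base_demand * cost ** (-elasticity)
    return service / efficiency


if __name__ == "__main__":
    # Hypothetical elasticities: inelastic, unit-elastic, elastic demand.
    for elasticity in (0.5, 1.0, 1.5):
        before = resource_use(1.0, elasticity)
        after = resource_use(2.0, elasticity)
        print(f"elasticity={elasticity}: {before:.1f} -> {after:.1f}")
```

Under these assumptions, resource use scales as efficiency ** (elasticity - 1), so the paradox only shows up in the elastic case.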

nroets · 2026-01-21T06:36:39 1768977399

Some changes to the algorithms and implementations will allow cheaper commodity hardware to be used.

Rover222 · 2026-01-21T11:45:58 1768995958

There will always be an incentive to scale data centers. Better algorithms just mean more bang per gpu, not that “well, that’s enough now, we’ve done it”.

iLoveOncall · 2026-01-20T19:23:51 1768937031

> short of major algorithmic breakthroughs I am not convinced the global demand for GPUs will drop any time soon

Or, you know, when LLMs don't pay off.

unsupp0rted · 2026-01-20T22:00:03 1768946403

Even if LLMs didn't advance at all from this point onward, there's still loads of productive work that could be optimized / fully automated by them, at no worse output quality than the low-skilled humans we're currently throwing at that work.

pvab3 · 2026-01-20T22:53:08 1768949588

Inference requires a fraction of the power that training does. According to the Villalobos paper, the median date for running out of human-generated text to train on is 2028. At some point we won't be training bigger and bigger models every month. We will run out of additional material to train on, things will continue commodifying, and then the amount of training happening will significantly decrease unless new avenues open for new types of models. But our current LLMs are much more compute-intensive than any other type of generative or task-specific model.

SequoiaHope · 2026-01-21T05:15:39 1768972539

Run out of training data? They’re going to put these things in humanoids (they are weirdly cheap now) and record high resolution video and other sensor data of real world tasks and train huge multimodal Vision Language Action models etc.

The world is more than just text. We can never run out of pixels if we point cameras at the real world and move them around.

I work in robotics and I don’t think people talking about this stuff appreciate that text and internet pictures is just the beginning. Robotics is poised to generate and consume TONS of data from the real world, not just the internet.

DoctorOetker · 2026-01-21T19:28:17 1769023697

While we may run out of human written text of value, we won't run out of symbolic sequences of tokens: we can trivially start with axioms and do random forward chaining (or random backward chaining from postulates), and then train models on 2-step, 4-step, 8-step, ... correct forward or backward chains.

Nobody talks about it, but ultimately the strongest driver for terascale compute will be for mathematical breakthroughs in cryptography (not bruteforcing keys, but bruteforcing mathematical reasoning).
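
To make the random forward chaining idea above concrete, here is a minimal sketch. It assumes a toy propositional rule base; `RULES`, `AXIOMS`, and both function names are hypothetical, and a real system would work over a much richer logic. Each generated chain is correct by construction, and the step count can be set to 2, 4, 8 and so on.

```python
import random

# Hypothetical toy rule base: each rule is (premises, conclusion).
RULES = [
    (frozenset({"a"}), "b"),
    (frozenset({"a"}), "f"),
    (frozenset({"a", "b"}), "c"),
    (frozenset({"c"}), "d"),
    (frozenset({"b", "d"}), "e"),
]
AXIOMS = frozenset({"a"})


def random_forward_chain(axioms, rules, max_steps, rng):
    """Derive up to max_steps new facts, picking a random applicable rule each step."""
    known = set(axioms)
    chain = []
    for _ in range(max_steps):
        # A rule is applicable if its premises are known and it adds a new fact.
        applicable = [(p, c) for p, c in rules if p <= known and c not in known]
        if not applicable:
            break
        premises, conclusion = rng.choice(applicable)
        known.add(conclusion)
        chain.append((sorted(premises), conclusion))
    return chain


def is_valid_chain(axioms, rules, chain):
    """Check that every step uses a real rule whose premises were already derived."""
    known = set(axioms)
    for premises, conclusion in chain:
        if (frozenset(premises), conclusion) not in rules:
            return False
        if not set(premises) <= known:
            return False
        known.add(conclusion)
    return True


if __name__ == "__main__":
    rng = random.Random(0)
    for steps in (2, 4):
        chain = random_forward_chain(AXIOMS, RULES, steps, rng)
        print(steps, chain, is_valid_chain(AXIOMS, RULES, chain))
```

The validity checker is the important part for training data: it is what makes the output "easy to verify" regardless of how the chain was sampled.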

vintermann · 2026-01-21T09:40:57 1768988457

Yeah, another source of "unlimited data" is genetics. The human reference genome is about 6.5 GB, but these days, they're moving to pangenomes, wanting to map out not just the genome of one reference individual, but all the genetic variation in a clade. Depending on how ambitious they are about that "all", they can be humongous. And unlike say video data, this is arguably a language. We're completely swimming in unmapped, uninterpreted language data.

boppo1 · 2026-01-21T15:05:01 1769007901

Can you say more?

yourapostasy · 2026-01-20T23:17:18 1768951038

Inference leans heavily on GPU RAM and RAM bandwidth for the decode phase, where an increasing share of time is being spent as people find better ways to leverage inference. So NVIDIA users are arguably going to demand a different product mix when the market shifts away from the current training-friendly products. I suspect there will be more than enough inference demand that whatever power is freed up by a relative slackening of training will be more than absorbed by a large inference market.

It isn’t the panacea some make it out to be, but there is obvious utility here to sell. The real argument is shifting towards the pricing.
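
A back-of-the-envelope sketch of why decode leans on memory bandwidth, under a deliberately crude assumption: every generated token requires reading all model weights from GPU memory once, and nothing else (KV cache, batching, compute) matters. The function name and the numbers are illustrative, not measurements of any real product.

```python
def decode_tokens_per_second(param_count: float,
                             bytes_per_param: float,
                             mem_bandwidth_bytes_per_s: float) -> float:
    """Rough upper bound on single-stream decode speed.

    Assumes each token requires streaming every weight from memory once,
    so throughput is bandwidth divided by total weight bytes.
    """
    weight_bytes = param_count * bytes_per_param
    return mem_bandwidth_bytes_per_s / weight_bytes


if __name__ == "__main__":
    # Hypothetical setup: 100B parameters at 1 byte each, 2 TB/s of bandwidth.
    print(decode_tokens_per_second(100e9, 1, 2e12))  # -> 20.0
```

In this simplified model, doubling bandwidth doubles decode speed while extra FLOPs change nothing, which is the product-mix point being made above.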

zozbot234 · 2026-01-20T23:09:44 1768950584

> We will run out of additional material to train on

This sounds a bit silly. More training will generally result in better modeling, even for a fixed amount of genuine original data. At current model sizes, it's essentially impossible to overfit to the training data so there's no reason why we should just "stop".

_0ffh · 2026-01-21T00:51:36 1768956696

You'd be surprised how quickly improvement of autoregressive language models levels off with epoch count (though, admittedly, one epoch is a LOT). Diffusion language models otoh indeed keep profiting for much longer, fwiw.

zozbot234 · 2026-01-21T09:33:54 1768988034

Does this also apply to LLM training at scale? I would be a bit surprised if it does, fwiw.

_0ffh · 2026-01-21T12:48:40 1768999720

Yup, as soon as data is the bottleneck and not compute, diffusion wins. Tested following the Chinchilla scaling strategy from 7M to 2.5B parameters.

https://arxiv.org/abs/2507.15857
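
For a sense of scale, here is a sketch of what "Chinchilla scaling" implies at the two model sizes mentioned. It assumes the commonly quoted rules of thumb of about 20 training tokens per parameter and about 6 * N * D training FLOPs. Both constants are approximations and are not taken from the linked paper; the function name is hypothetical.

```python
TOKENS_PER_PARAM = 20  # assumed rule of thumb, not from the linked paper
FLOPS_PER_PARAM_TOKEN = 6  # assumed approximation: FLOPs ~= 6 * N * D


def chinchilla_budget(n_params: int) -> tuple[int, int]:
    """Return (unique tokens wanted, approximate training FLOPs) for n_params."""
    tokens = TOKENS_PER_PARAM * n_params
    flops = FLOPS_PER_PARAM_TOKEN * n_params * tokens
    return tokens, flops


if __name__ == "__main__":
    for n in (7_000_000, 2_500_000_000):
        tokens, flops = chinchilla_budget(n)
        print(f"{n:>13,} params: {tokens:,} tokens, ~{flops:.2e} FLOPs")
```

Data becomes the bottleneck once the unique tokens available fall short of that token budget, so the same text has to be repeated across epochs.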

pvab3 · 2026-01-20T23:31:13 1768951873

I'm just talking about text generated by human beings. You can keep retraining with more parameters on the same corpus

https://proceedings.mlr.press/v235/villalobos24a.html

x-complexity · 2026-01-21T01:35:16 1768959316

> I'm just talking about text generated by human beings.

That in itself is a goalpost shift from

> > We will run out of additional material to train on

Where it is implied "additional material" === "all data, human + synthetic"

------

There's still some headroom left in the synthetic data playground, as cited in the paper linked:

https://proceedings.mlr.press/v235/villalobos24a.html ( https://openreview.net/pdf?id=ViZcgDQjyG )

"On the other hand, training on synthetic data has shown much promise in domains where model outputs are relatively easy to verify, such as mathematics, programming, and games (Yang et al., 2023; Liu et al., 2023; Haluptzok et al., 2023)."

With the caveat that translating this success outside of these domains is hit-or-miss:

"What is less clear is whether the usefulness of synthetic data will generalize to domains where output verification is more challenging, such as natural language."

The main bottleneck for this area of the woods will be (X := how many additional domains can be made easily verifiable). So long as (the rate of X) >> (training absorption rate), the road can be extended for a while longer.

SchemaLoad · 2026-01-20T22:09:44 1768946984

How much of the current usage is productive work that's worth paying for vs personal usage / spam that would just drop off after usage charges come in? I imagine flooding youtube and instagram with slop videos would reduce if users had to pay fair prices to use the models.

The companies might also downgrade the quality of the models to make it more viable to provide as an ad supported service which would again reduce utilisation.

unsupp0rted · 2026-01-20T22:20:59 1768947659

For any "click here and type into a box" job for which you'd hire a low-skilled worker and give them an SOP to follow, you can have an LLM-ish tool do it.

And probably for the slightly more skilled email jobs that have infiltrated nearly all companies too.

Is that productive work? Well if people are getting paid, often a multiple of minimum wage, then it's productive-seeming enough.

greree · 2026-01-21T04:24:10 1768969450

Another bozo making fun of other job classes.

Why are there still customer service reps? Shouldn’t they all be gone by now due to this amazing technology?

Ah, tumbleweed.

bethekidyouwant · 2026-01-21T03:24:34 1768965874

Who is generating videos for free?

stingraycharles · 2026-01-20T22:11:54 1768947114

Exactly, the current spend on LLMs is based on extremely high expectations and the vendors operating at a loss. It’s very reasonable to assume that those expectations will not be met, and spending will slow down as well.

Nvidia’s valuation is based on the current trend continuing and even increasing, which I consider unlikely in the long term.

bigyabai · 2026-01-20T22:27:50 1768948070

> Nvidia’s valuation is based on the current trend continuing

People said this back when Folding@Home was dominated by Team Green years ago. Then again when GPUs sold out for the cryptocurrency boom, and now again that Nvidia is addressing the LLM demand.

Nvidia's valuation is backstopped by the fact that Russia, Ukraine, China and the United States are all tripping over themselves for the chance to deploy it operationally. If the world goes to war (which is an unfortunate likelihood) then Nvidia will be the only trillion-dollar defense empire since the DoD's Last Supper.

matthewdgreen · 2026-01-20T22:34:17 1768948457

China is restricting purchases of H200s. The strong likelihood is that they're doing this to promote their own domestic competitors. It may take a few years for those chips to catch up and enter full production, but it's hard to envision any "trillion dollar" Nvidia defense empire once that happens.

bigyabai · 2026-01-20T23:00:32 1768950032

It's very easy to envision. America needs chips, and Intel can't do most of this stuff.

zozbot234 · 2026-01-20T23:12:31 1768950751

Intel makes GPUs.

bigyabai · 2026-01-21T00:09:51 1768954191

Intel's GPU designs make AMD look world-class by comparison. Outside of transcode applications, those Arc cards aren't putting up a fight.

irishcoffee · 2026-01-21T06:00:34 1768975234

...if you can't be with the one you love, love the one you're with?

pjmlp · 2026-01-22T16:02:57 1769097777

Intel's GPU story all their life.

MichaelRo · 2026-01-21T07:59:12 1768982352

> short of major algorithmic breakthroughs I am not convinced the global demand for GPUs will drop any time soon

>> Or, you know, when LLMs don't pay off.

Heh, exactly the observation that a fanatic religious believer cannot possibly foresee. "We need more churches! More priests! Until a breakthrough in praying technique is achieved I don't foresee less demand for religious devotion!" Nobody foresaw Nietzsche and the decline in blind faith.

But then again, like an atheist back in the day, the furious zealots would burn me at the stake if they could, for saying this. Sadly no longer possible so let them downvotes pour instead!

selfhoster11 · 2026-01-20T19:48:20 1768938500

They already are paying off. The nature of LLMs means that they will require expensive, fast hardware that's a large capex.

kortilla · 2026-01-20T19:53:31 1768938811

They aren’t yet because the big providers that paid for all of this GPU capacity aren’t profitable yet.

They continually leapfrog each other and shift customers around, which indicates that the current capacity is already higher than what is required for what people actually pay for.

MrDarcy · 2026-01-20T20:49:22 1768942162

Google, Amazon, and Microsoft aren’t profitable?

notyourwork · 2026-01-20T21:02:10 1768942930

I assume the point was that AI use cases are not profitable. Those companies are subsidizing them, and OpenAI/Grok are burning money.

lossyalgo · 2026-01-21T00:00:20 1768953620

Yeah but OpenAI is adding ads this year for the free versions, which I'm guessing is most of their users. They are probably hedging on taking a big slice of Google's advertising monopoly-pie (which is why Google is also now all-in on forcing Gemini opt-out on every product they own, they can see the writing on the wall).

onion2k · 2026-01-20T23:53:03 1768953183

Google, Amazon, and Microsoft do a lot of things that aren't profitable in themselves. There is no reason to believe a company will kill a product line just because it makes a loss. There are plenty of other reasons to keep it running.

notyourwork · 2026-01-21T16:44:45 1769013885

I didn't imply anything about what big-tech would do.

wolfram74 · 2026-01-20T22:20:13 1768947613

Do you think it's odd you only listed companies with already existing revenue streams and not companies that started with and only have generative algos as their product?

josefx · 2026-01-20T21:15:30 1768943730

Aren't all Microsoft products OpenAI based? OpenAI has always been burning money.

dangus · 2026-01-20T22:25:50 1768947950

How many business units have Google and Microsoft shut down or ceased investment for being unprofitable?

I hear Meta is having massive VR division layoffs…who could have predicted?

Raw popularity does not guarantee sustainability. See: Vine, WeWork, MoviePass.

Forgeties79 · 2026-01-20T20:30:38 1768941038

Where? Who’s in the black?

selfhoster11 · 2026-01-21T17:33:20 1769016800

The users.

Forgeties79 · 2026-01-22T02:39:28 1769049568

Ehhhhhhh

kelseyfrog · 2026-01-21T17:33:16 1769016796

Algorithmic breakthroughs (increases in efficiency) risk Jevons Paradox. More efficient processes make deploying them even more cost effective and increase demand.