Anybody could release an open model tomorrow. Google is the only US-based lab releasing open-weights models. OpenAI released one once, which might or might not count as "releasing", depending on your definition
You forgot the GPT-2 that came long before that. OpenAI was the lab that released open models.
None of this is factually correct; that's really all there is to it. I don't think this is debatable. I don't love OpenAI, but OpenAI made huge contributions to the field, and one should give credit where credit is due.
I have great trouble understanding why someone would waste time defending it.
I hope / think they are going to release more, just going for one big release a year like Gemma (if we're talking strictly about general chat models -- Gemma 3 was March 2025)
Spreading propaganda through an aligned model censored to eschew wrongthink? I mean, I truly believe there's some of that in the LLM world, but it's probably not the real reason you're searching for. They might be trying to (re)gain mindshare/cred among hackers.
Is it though? Do we still have the expectation that LLMs will eventually be able to solve problems they haven't seen before? Or do we just want the most accurate auto complete at the cheapest price at this point?
It indicates that there's a good chance that they have trained on the test set, making the eval scores useless. Even if you have given up on the dream of generalization entirely, you can't meaningfully compare models that have trained on the test set to those that have not.
And the only people who could afford to tractor at scale are Cargill/Monsanto who bought out most of the small/medium-sized farms while leaving farms that didn't take the offer to slowly die...
And yet there isn't widespread unemployment. Fewer farmers were needed so fewer people became farmers. Food became cheap and plentiful. Everyone else went on to do other things that they couldn't afford to do before. Software will do the same; we will make more software with fewer people and it will become ubiquitous to the point that people will just quickly generate whatever software they need rather than do many monotonous tasks manually.
They're increasing reps and therefore total load. That's still a form of progression ('pushing yourself'). This style will slightly favor hypertrophy gains over strength gains.
At 40 I recently made this switch in style as well. The weight was getting so high that my anxiety was causing a mental aversion to working out altogether. Consistency is really 95% of exercise so I think this is a reasonable trade-off.
That said, I understand where you are coming from. There's something to be said about facing the fear of the weight head on. I've already done that in my younger years though. I'd much rather avoid injury and get 80% of the benefits.
You shouldn't be stressed by what's in front of you. Training builds that resilience too, beyond just muscle/power. If you don't compete, you have no reason to be anxious. Maybe dig into what's causing that anxiety; if it's "I worry I won't make this weight", remind yourself that nothing bad will happen if you don't, and that missing a lift is part of the progression. I get this anxiousness too, but I always remind myself of that.
I think that what you do in the gym will reflect on yourself.
Extensive Tailwind training data in the models. Sure, there's something more efficient out there, but it's just safer to let the model leverage what it was trained on.
In my experience the LLMs work better with frameworks that have more rigid guidance. Something like Tailwind has a body of examples that work together, language to reason about the behavior needed, higher levels of abstraction (potentially), etc. This seems to be helpful.
The LLMs can certainly use raw CSS, and it works well; the challenge is when you need consistent framing across many pages with mounting special cases, and the LLMs may extrapolate small inconsistencies even further. If you stick within a rigid framework, there should be fewer inconsistencies across a larger project (in theory, at least).
Professional legal services seem to be picking up steam. Which sort of makes sense as a natural follow-on to programming, given that 'the law' is basically codified natural language.
I don't know how it is in other countries, but in the UK using LLMs for any form of paid legal services is strictly forbidden, and would also be insanely embarrassing. Like, 'turns out nobody had any qualifications and they were sending all the work to mechanical Turks in third world countries, who they refused to pay' levels of embarrassing.
I say this as someone who once had the bright idea of sending deadline reminders, complete with full names of cases, to my smart watch. It worked great and made me much more organised until my managers had to have a little chat about data protection and confidentiality and 'sorry, what the hell were you thinking?'.
I am no stranger to embarrassing attempts to jump the technological gun, or the wonders of automation in time saving.
But absolutely nobody in any professional legal context in the UK, that I can imagine, would use LLMs with any more gusto and pride than an industrial pack of diarrhoea relief pills or something - if you ever saw it in an office, you'd just hope it was for personal use and still feel a bit funny about shaking their hands.
While it doesn't seem we can agree on a meaning for AGI, I think a lot of people think of it as an intelligent entity that has 100% agency.
Currently we need to direct LLMs from task to task. They don't yet possess the capability of full real-world context.
This is why I get confused when people talk about AI replacing jobs. It can replace work, but you still need skilled workers to guide them. To me, this could result in humans being even more valuable to businesses, and result in an even greater demand for labor.
If this is true, individuals need to race to learn how to use AI and use it well.
> Currently we need to direct LLMs from task to task.
Agent loops that can work from larger-scale goals work just fine. We can't let them run with no oversight, but we certainly don't need to micro-manage every task either. Most days I'll have 3-4 agent loops running in parallel, executing whole plans, that I only check in on occasionally.
I still need to review their output occasionally, but I certainly don't direct them from task to task.
I do agree with you that we still need skilled workers to guide them, so I don't think we necessarily disagree all that much, but we're past the point where they need to be micromanaged.
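To make the "whole plans, occasional check-ins" style concrete, here is a toy sketch. Everything in it (`execute_step`, `run_plan`, the plan itself) is a made-up stand-in for whatever a real agent harness does (LLM calls, tool use, file edits), just to illustrate oversight at milestones rather than per task:

```python
# Toy agent loop: works through an entire plan autonomously and only
# surfaces for human review at milestones, not on every single task.

def execute_step(step: str) -> str:
    # Placeholder for real agent work (LLM call, tool invocation, etc.).
    return f"done: {step}"

def run_plan(plan: list[str], review_every: int = 3) -> list[str]:
    results = []
    for i, step in enumerate(plan, start=1):
        results.append(execute_step(step))
        # Pause for human oversight only occasionally.
        if i % review_every == 0:
            print(f"[check-in] {i}/{len(plan)} steps complete")
    return results

plan = ["scaffold project", "write parser", "add tests",
        "fix failures", "write docs", "open PR"]
results = run_plan(plan)
print(len(results))  # 6
```

The design point is the `review_every` knob: oversight still exists, it's just batched instead of interleaved with every task.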
If we can't agree on a definition of AGI, then what good is it to say we have "human-in-the-loop AGI"? The only folks that will agree with you will be using your definition of AGI, which you haven't shared (at least in this posting). So, what is your definition of AGI?
They know that LLMs as a product are racing towards commoditization. Bye bye profit margins. The only way to win is regulation allowing a few approved providers.
They are more likely trying to race towards wildly overinflated government contracts because they aren't going to profit how they're currently operating without some of that funny money.
Yes, which is why the companies that develop the models aren't cost-viable. (Google and others who can subsidize it at a loss are obviously excepted)
Where is the return on the model development costs if anybody can host a roughly equivalent model for the same price and completely bypass the model development cost?
Your point is in line with the entire bear thesis on these companies.
For any use cases that are analytical/backend oriented and don't scale 1:1 with the number of users (of which there are a lot), you can already run a close-to-cutting-edge model on a few thousand dollars of hardware. I do this at home already.
Open-source models are still a year or so behind the SotA models released in the last few months. The price-to-performance is definitely in favor of open-source models, however.
DeepMind is actively using Google’s LLMs on groundbreaking research. Anthropic is focused on security for businesses.
For consumers it's still a better deal to pay for a subscription than to invest a few grand in a personal LLM machine. There will come a time when diminishing returns shorten this gap significantly, but I'm sure top LLM researchers are planning for this and will do whatever they can to keep their firms alive beyond the cost of scaling.
I am not suggesting these companies can't pivot or monetize elsewhere, but the return on developing a marginally better model in-house does not really justify the cost at this stage.
But to your point, research, drugs, security audits, or any other kind of services are all monetization of the application of the model, not monetization of the development of new models.
Put more simply, say you develop the best LLM in the world, that's 15% better than peers on release at the cost of $5B. What is that same model/asset worth 1 year later when it performs at 85% of the latest LLM?
Already, any 2023 and perhaps even 2024 vintage model is dead in the water and worth close to zero.
What is a best in class model built in 2025 going to be worth in 2026?
The asset is effectively 100% depreciated within a single year.
(Though I'm open to the idea that the results from past training runs can be reused for future models. This would certainly change the math)
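The depreciation argument above can be put in numbers. This is a back-of-the-envelope sketch using only the made-up figures from the comment ($5B cost, 15% edge at release, 85% of SotA a year later); none of them are real company data:

```python
# Back-of-the-envelope model depreciation, using the hypothetical
# numbers from the comment above.

dev_cost = 5_000_000_000       # assumed training + R&D cost, dollars
release_edge = 1.15            # 15% better than peers at launch
relative_after_1y = 0.85       # vs. the newest frontier model a year on

# If willingness-to-pay tracks relative capability, the pricing power
# the $5B bought swings from +15% to -15% within a single year.
edge_lost = release_edge - relative_after_1y
print(f"capability edge swing: {edge_lost:.2f}")

# Straight-line amortization over that one-year useful life: the asset
# must earn back roughly $417M per month before it is leapfrogged.
monthly_recovery = dev_cost / 12
print(f"required recovery per month: ${monthly_recovery:,.0f}")
```

Under these assumptions the asset is fully depreciated in twelve months, which is the "100% depreciated within a single year" claim stated as arithmetic.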
For sure, all these companies are racing to have the strongest model, and as time goes on we quickly start reaching diminishing returns. DeepSeek came out at the beginning of this year, blew everyone's minds, and now look at how far the industry has progressed beyond it.
It doesn't even seem like these companies are in a battle of attrition to avoid being the first to go bankrupt. Watching this would be a lot more exciting if that were the case! I think if there were less competition, LLM developers could slow down, maybe.
Looking at the inference prices of open-source models, I would bet proprietary models are making a nice margin on API fees, but there is no way OpenAI will make their investors whole by making a few dollars of revenue per million tokens. I am terrified of the world we will live in if OpenAI does manage to turn its balance sheet around. I think there's nowhere else investors want to put their money.
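The "nice margin on tokens, but investors not made whole" tension is easy to illustrate. Every number below is an assumed placeholder for illustration, not any provider's actual pricing or cost:

```python
# Hypothetical unit economics for API inference. All figures are
# assumptions for illustration only.

price_per_m_tokens = 10.00   # assumed API price per 1M output tokens
compute_cost_per_m = 2.50    # assumed GPU + serving cost per 1M tokens

gross_margin = (price_per_m_tokens - compute_cost_per_m) / price_per_m_tokens
print(f"gross margin on inference: {gross_margin:.0%}")  # 75%

# The catch: a healthy token margin says nothing about recouping R&D.
training_run = 1_000_000_000          # assumed model development cost
profit_per_m = price_per_m_tokens - compute_cost_per_m
tokens_to_break_even = training_run / profit_per_m * 1_000_000
print(f"tokens served to recoup training: {tokens_to_break_even:.2e}")
```

Even with a fat per-token margin, the break-even volume on the development cost is astronomical, which is why per-token profitability and investor returns are separate questions.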
The other nightmare for these companies is that any competitor can use their state-of-the-art model to train another model, as some Chinese models are suspected of doing. I personally think it's only fair, since those companies trained on a ton of data in the first place and nobody agreed to it. But it shows that training frontier models has really low returns on investment.
It is unclear. Everyday I seem to read contradictory headlines about whether or not inference is profitable.
If inference has significant profitability and you're the only game in town, you could do really well.
But without regulation, as a commodity, the margin on inference approaches zero.
None of this even speaks to recouping the R&D costs it takes to stay competitive. If they're not able to pull up the ladder, these frontier model companies could have a really bad time.
Yeah, but we can self-host them. At this point it's more about infrastructure and compute power to meet demand, and Google won because it has many business models, massive cash flow, TPUs, and existing infrastructure to build on. It would take a new company ~25 years to map out compute, build data centers, and have a viable, tangible infrastructure, all while trying to figure out profits.
I'm not sure about how the regulation of things would work, but prompt injections and whatever other attacks we haven't seen yet where agents can be hijacked and made to do things sounds pretty scary.
It's a race towards AGI at this point. Not sure if that can be achieved as language != consciousness IMO
Who is "we", and what are the actual capabilities of the self-hosted models? Do they do the things that people want/are willing to pay money for? Can they integrate with my documents in O365/Google Drive or my calendar/email in hosted platforms? Can most users without a CS degree and a decade of Linux experience actually get them installed or interact with them? Are they integratable with the tools they use?
Statistically close to "everyone" cannot run great models locally. GPUs are expensive and niche, especially with large amounts of VRAM.
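The VRAM point can be made concrete with simple arithmetic: weight memory alone is parameter count times bytes per parameter. The model sizes and quantization levels below are common open-weights configurations chosen for illustration, and the figures ignore KV cache and activations, which add more:

```python
# Rough VRAM needed just to hold model weights:
# params x (bits per param / 8), ignoring KV cache and activations.

def weights_gb(params_billions: float, bits_per_param: int) -> float:
    """Gigabytes of memory for the weights alone."""
    return params_billions * 1e9 * bits_per_param / 8 / 1e9

for params, bits in [(8, 16), (8, 4), (70, 16), (70, 4)]:
    print(f"{params}B @ {bits}-bit: {weights_gb(params, bits):.0f} GB")

# Even an 8B model at 16-bit needs 16 GB, beyond most consumer GPUs;
# a 70B model at 4-bit still needs ~35 GB before any cache overhead.
```

Consumer cards mostly top out at 8-24 GB of VRAM, so "everyone can just run great models locally" runs into this arithmetic fast.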
>It's a race towards AGI at this point. Not sure if that can be achieved as language != consciousness IMO
However, it is arguable that thought is related to consciousness. I'm aware that non-linguistic thought exists and is vital to any definition of consciousness, but LLMs technically don't think in words, they think in tokens, so I could imagine this getting closer.
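The tokens-not-words point is easy to demonstrate. Here is a toy greedy longest-match tokenizer over an invented vocabulary; real models use learned BPE or unigram vocabularies, but the effect (token boundaries not lining up with word boundaries) is the same:

```python
# Toy longest-match tokenizer over a made-up vocabulary, illustrating
# that a single word can split into several tokens.

VOCAB = {"think", "ing", "token", "s", " "}

def tokenize(text: str) -> list[str]:
    tokens = []
    i = 0
    while i < len(text):
        # Greedily take the longest vocabulary entry starting at i.
        for j in range(len(text), i, -1):
            if text[i:j] in VOCAB:
                tokens.append(text[i:j])
                i = j
                break
        else:
            raise ValueError(f"no token covers position {i}")
    return tokens

print(tokenize("thinking tokens"))  # ['think', 'ing', ' ', 'token', 's']
```

"thinking" becomes two tokens and "tokens" becomes two more, so whatever the model is doing internally, its atomic units are sub-word pieces, not words.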
'think' is one of those words that used to mean something but is now hopelessly vague; in discussions like these it becomes a blunt instrument. IMO LLMs don't 'think' at all: they predict what their model is most likely to say based on previously observed patterns. There is no world model or novelty. They are exceptionally useful idea-adjacency lookup tools. They compress and organize data in a way that makes it shockingly easy to access, but they only 'think' in the way the Dewey decimal system thinks.
If we were having this conversation in 2023 I would agree with you, but LLMs have advanced so much that calling them essentially efficient lookup tables is an oversimplification so dramatic I know you don't understand what you're talking about.
No one accuses the Dewey decimal system of thinking.
If I am so ignorant maybe you'd like to expand on exactly why I'm wrong. It should be easy since the oversimplification is dramatic enough that it made you this aggressive.
I'm not the other poster but he's probably referring to how your comment seems to only be talking about "pure" LLMs and seems pretty out of date, whereas most tools people are using in 2025 use LLMs as glue to stitch together other powerful systems.
The bottleneck for commoditization is hardware. The manufacture of the hardware required is led by TSMC, with Samsung a close second. The tooling required for manufacture is centralized with ASML and a few smaller players like Zeiss, and the design of the product centers around Nvidia, though there are players like AMD who are attempting to catch up.
It is a complex supply chain but each section of the chain is held by only a few companies. Hopefully this is enough competition to accelerate the development of computational technologies that can run and train these LLMs at home. I give it a decade or more.
Another way to win is through exclusive access to high quality training data. Training data quality and quantity represent an upper bound on LLM performance. That's why the frontier model developers are investing some of their "war chests" in purchasing exclusive rights to data locked up behind corporate firewalls, and even hiring human subject matter experts in order to create custom proprietary training data in certain strategic domains.
That's a good line but it only works if market forces don't commoditize you first. Blithely saying "commoditize your complement" is a bit like saying "draw the rest of the owl."
Free models given away by social media companies (because they want people to generate content) and hardware companies (because they want people to buy GPUs, or whatever replaces them). Can the current subscription models compete with free? It's just a prediction - it could well be wrong.
That would be true in a monopolistic market. But these frontier models are all competing against each other. The incentive to 'just work and get shit done fast' is there as they each try to gain market share.
Google is the only USA based frontier lab releasing open models. I know they aren't doing it out of the goodness of their hearts.