wouldbecouldbe · 2026-03-28T12:40:21 1774701621

This is really great, anyone know of a Dutch version?

layer8 · 2026-03-28T13:39:59 1774705199

https://news.ycombinator.com/item?id=47554274

wouldbecouldbe · 2026-03-28T16:48:07 1774716487

Yeah, that's a great start. However, the md files for every change are really helpful for going through the history and understanding the steps with LLMs.

wouldbecouldbe · 2026-03-27T22:10:17 1774649417

I exclusively use complete fullscreen mode for apps I'm actively using. On large screens I connect the workspaces; on small screens I swipe back and forth. So I never actually use that.

wouldbecouldbe · 2026-03-27T11:57:01 1774612621

Yeah that's the skeptical key point.

The practical key point is: if you want to do a large migration, have a very good and extensive test suite that Claude is not allowed to change during the migration. Then Claude is extremely impressive and accurate at migrating your codebase and needs minimal handholding. If you don't have a test suite, Claude will be freewheeling all the way. I just did an extensive migration project, and should have focused on the test suite much more.
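
A minimal sketch of one way to enforce the "not allowed to change" part, assuming the tests live in a single directory. The function names are hypothetical, not from any tool: fingerprint every test file before the migration starts, fingerprint again afterwards, and fail if anything differs.

```python
import hashlib
from pathlib import Path


def fingerprint_tests(test_dir: str) -> dict[str, str]:
    """Map each file under test_dir to the SHA-256 hex digest of its contents."""
    root = Path(test_dir)
    return {
        p.relative_to(root).as_posix(): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def changed_tests(before: dict[str, str], after: dict[str, str]) -> list[str]:
    """Return the paths that were added, removed, or modified between snapshots."""
    return sorted(
        path
        for path in before.keys() | after.keys()
        if before.get(path) != after.get(path)
    )
```

Take the first snapshot before handing the codebase over, then treat a non-empty `changed_tests(...)` result as a failed run, the same way a failing test would be.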

helpfulfrond · 2026-03-28T13:32:24 1774704744

Yeah, apparently the original library has nearly 4,000 tests. This would have been impossible without those. This speaks to the power of testing. The lack of discussion here also shows how under-valued it is.

wouldbecouldbe · 2026-03-28T13:37:02 1774705022

Testing in the human era was, I think, less useful. Too many tests would lead to high maintenance costs. In the AI era it's a lot easier to manage.

wouldbecouldbe · 2026-03-25T13:16:42 1774444602

Out of all things, you have to be a sadist to be passionate about pest control. Even though necessary at times, it's not a very clean job.

Sounded more like he is passionate about building a business.

tezclarke · 2026-03-25T14:10:33 1774447833

It’s the business characteristics I like. Recurring and one-off revenue, big market, growing, regulations. The exam barrier to entry rather than a 4-year apprenticeship like plumbing.

wouldbecouldbe · 2026-03-24T11:30:38 1774351838

Some of us love it. It's a bit intense sometimes, but fun. So I guess we get to decide for ourselves what we prefer.

I know many will then say, BUT QUALITY, but if you learn to deal with your own quirks and Claude's, you also learn how to validate and verify more efficiently. And experience helps here.

wouldbecouldbe · 2026-03-15T09:42:13 1773567733

Still, the point stands that fraud is at times punished more harshly than rape or child molestation.

danielheath · 2026-03-15T11:36:28 1773574588

The fraud isn’t what he’s being punished for.

The ongoing refusal to answer questions under oath is.

He could have agreed to talk anytime and been released shortly.

pc86 · 2026-03-15T16:38:15 1773592695

I understand being in contempt for not answering a question generally, but I'm curious how this doesn't fall under 5th amendment protections.

drdec · 2026-03-15T17:12:16 1773594736

IANAL

It's a civil proceeding, not a criminal proceeding, so he would not be incriminating himself.

He could argue that by answering he would be admitting crimes and opening himself to criminal liability. But there's a possibility they give him immunity and that route is taken away.

pc86 · 2026-03-15T19:17:39 1773602259

IANAL either, but I'm not sure anyone involved in the civil case would have the power or authority to grant criminal immunity (perhaps up to and including the judge; at least local to me, the civil judges do not do criminal cases, so there is no overlap).

drdec · 2026-03-15T23:32:15 1773617535

Yes, I agree that would need to involve the DA.

superkuh · 2026-03-15T15:02:20 1773586940

It sure would be nice if this standard of conduct in court were also upheld for the US federal officials who refuse to answer or straight up bold faced lie in court. But nah, it only ever happens to normal people.

andrepd · 2026-03-15T10:24:22 1773570262

Rape and child molestation are often, unfortunately, hard to prove in a court of law. This case is the opposite.

graemep · 2026-03-15T10:42:59 1773571379

You are missing the point. When these crimes are proved in court they get lower sentences. The lower conviction rates are unavoidable. The shorter sentences are not.

I remember once reading two bits of news about people given similar sentences. One for copyright infringement, the other for sexual assault of a teenager.

pluc · 2026-03-15T11:46:25 1773575185

Money is more valuable than people

lazide · 2026-03-15T15:02:09 1773586929

Well, practically, when I tried to buy that yacht with my 10 year old, they threatened me with more jail time… (/s)

pluc · 2026-03-15T17:32:54 1773595974

There's a certain client list you might be interested in

wouldbecouldbe · 2026-03-14T15:06:05 1773500765

I feel like a few weeks ago I suddenly had a week where, even after 3 messages, it forgot what we did. Seems fixed now.

wouldbecouldbe · 2026-03-12T21:48:48 1773352128

I think that's why it's so good: it works with half-assed assumptions and poorly written prompts, and it fills in everything that's missing.

vidarh · 2026-03-12T22:18:37 1773353917

I worked on a project that did fine tuning and RLHF[1] for a major provider, and you would not believe just how utterly broken a large proportion of the prompts (from real users) were. And the project rules required practically reading tea leaves to divine how to give the best response even to prompts that were not remotely coherent human language.

[1] Reinforcement learning from human feedback; basically participants got two model responses and had to judge them on multiple criteria relative to the prompt

redman25 · 2026-03-13T02:41:54 1773369714

I feel like the right response for those situations is to start asking questions of the user. It’s what a human would do if they did not understand.

vidarh · 2026-03-13T07:38:51 1773387531

I made the argument multiple times that the right answer to many prompts would be a question, and it was allowed under some rare circumstances, but far too few.

I suspect that was in part because the provider also didn't want to create an easy cop-out for the people working on the fine-tuning part (a lot of my work was auditing and reviewing output, and there was indeed a lot of really sloppy work, up to and including cutting and pasting output from other LLMs - we know, because on more than one occasion I caught people who had managed to include part of Claude's website footer in their answer...)

winterqt · 2026-03-14T04:23:50 1773462230

As in, participants would copy output from one LLM as a question to another?

XCSme · 2026-03-12T22:29:24 1773354564

To be honest, I had this "issue" too.

I upgraded to a new model (gpt-4o-mini to grok-4.1-fast), suddenly all my workflows were broken. I was like "this new model is shit!", then I looked into my prompts and realized the model was actually better at following instructions, and my instructions were wrong/contradictory.

After I fixed my prompts it did exactly what I asked for.

Maybe models should have another tunable parameter for how closely they should follow the user prompt. This reminds me of imagegen models, where you can choose the config/guidance scale/diffusion strength.

wouldbecouldbe · 2026-02-26T08:45:21 1772095521

Switching LLMs is like switching cars. It's a bit annoying in the beginning: it responds slightly differently, and you need to change your subconscious habits before it feels comfortable. That's why everyone always complains about new models. So unless there is a very obvious improvement, most users will prefer to stick with their current LLM.

checker659 · 2026-02-26T08:51:15 1772095875

That has not been my experience at all. My mom and dad were able to switch from ChatGPT to Gemini without any friction whatsoever. I myself round robin between Claude, Gemini and ChatGPT all the time.

wouldbecouldbe · 2026-02-16T10:02:45 1771236165

I just tried Claude; only Opus gave the correct answer. Haiku and Sonnet both told me to walk.