Hacker News | grewil2's comments

Side note: I don’t know what Anthropic changed but now Claude Code consumes the quota incredibly fast. I have the Max5 plan, and it just consumed about 10% of the session quota in 10 minutes on a single prompt. For $100/month, I have higher expectations.


That explains things. I'm getting this:

    API Error: 400 {"error":{"message":"Budget has been exceeded! Current cost: 271.29866200000015, Max budget: 200.0","type":"budget_exceeded","param":null,"code":"400"}}

So I completely ran out of tokens, and I haven't even used it at all for the past couple of days; last week my usage was very light. Actually, scratch that: all my usage has been very light since I got this plan at work. It's an enterprise subscription, I believe; hard to tell, since it doesn't connect directly to Anthropic but goes through a proxy on Azure.

I'm not liking this at all; it's so flaky and opaque. It's not possible to get a breakdown of what the usage went on, right? Do we have to contact Anthropic for a refund, or will they restore the bogus usage?


A serious problem here is that it's nearly impossible to understand what a "token" is and how to tame token use in a principled way.

It's like if cars didn't advertise MPG, but instead something that could change randomly.
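To get even a rough feel for the unit, a common rule of thumb (an approximation only, not any model's real tokenizer) is about four characters of English text per token:

```python
def estimate_tokens(text: str) -> int:
    """Rough heuristic: English prose averages ~4 characters per token.
    Real BPE tokenizers vary by model, so treat this as a ballpark only."""
    return max(1, len(text) // 4)

prompt = "Refactor this function to use a dict lookup instead of if/else chains."
print(estimate_tokens(prompt))
```

Even this crude estimate at least gives users a stable yardstick, which is more than the billing dashboards currently offer.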


Like if cars measured fuel efficiency or range using the knobs in the tread on your tire.

Relevant post: https://modal.com/blog/dollars-per-token-considered-harmful

(disclaimer: I work with the author)


I completely agree that requests are what should be charged for. But I think there are two things, given that requests aren't all going to cost the same amount:

1. Estimate-free: invoicing the requests and letting users figure it out after the fact.

2. Somehow estimating cost up front and telling users how much a request will cost.

We have 1, we want 2.
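Option 2 could start as a simple pre-flight estimate shown before the request is sent. This sketch uses hypothetical per-token prices and an assumed output-length guess; real rates vary by model:

```python
# Hypothetical prices, for illustration only (real rates differ per model).
PRICE_PER_MTOK = {"input": 3.00, "output": 15.00}  # dollars per million tokens

def estimate_request_cost(input_tokens: int, expected_output_tokens: int) -> float:
    """Pre-flight cost estimate to show the user before sending a request."""
    cost = (input_tokens * PRICE_PER_MTOK["input"]
            + expected_output_tokens * PRICE_PER_MTOK["output"]) / 1_000_000
    return round(cost, 4)

# e.g. a 50k-token context with ~2k tokens of expected output:
print(estimate_request_cost(50_000, 2_000))
```

The hard part is the `expected_output_tokens` guess, which is exactly the verbosity problem people complain about below.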


Also, certain models are more verbose than others. We are basically at the mercy of a model that likes to ramble a lot.

I'm fairly certain the knob on the machine that controls the length of redundant comments and docblocks is cranked to 11. It makes me curious how much of their bottom line is driven by redundant comment output.

[flagged]


Please do not bot HN.

Anthropic really needs to open-source Claude Code.

One of the biggest turnoffs as a Claude Code user is the CC community cargo-culting on the subreddit, because community outreach is otherwise poor.


Looks like your wish was accidentally granted :)

Hilarious. Makes you hope the CC team sticks with it; there's so much goodwill to be had by doing so.

I noticed 1M context window is default and no way not to use it. If your context is at 500-900k tokens every prompt, you’re gonna hit limits fast.

I had to double check that they'd removed the non-1M option, and... WTF? This is what's in `/config` → `model`

    1. Default (recommended)    Opus 4.6 with 1M context · Most capable for complex work
    2. Sonnet                   Sonnet 4.6 · Best for everyday tasks
    3. Sonnet (1M context)      Sonnet 4.6 with 1M context · Billed as extra usage · $3/$15 per Mtok
    4. Haiku                    Haiku 4.5 · Fastest for quick answers
So there's an option to use non-1M Sonnet, but not non-1M Opus?

Except wait, I guess that actually makes sense, because it says Sonnet 1M is billed as extra usage... but also WTF, why is Sonnet 1M billed as extra usage? So Opus 1M is included in Max, but if you want the worse model with that much context, you have to pay extra? Why the heck would anyone do that?

The screen does also say "For other/previous model names, specify with --model", so I assume you can use that to get 200K Opus, but I'm very confused why Anthropic wouldn't include that in the list of options.

What a strange UX decision. I'm not personally annoyed, I just think it's bizarre.


`/model opus` sets it to the original non-1M Opus... for now.

Thanks. I quickly burned through $100 in credit when I started using Opus 4.6 in OpenCode via OpenRouter. My session stopped with an error that said nothing about credit availability, so I was surprised when, a few minutes later, I finally realized Opus had destroyed those credits in a bullshit reasoning loop it got stuck in. Anthropic seems to know that the expanded context is better for their bottom line, as they've made it the default now.

And as others have said it's very easy to burn token usage on the $100/month plan. It's getting to the point where it's going to very much make sense to do model routing when using coding tooling.


Not sure why you were downvoted because this is actually correct. Can also use --model opus

    export CLAUDE_CODE_DISABLE_1M_CONTEXT=1

Anthropic is not building good will as a consumer brand. They've got the best product right now but there's a spring charging behind me ready to launch me into OpenCode as soon as the time is right.

Would you use Opus if you switched to OpenCode?

I'd like to use Opus with OpenCode right now to combine the best TUI agent app with the best LLM. But my understanding is Anthropic will nuke me from orbit if I try that.

You can use Opus with OpenCode anytime you want, just not with the Claude plan. You can use it via API with any provider, including Anthropic's API. You can use it with Github Copilot's plan. The only thing you can't do without getting banned is use OpenCode with one of Claude's plans.

I keep seeing this "you can use the inconvenient and unpredictably costly way all you want" pedantic kneejerk response so often lately.

It's like saying well humans can fly with a paraglider. It is correct and useless. Most here won't have cash to burn with unbounded opus api usage.


If you want to use Opus with a different coding harness along with a coding plan, you can use Github CoPilot. It even has built in authentication with OpenCode.

OpenCode with a Copilot Business sub and Opus 4.6 as the model works well

I'm looking at their plans (https://github.com/features/copilot/plans) it seems like the limits might be pretty low, even with the Pro+ plan which is 2x the cost of Claude Pro. It seems like Claude Pro might be 10-20x the Opus tokens for only twice the price.

Copilot has a totally different billing model: it's request-based rather than token-based. Counter-intuitively, in our case at least, it is way cheaper than token-based pricing. One request can sometimes consume 2-4 million tokens but is billed as a single request (or its multiplier, if using a premium model like Opus).
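The difference between the two billing models can be sketched with assumed numbers (the prices and the 10x premium multiplier here are illustrative, not actual Copilot or Anthropic rates):

```python
# Illustrative comparison of token-based vs request-based billing.
# All prices and multipliers below are assumptions, not real published rates.

def token_billed_cost(tokens: int, dollars_per_mtok: float) -> float:
    """Token-based: you pay for every token the request consumes."""
    return tokens * dollars_per_mtok / 1_000_000

def request_billed_cost(requests: int, dollars_per_request: float,
                        premium_multiplier: float = 1.0) -> float:
    """Request-based: a flat per-request price, times a premium-model multiplier."""
    return requests * dollars_per_request * premium_multiplier

# A single agentic request that churns through 3M tokens:
print(token_billed_cost(3_000_000, 15.0))    # token-based cost
print(request_billed_cost(1, 0.04, 10))      # request-based, 10x premium model
```

With numbers like these, a long agentic session that fans out into millions of tokens is dramatically cheaper under per-request billing, which matches the commenter's experience.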

do you pay for the full context every prompt? what happened with the idea of caching the context server side?

You don't. Most of the time (after the first prompt following a compaction or context clear) the context prefix is cached, and you pay something like 10% of the full price for cached tokens. But your total cost is still roughly the area under a line with positive slope, so it increases quadratically with context length.
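That arithmetic can be sketched as follows; the 10% cached-token rate matches the comment above, while the $3/Mtok base price is just an assumption for illustration:

```python
# Sketch of cumulative conversation cost with prefix caching.
# Assumes cached tokens cost 10% of the full rate (illustrative pricing only).
def conversation_cost(turns: int, tokens_per_turn: int,
                      dollars_per_mtok: float = 3.0,
                      cache_discount: float = 0.1) -> float:
    total = 0.0
    context = 0
    for _ in range(turns):
        # Previously seen context is re-read from cache at a fraction of the
        # price; only the new turn's tokens are billed at the full rate.
        total += (context * cache_discount + tokens_per_turn) * dollars_per_mtok / 1e6
        context += tokens_per_turn
    return total

# Doubling the conversation length more than doubles the total cost,
# because the cached-context term grows quadratically:
print(conversation_cost(10, 10_000))
print(conversation_cost(20, 10_000))
```

So caching flattens the slope considerably, but the per-prompt cost still climbs with context length, which is why long sessions burn quota so fast.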

It helps a ton but it doesn't last forever and you still have to pay to write to the cache

I've been jumping from Claude -> Gemini -> GPT Codex. Both Claude and Gemini really reduced quotas and so I cancelled. Only subbed GPT for the special 2x quota in March and now my allocation is done as well.

I decided to give opencode a try today. It's $5 for the first month. I didn't get much success with Kimi K2: overly chatty, built too-complex solutions. Burned 40% of my allocation and nothing worked. ¯\_(ツ)_/¯

But Minimax m2.7. Wow, it feels just like Claude Opus 4.6. Really has serious chops in Rust.

Tomorrow/Wednesday will try a month of their $40 plan and see how it goes.


Minimax 2.7 is great. Not close to Claude but good enough for a lot of coding tasks.

GLM-5 (and 5.1) is surprisingly impressive too I’m finding.

They need to get to profitability because that sweet sweet Saudi subsidy cash is gone gone.

They won't be profitable at this point... they just don't realise they are eating their own tail.

I've heard this a few times lately, but this past weekend I built a website for a friend's birthday, and it took me several hours and many queries to get through my regular paid plan. I just use default settings (Sonnet 4.6, medium effort, thinking on).

I'm guessing Opus eats up usage much, much faster. I don't know what's going on, since a lot of people are hitting limits and I don't seem to be.


Update: Maybe the difference is that I think I was just using the vscode extension at the time: https://news.ycombinator.com/item?id=47586176

I go back and forth between vscode and claude in the terminal, but that day I think I did vscode.


what they changed was peak vs off-peak usage metering.

using it on the weekend gets you more use than during weekdays 9-5 in US eastern time.


I waited until off peak hours to use Opus 4.6 to do some research. One prompt consumed 100% of my 5h limit and 15% of my weekly usage. Even off peak it's still insane. Opus didn't even manage to finish what it was doing.

I'm surprised it's during east coast working hours and not west coast.

The speculation I read was that it's trading hours, and they're getting a lot of load from the finance industry.

Technically, this was Friday morning, so I think I was still in peak hours.

Even with Opus I don’t usually hit limits on the standard plan. But I am not doing professional work at the moment and I actually alternate between using the LLM and reading/writing code the old fashioned way. I can see how you’d blow through the quota quickly if you try to use LLMs as universal problem solvers.

Have had similar issues with costs sometimes being all over the map. I suspect that the major providers will figure this out as it’s an important consideration in the enterprise setting

This is a very normal thing to be the top comment on an article on how to use Claude Code.

I'm very surprised to see enshittification starting so early. I was expecting at least 3-4 years of VC-subsidized gravy train.

This has been 6 months of constant decline so at this point I am wondering when they cliff it like wework

Looks like they are falling victim to their own slop. This smells a lot like the Amazon outages caused by mandated clanker usage.

Reminds me of when I would mess with my friends on "pay per text" plans by sending them 10 text messages instead of just 1. I should start paying attention to unattended laptops and blow up some token usage in the same manner.

It's almost like an evolution of bobby tables.


things are rough out there right now

Since it’s a 1.44M image I assume they use 3.5” diskettes. The terms floppy and diskette are used as synonyms today, but the different names make sense, since floppies are flexible and “floppy”. Diskettinux?


Docker won’t save you from prompt injections that attack your network.


No kidding? https://taoofmac.com/space/blog/2026/01/12/1830

Still, I don’t think bubblewrap is either a simple or safe enough solution.


It won’t save you from prompt injections that attack your network.


Shameless plug, in case you're interested: https://github.com/EstebanForge/construct-cli

Let me know if you give it a go ;)


Interesting, any plans to add LiteLLM (https://github.com/BerriAI/litellm) and Kilocode (https://github.com/Kilo-Org/kilocode)?



Will check those out :)


In theory, the docker container should only have the projects directory mounted, open access to the internet, and that's it. No access to anything else on the host or the local network.

Internet to connect with the provider, install packages, and search.

It's not perfect but it's a start.


Docker containers run in their own separate, isolated network.


Of course, I'm not pretending this is a universal remedy that solves all the problems. But I will add a note in the readme to make it clear. Thanks for the feedback!


Let’s not forget Amiga 3000UX - a workstation released with Amiga Unix, a full port of AT&T Unix System V Release 4 (SVR4). Notable users include Free Software Foundation staff programmers who used it at MIT to help further some early development of the GNU operating system.

https://en.m.wikipedia.org/wiki/Amiga_3000UX


The 3000UX was dead on arrival

It cost as much as an early 1990s UNIX workstation but it featured the technology of the 1980s, so it was extremely slow by the standards of the day.

For the price of a 3000UX, you could buy an SGI with 10x the CPU power, or a Sun with 10x as many pixels on the display. It was a really, really bad deal. As per usual for Commodore, too little, too late.


This is very annoying - I get this too now, and there is no way to make it go away (except for enabling watch history I assume). Very bad!


> - What is a belief you had as a child that you no longer have? ...

Hm, some of these questions resemble the personal questions some companies use for password recovery. As a paranoid person, I am reluctant to disclose this information to unknown people.


But you should never give the correct answer for password recovery.

What is a belief you had as a child that you no longer have?

Purple Ocean.

And that can be your answer for all the questions: first pet name, elementary school, and so on.


> But you should never give the correct answer for password recovery.

Exactly. You're supposed to use your password manager to generate a second password and use that as your answer. I know this sounds stupid but it is the only way to stay safe.
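A minimal sketch of that approach, using Python's standard `secrets` module to generate a random, unguessable "answer" that you then store in your password manager alongside the account's password:

```python
import secrets

def security_answer(nbytes: int = 16) -> str:
    """Generate a random URL-safe string to use as a security-question
    'answer' -- never derived from real personal information."""
    return secrets.token_urlsafe(nbytes)

# One fresh answer per question ("first pet", "elementary school", ...):
print(security_answer())
```

`secrets` is preferable to `random` here because it uses the OS's cryptographically secure randomness source.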


The only problem with this is if you have to read it to someone live. Otherwise, yes!


I've heard stories of call centers accepting "oh, I don't remember, I just typed a bunch of random letters and numbers" as confirmation over the phone.


Very good idea!


Good idea!


You already started the discussion :)


Ah, for me it was cosmic-deflect-attitude. I really struggled with it in my teenage years.


How could that possibly be a password recovery question? A password recovery question should only have one correct answer.


Yep. Not to mention sharing your true opinions on deep/intimate topics to strangers is generally a terrible idea especially on camera.

Also, these types of posts make me wonder what the goals of the project are. Is the intent here to gather data and sell it? Gather users and sell the company? I don't believe in saints. I don't believe in the "don't be evil" mantra.

For all the talk about privacy and anonymity, seems many here want to give away their privacy and anonymity.


Have you tried applying for a position in a public-sector IT department? The public sector should not age-discriminate, at least in theory.


I think he was great in The Emigrants and The New Land.


Sounds more like an index of beginner languages.

"The PYPL PopularitY of Programming Language Index is created by analyzing how often language tutorials are searched on Google."

