Should be possible with optimised models, just drop all "generic" stuff and focu...

jstummbillig · 2026-02-03T18:31:37 1770143497

There is: It works (even if we can't explain why right now).

If we knew how to create a SOTA coding model by just putting coding stuff in there, that is how we would build SOTA coding models.

moffkalast · 2026-02-03T18:22:15 1770142935

That's what Meta thought initially too, training codellama and chat llama separately, and then they realized they're idiots and that adding the other half of data vastly improves both models. As long as it's quality data, more of it doesn't do harm.

Besides, programming is far from just knowing how to autocomplete syntax, you need a model that's proficient in the fields that the automation is placed in, otherwise they'll be no help in actually automating it.

theshrike79 · 2026-02-03T21:13:08 1770153188

But as far as I know, that was way before tool calling was a thing.

I'm more bullish about small and medium sized models + efficient tool calling than I'm about LLMs too large to be run at home without $20k of hardware.

The model doesn't need to have the full knowledge of everything built into it when it has the toolset to fetch, cache and read any information available.

noveltyaccount · 2026-02-03T17:29:04 1770139744

I think I like coding models that know a lot about the world. They can disambiguate my requirements and build better products.

regularfry · 2026-02-03T17:52:05 1770141125

I generally prefer a coding model that can google for the docs, but separate models for /plan and /build is also a thing.

noveltyaccount · 2026-02-03T18:31:00 1770143460

> separate models for /plan and /build

I had not considered that, seems like a great solution for local models that may be more resource-constrained.

regularfry · 2026-02-03T18:49:11 1770144551

You can configure aider that way. You get three, in fact: an architect model, a code editor model, and a quick model for things like commit messages. Although I'm not sure if it's got doc searching capabilities.

MarsIronPI · 2026-02-03T17:34:18 1770140058

But... but... I need my coding model to be able to write fanfiction in the comments...

wongarsu · 2026-02-03T22:25:19 1770157519

Now I wonder how strong the correlation between coding performance and ao3 knowledge is in human programmers. Maybe we are on to something here /s