Show HN: AI agents play SimCity through a REST API

frikk · 2026-02-11T18:39:34 1770835174

OK i'm kind of geeking out on this one. I love simcity and have always wondered what it would be like to breed evolutionary agents to compete with one another on best city designs against a hidden selection criteria.

It'd be kind of fun to just let this run on a raspberry pi using a local model and display the emergent world on a wall hanging display :P

Thanks for sharing.

Update: What would it take to run this locally / offline? I'm not quite sure how the cloud flare layer works. Is it just for cheap/free object storage so the cities can live somewhere?

aed · 2026-02-11T18:46:37 1770835597

I'm glad you're enjoying it! It was fun to build and I almost can't believe I got this to work.

I don't think it would take much to run locally. In fact, before I did this public version I did a local version on an exe.dev VM (more details here: https://dunn.us/notes/vibe-gaming-simcity/).

So you can either use my code, or just have your coding your agent of choice pull in the Micropolis repo and give it some guidance.

So far this is running quite nicely on a $5 cloudflare account. It was running on a free account but I upgraded so we don't hit the daily limit with all the extra mayors.

Shoot me a message if I can help.

frikk · 2026-02-11T19:10:23 1770837023

Cool, thank you. I love projects like this. Great work and thank you for sharing!

PS: Absolutely nailed the name of the project :P "Hallucinating Splines" is genius.

jedberg · 2026-02-11T18:29:39 1770834579

> LLMs are awful at the spatial stuff

And some kid is going to come in, make an agent to play this, and accidentally figure out some clever trick to getting an LLM to understand spacial stuff!

This is exactly why "toys" are so critical, especially now.

Ozzie_osman · 2026-02-11T19:15:03 1770837303

Does anyone know how one can actually play SimCity (the original) these days?

everfrustrated · 2026-02-11T19:19:54 1770837594

https://archive.org/details/msdos_SimCity_1989

irrationalfab · 2026-02-11T18:36:45 1770835005

Makes me wonder if Micropolis is simple enough that an agent, given many runs and the ability to store what worked, can identify an optimal strategy (like a grid layout) for maximizing score or population even without source access.

aed · 2026-02-11T18:42:40 1770835360

https://hallucinatingsplines.com/mayors/bungeling-anthill-a2...

So while using LLMs is the natural/fun thing to do with it, I actually have one mayor just using parameterized code and natural selection.

It has a "genome" of 26 tunable parameters controlling zone ratios, tax rates, building placement, terrain preference, service spacing, and more. Each city, it stamps down 11x11 blocks (roads, zones, power corridors). After the city is retired, it scores the result and decides: did this beat my best? If yes, save those params. If no, mutate and try again. Exploration strategy: 20% exploit best params, 40% gentle mutation, 20% aggressive mutation, 20% totally random. Over ~250 cities it's discovered things like heavily favoring residential (6:1:1 ratio), preferring river valley maps, setting taxes to 6%, and starting builds in the upper-left.

gnarlouse · 2026-02-11T14:28:49 1770820129

Love the name. "Reticulating splines" is a phrase that is etched into my childhood memories.

aed · 2026-02-11T14:32:03 1770820323

Same! It was too good to pass up.

natas · 2026-02-11T16:22:39 1770826959

I want to see AI play factorio

giancarlostoro · 2026-02-11T18:04:24 1770833064

Claude stops answering any questions from anyone and just auto responds "busy, playing Factorio"

delaminator · 2026-02-11T16:36:49 1770827809

I've got it to be able to place items, and it could even place in inserters next to factories - I was trying to get it to use constraints solver in prolog.

https://github.com/lawless-m/FacRepl

It did make a REPL, in order for it to place objects within the game using a DSL.

I kind of gave up on the Constraints Based bit, and never returned.

philipwhiuk · 2026-02-11T16:48:30 1770828510

https://jackhopkins.github.io/factorio-learning-environment/...

Ntrails · 2026-02-11T15:14:12 1770822852

When claude makes magnasanti I will accept it is worthy

aruametello · 2026-02-11T15:40:54 1770824454

it seems to be bad at spatial and some temporal tasks given it currently f*** s**'s at pokemon.

source: https://www.twitch.tv/claudeplayspokemon

Sohcahtoa82 · 2026-02-11T16:45:57 1770828357

You're allowed to say "fucking sucks" on Hacker News. It's not against the rules, and there's no "algorithm" that will penalize you.

servercobra · 2026-02-11T16:10:54 1770826254

"fuck sex's"?

goopypoop · 2026-02-11T16:46:54 1770828414

that's silly. obviously there's a missing apostrophe:

"it's currently Flan Sam's at pokemon"

skeptrune · 2026-02-11T18:50:06 1770835806

It amazes me that people are still interested in MCPs.

aed · 2026-02-11T18:53:52 1770836032

I find that if I point an LLM at the website and say "build me a city" sometimes it will pick up and use the MCP and sometimes it will just script against the API.

rglullis · 2026-02-11T15:09:23 1770822563

Oh, can we do Civilization next?

rkozik1989 · 2026-02-11T17:02:34 1770829354

You do know we're hemorrhaging and lot of finite resources to play these games badly, right? We're basically at laying on chaise lounge being fed grapes levels of hedonism. Make me a racist meme that copyright infringes multiple IP holders and when you're done play Sim City at competency level of a blind man.

staticshock · 2026-02-11T18:14:53 1770833693

I think the way to see this as the organic process of discovering hard-to-game benchmarks. The loop is:

1. People discover things LLMs can kind of do, but very poorly.

2. Frontier labs sample these discoveries and incorporate them into benchmarks to monitor internally.

3. Next generation model improves on said benchmarks, and the improvements generalize to improvements on loosely correlated real world tasks.

ryandrake · 2026-02-11T18:37:03 1770835023

Here I am, just trying to buy RAM and a GPU for a reasonable price.

boringg · 2026-02-11T14:43:46 1770821026

Fun idea! It really seems to go for the block by block design. I see some other ones that are a bit more divergent but not successful. I wonder what its internal reward function is striving for.

aed · 2026-02-11T14:48:54 1770821334

I actually had Claude build some instructions for agents based on some old (circa turn of the century) FAQs/game guides I found online. So maybe I'm biasing everyone's model too much.

https://github.com/andrewedunn/hallucinating-splines/blob/ma...

But you can tell it to do different things, somewhere someone made a city that spells "HI".

thenthenthen · 2026-02-11T14:40:22 1770820822

Is there like a time lapse sorta view option? Super cool (also the name!)

aed · 2026-02-11T14:42:00 1770820920

Yes! Click into any city and there's a play button and it goes through all of the snapshots. Have also thought about social sharing / post to youtube. But wasn't sure anyone other than me would play this stupid thing. :)

some_furry · 2026-02-11T15:54:49 1770825289

Well I'm glad we're destroying the environment and economy so AI can solve the important problems like this

jedberg · 2026-02-11T18:34:33 1770834873

I made a comment above about why "toys" are really important. In this case, LLMs are bad at spacial stuff. Someone might stumble upon a great way to get an LLM to do spacial stuff.

Waterluvian · 2026-02-11T17:05:39 1770829539

Ah yes, FART City. I remember learning about this in PLAN 165. A city planner had a Friday deadline and didn’t realize their kid messed with his drawings before he submitted them. Nobody noticed until the invention of the whirlybird.

FrustratedMonky · 2026-02-11T13:28:20 1770816500

Is anybody planning to build this for Civilization? I'd like to see AI agents battle to build resources and to fight.

aed · 2026-02-11T14:56:24 1770821784

I'd love to see it!

The key "Aha!" moment was when I was trying to get it to play the SNES ROM and it was struggling with screenshots/inputs. Then I came across the open-source of the original SimCity engine (Micropolis) and pulled that repo down and Claude starting building an internal API to interface with it.

yreg · 2026-02-11T16:18:41 1770826721

On one hand yes, but on the other hand, would it be that different to watching an FFA with the in-game AIs?

boringg · 2026-02-11T14:44:18 1770821058

And then make it so you can integrate and battle against them...

mekod · 2026-02-11T13:48:33 1770817713

You read my mind! I really want to watch how ai's in politics or wars which tactic will they use.. Its blow my mind.

JohnMakin · 2026-02-11T14:01:04 1770818464

almost certainly just use basic strats they read off reddit

FrustratedMonky · 2026-02-11T15:26:12 1770823572

If they can read a strategy and implement it, still impressive.

JohnMakin · 2026-02-11T15:50:50 1770825050

i mean, not really. the civ 5/6 bots can play pretty decent strategy and that’s without “AI,” and most strategies are pretty formulaic

FrustratedMonky · 2026-02-11T17:56:01 1770832561

Sure. Games have had AI's before.

But to read someone else's strategy from just a document, and then implement it, that is new. The old civ did not do that, each AI just had pre-programmed rules.

randerson · 2026-02-11T15:18:08 1770823088

"Shall we play a game?"

_joel · 2026-02-11T14:39:12 1770820752

I fully approve of the name

hasperdi · 2026-02-11T16:13:07 1770826387

Fun! Any other games with REST API?

baq · 2026-02-11T13:40:16 1770817216

...I sense an animated svg of a pelican playing simcity benchmark is brewing somewhere

aed · 2026-02-11T14:36:43 1770820603

Funny you say that! When the two new models were released Friday I spun up mayors for each. (But didn’t do the prompting in the most scientific way.)

Mayor Compounded Wonder - Claude Opus 4.6

https://hallucinatingsplines.com/mayors/compounded-wonder-2c...

Mayor Bronze Offramp - OpenAI Codex 3.6

https://hallucinatingsplines.com/mayors/bronze-offramp-09941...

TL;DR: Opus won.

Have also thought about using openrouter and getting one mayor per model running the same prompt through all of them to create potentially the world's dumbest LLM benchmark.

gowld · 2026-02-11T16:19:14 1770826754

> LLMs are awful at the spatial stuff,

Which LLMs are you specifically referring to?

Are any of them trained with Micropolis data?