Nice job. I have a similar automated cron job that runs overnight and does the following when it encounters new pictures in a folder:
- Qwen3-VL 8b creates a verbose description + keywords
- Simpler CLIP encoder builds another set of tags
- Description is placed into an image RAG
- Keywords are embedded into the file name itself, joined with underscores
- Description/Tags/Keywords are all embedded in EXIF data on the image
I've got close to 30k images, so this pipeline gives me multiple overlapping ways of searching (natural language, keywords, etc.) to quickly retrieve them.
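The filename step can be sketched in a few lines. This is just an illustration of the idea, not the commenter's actual script; the function name and the double-underscore separator are my own choices:

```python
import re
from pathlib import Path

def embed_keywords_in_filename(path: Path, keywords: list[str]) -> Path:
    """Append underscore-joined keywords to the file stem, e.g.
    IMG_0042.jpg + ['beach', 'sunset'] -> IMG_0042__beach_sunset.jpg"""
    # Normalize each keyword to a safe lowercase token
    safe = [re.sub(r"[^a-z0-9]+", "", kw.lower()) for kw in keywords]
    safe = [s for s in safe if s]
    new_stem = f"{path.stem}__{'_'.join(safe)}" if safe else path.stem
    return path.with_name(new_stem + path.suffix)
```

This keeps the keywords greppable from a plain shell (`ls *sunset*`) even when the RAG index or EXIF tools aren't available.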
Nice job, but that polka‑esque music makes me want to feed myself to the yeti as quickly as possible. :)
You might consider a mechanic where moving your character in a sinusoidal pattern works like building up blue sparks in Mario Kart, rewarding the player with a temporary speed boost.
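One cheap way to approximate "sinusoidal movement" is to count direction changes in the player's lateral input within a rolling frame window. A toy sketch (all names and thresholds are made up for illustration):

```python
from collections import deque

class WeaveBoost:
    """Charge a speed boost when lateral input keeps alternating
    direction -- a rough stand-in for 'sinusoidal' movement."""

    def __init__(self, swings_needed: int = 4, window: int = 60):
        self.swings_needed = swings_needed   # direction changes required
        self.window = window                 # frames the changes must fit in
        self.change_frames = deque()
        self.last_sign = 0
        self.frame = 0

    def update(self, lateral_input: float) -> bool:
        """Feed one frame of lateral input; return True when the boost fires."""
        self.frame += 1
        sign = (lateral_input > 0) - (lateral_input < 0)
        if sign != 0 and sign != self.last_sign:
            if self.last_sign != 0:          # an actual direction change
                self.change_frames.append(self.frame)
            self.last_sign = sign
        # Drop direction changes that fell out of the window
        while self.change_frames and self.frame - self.change_frames[0] > self.window:
            self.change_frames.popleft()
        if len(self.change_frames) >= self.swings_needed:
            self.change_frames.clear()       # consume the charge
            return True
        return False
```

Tuning `swings_needed` and `window` controls how deliberate the weaving has to be before the boost triggers.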
Another SkiFree homage, posted a while back on HN, that has a nice pixel art aesthetic:
Not OP but one example is that recent VL models are more than sufficient for analyzing your local photo albums/images for creating metadata / descriptions / captions to help better organize your library.
The easiest way to get started is probably to use something like Ollama and use the `qwen3-vl:8b` 4‑bit quantized model [1].
It's a good balance between accuracy and memory, though in my experience it's slower than older model architectures such as LLaVA. Just be aware Qwen-VL tends to be a bit verbose [2], and you can't really control that reliably with token limits: it'll just cut off abruptly. You can ask it to be more concise, but that's hit or miss.
What I often end up doing, and I admit it's a bit ridiculous, is letting Qwen-VL generate its full detailed output and then passing that to a different LLM to summarize.
Strongly agree. Gemma3:27b and Qwen3-vl:30b-a3b are among my favorite local LLMs and handle the vast majority of translation, classification, and categorization work that I throw at them.
I'm using the default llama-server that's part of llama.cpp (Gerganov's LLM inference system), running on a headless machine with an NVIDIA 16 GB GPU, but Ollama's a bit easier to ease into since it has a preset model library.
There actually was an attempt on HN a little while back to use GenAI to convert facts, flashcards, lists, etc. into automated melodic mnemonics. The biggest issue in that particular case was that it was also generating the motif from scratch.
At least for me, part of the reason I can still sing the countries of the world is because the original Animaniacs song was set to a tune that was already familiar: “Jarabe Tapatío” (aka the Mexican Hat Dance).
I memorized that (and several other) Animaniacs songs without being familiar with the melody. Even Tom Lehrer's The Elements reached me before Pirates of Penzance did. I think the melody just needs to be simple; then it'll become "familiar" quickly.
However, for the use-case at hand (remembering IPv6 addresses) I don't think I'd use that. I'd just write them down somewhere, like, uh, perhaps, oh I know: the hosts file.
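For what it's worth, the hosts file does take IPv6 entries directly, same format as IPv4 (address below is from the reserved documentation range, not a real host):

```
# /etc/hosts -- name the address once, never memorize it again
2001:db8::1    nas.local
```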
“Catchiness” is probably more important than anything, hence the concept of the earworm aka stuck song syndrome. Even SOTA GenAI like Suno/Udio fall pretty short of generating genuinely engaging melodies.
You really need to let people use the Tab key instead of having to insert spaces manually. Even better would be to automatically start new lines at the correct indentation level, since Tab is often intercepted by browsers.
The current layout introduces ugly horizontal scroll bars when the viewport is even modestly resized, especially because the code snippets already use a fairly large font. As a result, you can’t see all the text at once. Since the program doesn’t auto-scroll to keep the cursor in view, it becomes very difficult to use unless you run it full screen.
Hey, thanks for the feedback! The text should auto-scroll; I've set up a minimum of 4 lines at the bottom, but that's maybe broken because of the overflow you mention. I'm going to update that :)
Also, the two sites you mention have a shitload of cookie banners and login requirements. I plan to build something with no login and no data being resold. Don't know if it's a good idea ^^
The percussive hit that starts the song when you press Play is SUPER jarringly loud. I'd trim it out of the track entirely.
I know you've mentioned tileset improvements, but just to put it out there: you've generated isometric buildings, yet the tileset you're using appears to be square-based. This creates a very incongruous style where the buildings don't feel like they're actually attached to the ground.
Finally, about the pixel art you're using: you're asking Nano Banana to generate it, but there are a couple of issues with prompting pixel art from GenAI models. The most obvious one is that the pixels aren't aligned to a traditional grid, which leads to really noticeable fringing. This is especially obvious when I use the scroll wheel to zoom in on some of the art assets.
I’d highly recommend using something like Unfake [1] to clean this up, aligning pixels and reducing the palette to something more consistent. It’s a bit more manual work, but it will make your assets look dramatically better.
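The kind of cleanup such tools automate boils down to two passes: collapse each cell of the intended pixel grid to its dominant color, then quantize to a small palette. A pure-Python toy version (nothing to do with Unfake's actual implementation; it operates on nested lists of RGB tuples for clarity):

```python
from collections import Counter

def nearest(color, palette):
    """Pick the palette entry closest to `color` (squared RGB distance)."""
    return min(palette, key=lambda p: sum((a - b) ** 2 for a, b in zip(color, p)))

def snap_to_grid(pixels, cell, palette):
    """Collapse each cell x cell block to its dominant color, then
    quantize to the palette; returns the downscaled image."""
    h, w = len(pixels), len(pixels[0])
    out = []
    for y in range(0, h, cell):
        row = []
        for x in range(0, w, cell):
            block = [pixels[yy][xx]
                     for yy in range(y, min(y + cell, h))
                     for xx in range(x, min(x + cell, w))]
            dominant = Counter(block).most_common(1)[0][0]
            row.append(nearest(dominant, palette))
        out.append(row)
    return out
```

A real tool additionally has to *detect* the cell size and grid offset, which is the hard part; once those are known, the snapping itself is mechanical like the above.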
Thanks for all of this, really appreciate the detailed feedback and the links, will check those out.
Apologies about the drum hit at the start! I appreciate it's probably way too loud, especially on headphones. Cutting the first second out now, and also adding a fade-in to the menu music to ease you in!
The tileset/isometric mismatch and the pixel grid fringing are both great calls. I am by no means an artist so this is a big help! I hadn't come across Unfake before, that looks super useful to clean up the existing assets and any new ones I generate.
I came across https://www.pixellab.ai/ today while researching unit sprite animation & tileset generation, and I think I may have to redo a lot of the graphics entirely. Art seems to be the most expensive part outside of the Claude Max plan...
Running through your suggestions with Claude now to get them implemented. Cheers!
Very cool. Small bit of feedback - I'd suggest using pointer events so that the site works on both desktop and tablets. It didn't seem to respond to touch input when I tried it.
Thanks for the feedback. Are you using mobile Safari? Can you try changing the pressure sensitivity slider to 0 percent? It might be an issue with wrong pressure detection. I've had to write some custom code for the native iOS version to fix it but hadn't really tested in the browser.
Sure. I dialed the pressure sensitivity down to zero, but it still didn’t work. This was in mobile Safari (WebKit) on an iPad. It definitely works in Firefox on Android, though, so it does seem to be an iOS-specific issue.