Hacker News

It's notable that Apple devices ship with very little RAM compared to similar devices from competitors.

Part of that is that Apple's software team uses more memory-efficient languages (e.g. Objective-C vs. Java). Part of that is that applications on iOS don't have to target a huge variety of screen resolutions (and therefore aren't frequently loading high-res textures only to downscale them). Part of that is that RAM doesn't get much cheaper even when you buy at Apple's scale, so a RAM bump represents a bigger hit to margins than adding other features.

But all of that comes back to bite them when running LLMs, which inherently gobble RAM. And any memory-saving technique will simply let a competitor with more RAM squeeze in an even bigger, smarter model.
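To see why LLMs gobble RAM, here's a back-of-envelope sketch: just holding the weights takes parameter count times bytes per weight, before counting the KV cache, activations, or any OS overhead. The figures below are illustrative, not benchmarks.

```python
# Rough RAM needed just to hold an LLM's weights in memory
# (ignores KV cache, activations, and runtime overhead).
def weights_gib(params_billions: float, bits_per_weight: float) -> float:
    bytes_total = params_billions * 1e9 * bits_per_weight / 8
    return bytes_total / (1024 ** 3)

# A 7B model at fp16 already needs ~13 GiB for weights alone;
# 4-bit quantisation brings that down to ~3.3 GiB.
print(f"{weights_gib(7, 16):.1f} GiB")  # ~13.0
print(f"{weights_gib(7, 4):.1f} GiB")   # ~3.3
```

That's why an 8GB phone or base-model Mac has so little headroom for a useful on-device model once the OS and apps take their share.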



Add to that that you can't upgrade the RAM in most desktop Macs anymore.

I want to buy a Mac soon, and I'm really struggling to decide how much RAM to order. Unfortunately my budget is limited; if it weren't, I'd probably go for at least 32GB. I'm still hoping Apple might change their RAM pricing, but probably in vain.


> I want to buy a Mac soon, and I'm really struggling to decide how much RAM I should order.

I'd recommend getting at least 32GB if you're on the fence. Not being able to upgrade it is a bummer, and your future self will thank you for getting the most you possibly can.

For my most recent upgrade I went for 64GB (previously 32GB) and I'm really glad I did, especially since llama.cpp became a thing shortly after I got it.


Also in the “glad I got 64GB” camp - even though it seemed ridiculous when I bought it, technology has advanced so quickly that it's now actually very useful.

Now I wish I'd bought the 4TB rather than the 2TB drive, lol, but that's just me being lazy - that upgrade definitely felt like a step too far.


Would you recommend the 64 over 128? What kind of models will 128 open up over 64? Can you get most of them out of the 64?


For LLMs, TheBloke provides memory requirements for all of the quantisations. For Apple Silicon you want to look at the GGUF models.

Here's Mixtral-8x7B-Instruct: https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-G...

64GB of memory will struggle with 65-70B (or bigger) models; you'll be limited to Q3 or Q4 quantisations of a 70B model if you want to run it somewhat comfortably.
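A rough way to see what 128GB opens up over 64GB: compare approximate model sizes at each quantisation against the memory the GPU can actually address. The bits-per-weight figures and the ~70% usable-memory assumption below are my own ballpark estimates, not official numbers.

```python
# Which quantisations of a 70B model plausibly fit in unified memory?
# Assumes macOS lets the GPU use ~70% of RAM (varies by machine) and
# approximate bits-per-weight for common GGUF quants (illustrative).
QUANTS = {"Q3_K_M": 3.9, "Q4_K_M": 4.8, "Q5_K_M": 5.7, "Q8_0": 8.5}

def model_gib(params_billions: float, bits_per_weight: float) -> float:
    return params_billions * 1e9 * bits_per_weight / 8 / (1024 ** 3)

for ram_gb in (64, 128):
    usable = ram_gb * 0.7
    fits = [q for q, bpw in QUANTS.items() if model_gib(70, bpw) <= usable]
    print(f"{ram_gb}GB RAM (~{usable:.0f} GiB usable): {', '.join(fits)}")
```

Under these assumptions, 64GB tops out around Q4 for a 70B model, while 128GB opens up Q5 and even Q8 quantisations.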


I’m not sure, not really an expert in this field, just casually interested as an outsider. When I bought my Mac, 64 was as high as you could go.


A couple of additional points on how the "low-RAM" works:

1 - Apple devices have support for memory compression (https://www.lifewire.com/understanding-compressed-memory-os-...); see the implementation in xnu: https://opensource.apple.com/source/xnu/xnu-2050.18.24/libke...

2 - Apple devices support something called "jetsam", which basically frees up memory from unused/background apps by killing them in order to keep high priority apps running smoothly: https://developer.apple.com/documentation/xcode/identifying-...
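The intuition behind point 1 is that a lot of in-memory data is highly redundant, so an inactive page can be stored in a fraction of its size instead of being swapped or evicted. A toy sketch (using zlib for illustration; macOS's actual compressor is a much faster algorithm tuned for memory pages):

```python
# Toy illustration of memory compression: a repetitive 4 KiB "page"
# shrinks to a fraction of its size, which is why compressing idle
# pages effectively stretches a fixed amount of RAM.
import zlib

page = (b"user settings: theme=dark; lang=en; " * 120)[:4096]
compressed = zlib.compress(page)
print(f"{len(page)} -> {len(compressed)} bytes "
      f"({len(page) / len(compressed):.0f}x smaller)")
```

Real compression ratios depend heavily on the workload, but even a 2-3x average ratio meaningfully delays the point where jetsam has to start killing apps.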


I didn't mention either of those because Android does both too (via zRAM and the Low Memory Killer Daemon).


lmkd (the low memory killer daemon) works fairly differently: it acts on a different set of signals and applies a different policy. But yes, conceptually they try to achieve the same goal.

I also don't know whether Android combines system libraries into one big file for similar savings, something Apple devices do.


The only things keeping me on a Mac are familiarity and the fact that MacBook Airs are silent. I'm open to suggestions for Linux laptops that are quiet or nearly silent; most have fans that rev up, and I'd gladly sacrifice some CPU for quiet, or even a quiet mode (an easy on/off switch). Nothing I've seen matches the silence, and I'm more than happy to be proven wrong. Obviously it would have to have other upsides too, like cheaper and/or replaceable RAM. For context, I mostly use my MacBook Air as a remote terminal to web-based services and to my Linux server, which handles compiling bigger projects and home/self-hosting.


Not sure this is the right take. Apple is betting that, in the long term, flash memory will be equivalent to RAM given the right CPU/GPU architectures. The timeline is compressed, certainly, but I don't think their thesis is wrong.


That thesis is definitely wrong, and I don't think their current device architectures would need to reflect any such long-term convergence anyway.


Yeah, I phrased it extremely poorly as well (I was thinking of 3D XPoint and the like). Point still taken.



