Estimate memory usage for GGUF models based on GPU layers, context length, and cache type.
The formula was found by symbolic regression with TuringBot, which evaluated over a billion candidate formulas against 19,517 real measurements. For details, see this blog post.
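
To give a feel for why GPU layers, context length, and cache type are the inputs, here is a minimal first-principles sketch of the KV cache term alone. It is not the regressed formula described above, and it ignores model weights, compute buffers, and other overhead. The function name, parameter names, and the assumption that each offloaded layer keeps its own slice of the KV cache on the GPU are illustrative, not taken from the tool.

```python
# Approximate bytes per cached element for common llama.cpp KV cache types.
# q8_0 and q4_0 pack 32 values per block together with a 2-byte fp16 scale,
# giving 34 and 18 bytes per block respectively.
BYTES_PER_ELEMENT = {
    "fp16": 2.0,
    "q8_0": 34 / 32,
    "q4_0": 18 / 32,
}


def kv_cache_bytes(n_layers_on_gpu, ctx_len, n_kv_heads, head_dim, cache_type="fp16"):
    """Rough KV cache size in bytes for the layers placed on the GPU.

    Hypothetical helper: assumes one K and one V vector of size
    n_kv_heads * head_dim per token per layer.
    """
    elements_per_token_per_layer = 2 * n_kv_heads * head_dim  # K and V
    total_elements = n_layers_on_gpu * ctx_len * elements_per_token_per_layer
    return total_elements * BYTES_PER_ELEMENT[cache_type]


if __name__ == "__main__":
    # Example shape: 32 layers, 8 KV heads of dimension 128, 8192-token context.
    for cache_type in ("fp16", "q8_0", "q4_0"):
        gib = kv_cache_bytes(32, 8192, 8, 128, cache_type) / 2**30
        print(f"{cache_type}: {gib:.3f} GiB")
```

In this sketch the cache grows linearly with both context length and the number of offloaded layers, and switching from `fp16` to `q8_0` roughly halves it. Real measurements include further terms, which is presumably why a fitted formula was used instead of a purely analytical one.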