LLM Calculator

A native Apple-platform utility that tells you whether a GGUF model will fit on your Mac, iPhone, iPad, or Vision Pro, and how much memory is left for KV-cache at a selected context window.

Built with SwiftUI for iOS, macOS, and visionOS 26.1+.

What it does

Browses GGUF models on Hugging Face by popularity and search, with file sizes resolved per quant variant.
Computes exact KV-cache memory by parsing the GGUF header (block_count, head_count, head_count_kv, embedding_length) for models you've downloaded — falls back to a calibrated estimate for browse-only models.
Reports compatibility against the device you're running on, using ProcessInfo.physicalMemory and the OS-reported device class. No chip lookup tables, no hardcoded device catalog.
Ranks model files by fit, so you can quickly find the largest compatible quantization for the current device.

Build it yourself

Clone, then create your local signing config:

cp Config/Signing.xcconfig.template Config/Signing.xcconfig
# edit Config/Signing.xcconfig and fill in your Apple Developer Team ID

Open SelfHostLLM Calculator.xcodeproj in Xcode 26+ and build, or:

xcodebuild -project "SelfHostLLM Calculator.xcodeproj" \
  -scheme "SelfHostLLM Calculator" \
  -destination 'platform=macOS' build

To run the unit tests:

xcodebuild -project "SelfHostLLM Calculator.xcodeproj" \
  -scheme "SelfHostLLM Calculator" \
  -destination 'platform=macOS' test

The repo is self-contained — no Swift Package Manager dependencies, no CocoaPods, no Carthage. Just SwiftUI and Foundation.

Architecture

The app is intentionally small and direct:

Core/Services/CalculatorEngine.swift is the pure-struct calculator.
Core/Services/GGUFHeaderReader.swift parses GGUF v2/v3 headers without loading model weights.
Data/Remote/HFAPIClient.swift talks to the Hugging Face model API.
Core/Services/ModelRepository.swift owns the offline-first repo cache and downloaded model merge.
Features/Dashboard/ is the unified single-screen experience.

See CLAUDE.md for the full project map and implementation notes.

How the calculation works

LLM Calculator treats the GGUF file size as model memory. That is the memory needed to load the weights.

KV-cache is handled in two ways:

Downloaded GGUF files: exact KV-cache is calculated from GGUF header metadata.
Browse-only Hugging Face results: KV-cache is estimated as a conservative fraction of file size and scaled linearly by context window.

Compatibility uses the device's OS-reported physical memory, a platform reserve, and a small framework overhead. Results are intentionally simple:

Green: fits comfortably
Orange: tight, but should fit
Red: exceeds the device budget
Gray: file size is unknown

Privacy

LLM Calculator does not require an account and does not include analytics or tracking code.

The app makes network requests to Hugging Face to load GGUF repository metadata, search results, file lists, and model downloads. Search queries are sent to Hugging Face when you use search. Downloaded models, Hugging Face cache data, and the downloaded-model registry are stored locally in the app's Documents directory.

Limitations

Browse-only compatibility is an estimate until the GGUF header is available from a downloaded file.
Device detection uses OS-reported device class and physical memory only. It does not identify chip names or rely on hardware lookup tables.
The app checks memory fit, not runtime speed, prompt-processing throughput, or model quality.

Why I built this

I wanted a tool I'd actually trust before pulling a 30 GB GGUF onto a laptop, and the answer "it depends on KV-cache" deserves to be a number, not a vibe. Putting it on GitHub because the audience is devs, the math is worth peer-reviewing, and the SwiftUI is worth borrowing.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github		.github
Config		Config
SelfHostLLM Calculator.xcodeproj		SelfHostLLM Calculator.xcodeproj
SelfHostLLM Calculator		SelfHostLLM Calculator
SelfHostLLM CalculatorTests		SelfHostLLM CalculatorTests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Calculator

What it does

Build it yourself

Architecture

How the calculation works

Privacy

Limitations

Why I built this

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM Calculator

What it does

Build it yourself

Architecture

How the calculation works

Privacy

Limitations

Why I built this

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages