Question 1

What does What Model do?

Accepted Answer

What Model scans your system hardware (CPU, RAM, GPU, and VRAM) and recommends open-weight LLMs that fit your machine, including the best quantization level and copy-ready Ollama pull and run commands.

Question 2

Do I need Ollama installed?

Accepted Answer

No. Recommendations work without Ollama. If Ollama is installed and running locally, the app detects pulled models and marks them on each card.

Question 3

How accurate are the memory estimates?

Accepted Answer

Estimates are based on parameter count, quantization format, and context length. They are approximate; actual usage varies by runtime such as Ollama, llama.cpp, or LM Studio.

Question 4

Should I run this on the machine I use for inference?

Accepted Answer

Yes. For accurate results, run What Model on the same PC or laptop where you plan to run local models. On phones and tablets, use manual VRAM and RAM configuration.

What can your machine actually run?

02 / Models

03 / FAQ