Setup · 10 min

Run an AI Model Locally on Your Laptop

Cloud AI tools like ChatGPT and Claude are powerful — but every prompt you type is sent to an external server. For sensitive business content like client contracts, financial data, or HR matters, that's a real exposure risk. Running an AI model locally keeps everything on your machine, with zero data leaving your device and no monthly subscription required.

What you need

Step-by-step setup

  1. Go to ollama.com and download the installer for your operating system. The install takes about two minutes.
  2. Open Terminal (Mac) or Command Prompt (Windows) and run:
    ollama pull gemma4
    This downloads Google's Gemma 4 model — roughly 5 GB. Go make a coffee.
  3. Once the download completes, start a conversation:
    ollama run gemma4
  4. Type your prompt and press Enter. Your conversation stays entirely on your machine.

Which model should you use?

Using the chat interface

The Ollama desktop app includes a built-in chat window — no extra tools required. Once Ollama is installed, open the app and you'll find a ChatGPT-style interface where you can type prompts, paste documents, and switch between models. Your conversations stay entirely on your machine.

Practical uses once you're up and running

Want to put this into practice?

Our training sessions walk through this setup live and show you how to use local AI in the context of your specific business.

Book a Free Discovery Call