Ollama lets you run large language models locally with a single command.

Quick start

ollama run llama3

Why local?

  • No API keys or rate limits.
  • Data stays on your machine.
  • Works offline.

Models I use

  • llama3 — general purpose
  • codellama — code generation
  • nomic-embed-text — embeddings for vector search