AI-powered tools for developers are more popular than ever, with many services, code editors, extensions emerging every day. While these tools are usually helpfull and convinient, they often come with monthly fees and require sending your data to their servers.
In this post, we’ll look at free, open-source AI models that you can run directly on your own computer and protect your privacy since all your data stays on your machine.
Setup
First we need to install Ollama. A tool for running open source models locally.
Verfy that Ollama is running:
ollama helpOllama offers a wide collection of models, each with multiple options depending on the number of parameters used during training. In general, models with more parameters provide better accuracy and understanding of tasks. However, this increased precision comes at the cost of more storage and compute on your local machine.
I usually prefer sticking with the default configurations for most models. They tend to strike a good balance of accuracy and performance while being resource-friendly. It’s important to note that these default configurations are not the most powerful options available—those are often provided by paid third-party services. While it is possible to run such high-end models locally, they require very strong hardware to handle the increased demands.
Here are the models I’am running at the moment. Keep in mind you’ll need ~15GB of disk space to store these.
- Llama - general purpose model sutable for simple tasks:
ollama run llama3.1- Deepseek R1 - new reasoning model everyone’s talking about, it’s suitable for solving quite complex problems:
ollama run deepseek-r1- Qwen2.5 - model optimized for code completion:
ollama run qwen2.5-coderWith the models running locally, let’s connect them to a code editor. I prefer using VSCode, so we’ll use the Continue extension. It’s pretty much the GitHub Copilot that we can hook onto local Ollama models. It offers features like chat, context awareness, autocomplete, and much more.
Once installed, you should see new icon appear on the VSCode’s sidebar.
Let’s open Continue’s configuration file:
code ~/.continue/config.jsonAnd add the following to setup models we installed. We will use Llama and Deepseek for our chat and Qwen for autocomple:
{  "models": [    {      "title": "llama3.1 8B",      "provider": "ollama",      "model": "llama3.1:latest"    },    {      "title": "deepseek-r1 7B",      "provider": "ollama",      "model": "deepseek-r1:latest"    }  ],  "tabAutocompleteModel": {    "title": "qwen2.5-coder 7B",    "provider": "ollama",    "model": "qwen2.5-coder:latest"  }}By default Continue collects and reports telemetry, to opt-out of it, add the following to the config:
{  "allowAnonymousTelemetry": true}We’re all setup! 🚀
Open the Continue extension from your sidebar in VSCode and get to work!