Ollama Client by Shishir Chaurasiya

Privacy-first browser extension for chatting with local AI models such as Llama, Mistral, and Gemma. Fully offline when paired with a local provider.

Rating: 5 out of 5 (2 reviews)

205 Users

About this extension

Ollama Client – Local-First AI Chat in Your Browser

Ollama Client is a privacy-focused browser extension for chatting with local and user-configured AI providers. Connect directly to your preferred model server, manage conversations, use local knowledge, and access AI tools from the browser side panel.

Verified Built-In Providers
• Ollama
• LM Studio
• llama.cpp

Beta Custom Providers
• OpenAI-compatible servers and services, including vLLM, LocalAI, KoboldCPP, and OpenRouter
• Anthropic through the native Claude Messages API
• Other compatible local, LAN, or remote endpoints
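The two custom provider families above use different request shapes. The following is a minimal sketch of that difference, not the extension's actual code: the function names, the `HttpRequest` type, and the assumption that the configured base URL does not already end in `/v1` are all illustrative. The endpoint paths and header names are the ones published for the OpenAI-compatible chat completions endpoint and the Anthropic Messages API.

```typescript
// Hypothetical types for illustration; not taken from the extension's source.
type ChatMessage = { role: "user" | "assistant"; content: string };

interface HttpRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// OpenAI-compatible servers (vLLM, LocalAI, and similar) accept
// POST {baseUrl}/v1/chat/completions.
function buildOpenAICompatibleRequest(
  baseUrl: string,
  apiKey: string | undefined,
  model: string,
  messages: ChatMessage[],
): HttpRequest {
  const headers: Record<string, string> = { "Content-Type": "application/json" };
  // Local servers often need no key; remote services expect a bearer token.
  if (apiKey) headers["Authorization"] = `Bearer ${apiKey}`;
  return {
    // Strip trailing slashes so the path is joined exactly once.
    url: `${baseUrl.replace(/\/+$/, "")}/v1/chat/completions`,
    headers,
    body: JSON.stringify({ model, messages, stream: true }),
  };
}

// The Anthropic Messages API uses its own auth and version headers,
// and requires max_tokens in the body.
function buildAnthropicRequest(
  apiKey: string,
  model: string,
  messages: ChatMessage[],
  maxTokens: number = 1024, // assumed default for this sketch
): HttpRequest {
  return {
    url: "https://api.anthropic.com/v1/messages",
    headers: {
      "Content-Type": "application/json",
      "x-api-key": apiKey,
      "anthropic-version": "2023-06-01",
    },
    body: JSON.stringify({ model, max_tokens: maxTokens, messages, stream: true }),
  };
}
```

Keeping request construction separate from the network call makes the provider differences easy to inspect, which may help when a Beta provider behaves unexpectedly.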

Features
• Connect and manage multiple AI providers
• Discover available models and configure custom model IDs
• Switch models and monitor provider connection status
• Stream chat responses with stop, retry, edit, regenerate, and fork controls
• View reasoning and thinking traces from supported models
• Use native and prompt-based tool calling
• Search the web through your configured SearXNG, Brave, or Tavily provider
• Attach local files and images to conversations
• Build reusable local knowledge from uploaded documents
• Add optional webpage and browser-tab context
• Use selected-text actions from the right-click context menu
• Save memories and reusable context
• Organise conversations with search, tags, pinning, and session management
• Create custom prompt templates and system prompts
• Adjust model parameters and capability overrides
• Export, import, and reset locally stored data
• Access privacy, permission, storage, and diagnostic controls
• Use the browser side panel for a focused desktop workflow
• Multi-language interface: English, German, Spanish, French, Hindi, Italian, Japanese, Russian, and Simplified Chinese

Privacy
• Chats, session history, settings, knowledge, and vector embeddings are stored locally in your browser
• Provider API keys are stored only on your device and excluded from backups
• Requests are sent directly to the provider you configure; Ollama Client does not operate an intermediary inference server
• Local providers can keep inference entirely on your device or local network
• Remote providers receive content only when you choose to use them
• Web search is off by default and connects only to the search provider you configure
• Page, tab, file, and browser context is shared with the selected model only when you include or request that context
• Optional browser permissions can be reviewed and revoked from the extension settings
• Local data can be exported, deleted, or fully reset at any time
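As an illustration of how API keys can be kept out of an export, here is a minimal sketch. The `ProviderConfig` and `ExportableData` shapes and the `buildBackup` function are hypothetical; the extension's real storage schema is not described in this listing.

```typescript
// Hypothetical data shapes, assumed only for this example.
interface ProviderConfig {
  id: string;
  baseUrl: string;
  apiKey?: string;
}

interface ExportableData {
  providers: ProviderConfig[];
  sessions: unknown[];
}

// Return a copy of the data with every provider's apiKey removed.
// The input object is left unchanged.
function buildBackup(data: ExportableData): ExportableData {
  return {
    sessions: data.sessions,
    providers: data.providers.map(({ apiKey: _omitted, ...rest }) => rest),
  };
}
```

Stripping the key at export time, rather than at import time, means a backup file never contains the secret in the first place.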

Who It’s For
• Developers building with local and self-hosted AI models
• Researchers testing different LLM providers and model capabilities
• Students learning local and offline AI workflows
• Teams using private LAN-hosted model servers
• Privacy-conscious users who want control over providers and stored data

Setup
1. Install the extension
2. Start Ollama, LM Studio, llama.cpp, or another supported provider
3. Open the extension settings and connect using a localhost, LAN, or remote endpoint
4. Select a model and start chatting
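To check that a local Ollama server is reachable before connecting, something like the sketch below can be run in any environment with `fetch`. It assumes Ollama's default address, `http://localhost:11434`, and uses its `/api/tags` endpoint, which lists installed models. The function names are illustrative and not part of the extension.

```typescript
// Only the field this sketch reads; the real response carries more metadata.
interface OllamaTagsResponse {
  models?: { name: string }[];
}

// Pure helper: pull model names out of an /api/tags payload.
function modelNamesFromTags(payload: OllamaTagsResponse): string[] {
  return (payload.models ?? []).map((m) => m.name);
}

// Query the server and return the names of installed models.
// The default base URL is Ollama's standard local address.
async function listOllamaModels(
  baseUrl: string = "http://localhost:11434",
): Promise<string[]> {
  const res = await fetch(`${baseUrl.replace(/\/+$/, "")}/api/tags`);
  if (!res.ok) throw new Error(`Ollama responded with HTTP ${res.status}`);
  return modelNamesFromTags((await res.json()) as OllamaTagsResponse);
}
```

If the request is rejected when sent from a browser extension, the server may need the extension's origin allowed through Ollama's `OLLAMA_ORIGINS` environment variable; the Provider Setup Guide is the place to confirm the exact steps.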

Important Notes
• Ollama Client is a frontend client and does not include AI models
• Local inference performance depends on your hardware, selected model, and backend configuration
• Custom and remote provider support is marked Beta because compatibility can vary between services
• Remote providers may have their own privacy policies, pricing, authentication, and data-retention rules

Page Context and Script Injection

When you ask the model to use the current page, Ollama Client first contacts its content script on that tab. If the content script is unavailable—for example, because the tab was opened before the extension was installed or updated—the extension uses the scripting permission to inject its content script into that specific tab.

This injection is:
• Triggered only by a user-requested page-context action
• Scoped to the active tab involved in the request
• Limited to extracting readable page content needed for that request
• Never performed continuously, on a schedule, or across every open tab
• Not used on protected browser pages where extensions cannot run
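The fallback described above can be sketched roughly as follows. This is an assumption-laden illustration, not the extension's source: the `TabBridge` interface, the `requestPageContext` function, and the `EXTRACT_PAGE_CONTENT` message name are all hypothetical.

```typescript
// Minimal surface of the browser APIs this sketch relies on.
// Declared as an interface so the logic can be exercised without a browser.
interface TabBridge {
  sendMessage(tabId: number, message: { type: string }): Promise<unknown>;
  injectContentScript(tabId: number): Promise<void>;
}

// Ask the tab's content script for page content; if nothing is listening,
// inject the script into that one tab and ask again once.
async function requestPageContext(
  bridge: TabBridge,
  tabId: number,
): Promise<unknown> {
  const message = { type: "EXTRACT_PAGE_CONTENT" }; // hypothetical message name
  try {
    // First attempt: a content script is already present in the tab.
    return await bridge.sendMessage(tabId, message);
  } catch {
    // No receiver: inject into this specific tab only, then retry.
    await bridge.injectContentScript(tabId);
    return bridge.sendMessage(tabId, message);
  }
}
```

In a real extension the bridge could be backed by `tabs.sendMessage(tabId, message)` and `scripting.executeScript({ target: { tabId }, files: ["content.js"] })`, where `content.js` is a placeholder file name. Because the function runs only when called and takes a single tab ID, it matches the user-triggered, single-tab scope listed above.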

Useful Links

Chrome Web Store:
https://chromewebstore.google.com/detail/ollama-client/bfaoaaogfcgomkjfbmfepbiijmciinjl

Provider Setup Guide:
https://www.ollamaclient.in/guides/provider-setup

Website:
https://www.ollamaclient.in

GitHub:
https://github.com/Shishir435/ollama-client
