Large Language Models (LLMs) have revolutionized how we interact with technology, offering unprecedented capabilities in natural language processing and generation. While cloud-based AI services are popular, there is a growing trend toward local AI deployment for enhanced privacy, control, and customization. In this guide, we'll explore how to run LLMs locally on your CPU using Dify and Ollama. This approach lets you harness the power of AI without relying on cloud services, keeping your data on your own machine, reducing costs, and even allowing you to work offline.
Table of Contents
- Understanding Local LLM Deployment
- Hands-on Tutorial: Integrating Ollama with Dify
- Key Points to Remember
Let's start by understanding the local deployment of LLMs.
Understanding Local LLM Deployment
Local LLM deployment brings the power of advanced AI to your personal computer. By using tools like Dify, an open-source platform for building AI applications, and Ollama, an open-source project for running LLMs on local machines, you can create custom AI applications without relying on external APIs or cloud services. This approach offers several advantages:
- Enhanced privacy and data security
- Reduced operational costs
- Customization and fine-tuning capabilities
- Offline functionality
- Learning opportunities for AI enthusiasts and developers
Running an LLM locally is like having a personal AI assistant living on your computer. Here is an analogy to help you understand the concept: running LLMs locally is like having a personal library at home. You have immediate access to all the information, but it takes up space in your house (your computer). Cloud-based LLMs are like public libraries: vast resources, but you need to go there (connect to the internet) to access them.
Hands-on Tutorial: Integrating Ollama with Dify
Step 1: Install Ollama
First, let's get Ollama up and running on your PC. Follow the instructions from ADaSci's blog post "Hands-On Guide to Running LLMs Locally using Ollama". Once you've completed the installation successfully, Ollama should be running on your PC, and you'll be able to interact with it locally through your command prompt.
After installation, it should look something like this:
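If you haven't pulled a model yet, this is a good time to fetch one so it is ready when we connect Dify later. As a quick sketch, assuming the llama3.2:3b model used later in this guide:
ollama pull llama3.2:3b
ollama run llama3.2:3b
The pull command downloads the model weights, and run starts an interactive chat in the terminal so you can confirm the model responds before moving on.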
Step 2: Install Docker
Next, we'll install Docker. But first, what exactly is Docker? Docker is an open-source containerization platform. Think of it as a box (a container) in which you can package an app and everything it needs to run, such as libraries and dependencies, so it works the same no matter where you run it. Why Docker? It's a game-changer for several reasons:
- It simplifies the setup process by packaging all dependencies, avoiding manual configuration headaches.
- It makes moving applications between different environments (development, testing, production) seamless.
- It ensures that Ollama and Dify run without conflicts with other software on your system.
Head over to the official Docker website to download and install it.
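Once the installation finishes, a quick sanity check from your command prompt confirms that Docker and the Compose plugin are available:
docker --version
docker compose version
Both commands should print version information; if either fails, restart Docker Desktop (or your terminal) and try again.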
Step 3: Fetch Dify using Git
Now, let’s get Dify on your PC. Open your command prompt and use Git to clone the Dify repository:
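Assuming you want the official Dify repository hosted on GitHub under langgenius/dify, the command looks like this:
git clone https://github.com/langgenius/dify.git
This creates a dify folder in your current directory containing everything needed for the Docker-based setup in the next step.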
Step 4: Configure Docker
Time to get Docker configured. Open your command prompt and start Dify's services with Docker Compose.
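A typical sequence, assuming Dify's standard Docker Compose layout inside the repository's docker directory, looks like this:
cd dify/docker
cp .env.example .env
docker compose up -d
The cp step creates a local configuration file from the provided template (on the classic Windows command prompt, use copy instead of cp), and docker compose up -d pulls the required images and starts all of Dify's services in the background.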
This process might take several minutes, so be patient. Once it's done, all the necessary Docker containers should be up and running.
Step 5: Setting up Dify
You're almost there! Now you need to set up an admin account for Dify. Open http://localhost/install in your browser to create one. After successfully signing up, you can log in using your credentials. You should see a screen similar to this:
Pro tip: If you ever forget your password, you can reset it using the command prompt with this command:
docker exec -it docker-api-1 flask reset-password
After signing in, click your profile icon in the top right corner of the screen. A user menu will appear; click on "Settings" there. In the Settings menu, navigate to "Model Provider" and select Ollama from the list of available providers.
You’ll need to add some details here:
Model Name: Use the same model name that you installed using Ollama on your local PC. To find the available models, use this command:
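Ollama ships a list subcommand for exactly this; run it in your command prompt:
ollama list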
This will give you a list of available models and their respective names. Copy the name you want to use.
In this case, the model name is llama3.2:3b; provide the same name when adding Ollama as the model provider.
Base URL: Use `http://host.docker.internal:11434`
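Why not localhost? Dify runs inside Docker containers, and localhost inside a container refers to the container itself, not your PC; host.docker.internal is how the containers reach the Ollama server running on your host. If you want to double-check that Ollama is listening before saving the settings, a quick check from your own command prompt (assuming Ollama's default port, 11434) is:
curl http://localhost:11434
It should respond with a short message such as "Ollama is running".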
After successfully adding Ollama as a model provider, it should look like this:
Now you’re ready to create an app! On the homepage, choose the “Create from blank” option.
A dialog box will open. Add your app name and icon, then click the “Create” button. For this demonstration, let’s create a “Demo App”.
Click on your newly created Demo App. In the top right corner, you’ll see a “Publish” option – click on it.
Click on “Start Chat”, and voila! You can now ask anything using this locally running LLM model on your PC.
Let’s try it out! Ask: “What are some interesting facts about India?”
Here’s what the AI might respond with:
Isn’t it amazing what your locally-run AI can do? Feel free to ask more questions and explore the capabilities of your new setup!
Key Points to Remember
- CPU Performance: While running LLMs on a CPU is possible, it may be slower than GPU-accelerated solutions. Be patient with response times, especially for larger models.
- Memory Usage: LLMs can be memory-intensive. Ensure your system has sufficient RAM and close unnecessary applications when running models.
- Model Selection: Choose models that balance performance and resource requirements for your specific use case and hardware limitations.
- Regular Updates: Keep Dify and Ollama updated to benefit from performance improvements and new features.
Final Words
Running LLMs locally on your CPU with Dify and Ollama opens up a world of possibilities for AI enthusiasts, developers, and privacy-conscious users. While it may require some initial setup and patience, the benefits of having a personal AI assistant at your fingertips are immense. As you explore this exciting field, remember to stay curious, experiment with different models and configurations, and share your experiences with the growing community of local LLM users.
Other Helpful Resources
- Dify Documentation: https://docs.dify.ai/
- Ollama GitHub Repository: https://github.com/ollama/ollama