Implement a local knowledge base Q&A system with langchaingo and the large model runner ollama

Published on 2024-05-03

Ollama is a simple, easy-to-use framework for running large language models locally, developed in Go. Besides managing models, it uses gin, a Go web framework, to provide API endpoints, so you can interact with it in the same way as the interfaces provided by OpenAI.

Ollama also provides an official model hub, similar to Docker Hub, which hosts various large language models. Developers can also upload their own trained models for others to use.

Install ollama

You can download the binary package for your platform directly from ollama's GitHub releases page, or deploy it with one click using docker. The machine used for this demonstration is a macOS M1 Pro, so we just download the installation package, install it, and run the app.

Once running, Ollama listens on port 11434 by default. Execute the following command in the terminal to verify that it is running normally:

$ curl localhost:11434
Ollama is running

Large model management

After ollama is installed, you can install and use large models. Ollama also ships with a command line tool through which you can interact with the model.

  • ollama list: list installed models
  • ollama show: show information for a model
  • ollama pull: pull a model from the registry
  • ollama push: push a model to the registry
  • ollama cp: copy a model
  • ollama rm: delete a model
  • ollama run: run a model

You can find the model you want in the official model repository: https://ollama.com/library

Note: You should have at least 8 GB of available RAM to run the 7B model, 16 GB to run the 13B model, and 32 GB to run the 33B model.

For example, we can choose Qwen for the demonstration. A 1.8B model is used here (the local machine is fairly weak, with only 16G of RAM 😭):

$ ollama run qwen:1.8b

Does this command look familiar? Yes, it works just like docker run image: if the model does not exist locally, it is downloaded first and then run.

Since it is so consistent with docker, is there an equivalent of the Dockerfile? Yes, it is called a Modelfile:

FROM qwen:14b

# set the temperature to 1 (higher is more creative, lower is more coherent)
PARAMETER temperature 1

# set the system message
SYSTEM """
You are Mario from super mario bros, acting as an assistant.
"""

Save the above code as Modelfile, then run ollama create choose-a-model-name -f Modelfile to create your customized model, and ollama run choose-a-model-name to use the model you just customized.
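Since Ollama also exposes an HTTP API on port 11434, you can call the model programmatically as well. Below is a minimal Go sketch against the /api/generate endpoint; the request and response fields follow Ollama's documented API, and stream is disabled so the full answer arrives as a single JSON object:

package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"log"
	"net/http"
)

// request/response shapes for Ollama's /api/generate endpoint
type generateRequest struct {
	Model  string `json:"model"`
	Prompt string `json:"prompt"`
	Stream bool   `json:"stream"`
}

type generateResponse struct {
	Response string `json:"response"`
}

func main() {
	body, _ := json.Marshal(generateRequest{
		Model:  "qwen:1.8b",
		Prompt: "Why is the sky blue?",
		Stream: false, // return one complete response instead of a stream
	})
	resp, err := http.Post("http://localhost:11434/api/generate", "application/json", bytes.NewReader(body))
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()

	var out generateResponse
	if err := json.NewDecoder(resp.Body).Decode(&out); err != nil {
		log.Fatal(err)
	}
	fmt.Println(out.Response)
}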

Connect langchaingo with ollama to implement the local knowledge base Q&A system

Preparation

With the models installed locally, we can build some interesting AI applications on top of them. Next we develop a question-and-answer system based on langchaingo.

The following system uses two ollama models: qwen:1.8b and nomic-embed-text. Let's install them first:

ollama run qwen:1.8b
ollama run nomic-embed-text:latest

We also need a vector database to store the split knowledge base content. Here we use qdrant:

$ docker pull qdrant/qdrant
$ docker run -itd --name qdrant -p 6333:6333 qdrant/qdrant

After starting qdrant, we first create a collection to store the document chunks. The vector size is 768 because that is the dimensionality of the embeddings produced by nomic-embed-text:

curl -X PUT http://localhost:6333/collections/langchaingo-ollama-rag \
  -H 'Content-Type: application/json' \
  --data-raw '{
    "vectors": {
      "size": 768,
      "distance": "Dot"
    }
  }'

Knowledge base content segmentation

Here an article is used as the learning material for the large model; the following code splits the article into small document chunks:

func TextToChunks(dirFile string, chunkSize, chunkOverlap int) ([]schema.Document, error) {
	file, err := os.Open(dirFile)
	if err != nil {
		return nil, err
	}
	// create a document loader
	docLoaded := documentloaders.NewText(file)
	// create a document splitter
	split := textsplitter.NewRecursiveCharacter()
	// set the document chunk size
	split.ChunkSize = chunkSize
	// set the chunk overlap size
	split.ChunkOverlap = chunkOverlap
	// load and split the document
	docs, err := docLoaded.LoadAndSplit(context.Background(), split)
	if err != nil {
		return nil, err
	}
	return docs, nil
}

Store the documents in the vector database

// storeDocs writes the document chunks into the qdrant vector store
func storeDocs(docs []schema.Document, store *qdrant.Store) error {
	if len(docs) > 0 {
		_, err := store.AddDocuments(context.Background(), docs)
		if err != nil {
			return err
		}
	}
	return nil
}
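The store parameter used above has to be constructed first. The exact setup lives in the source repository linked at the end; the following is just a minimal sketch of what it can look like with langchaingo's qdrant vector store and an Ollama-backed embedder (the function name newStore and the URL are our own illustrative choices):

import (
	"log"
	"net/url"

	"github.com/tmc/langchaingo/embeddings"
	"github.com/tmc/langchaingo/llms/ollama"
	"github.com/tmc/langchaingo/vectorstores/qdrant"
)

// newStore connects to the qdrant collection created earlier,
// using nomic-embed-text to turn documents into vectors.
func newStore() *qdrant.Store {
	// the embedding model must match the collection's vector size (768)
	embedderClient, err := ollama.New(ollama.WithModel("nomic-embed-text"))
	if err != nil {
		log.Fatalf("failed to create embedding model: %v", err)
	}
	embedder, err := embeddings.NewEmbedder(embedderClient)
	if err != nil {
		log.Fatalf("failed to create embedder: %v", err)
	}
	qdrantURL, err := url.Parse("http://localhost:6333")
	if err != nil {
		log.Fatalf("failed to parse qdrant URL: %v", err)
	}
	store, err := qdrant.New(
		qdrant.WithURL(*qdrantURL),
		qdrant.WithCollectionName("langchaingo-ollama-rag"),
		qdrant.WithEmbedder(embedder),
	)
	if err != nil {
		log.Fatalf("failed to create qdrant store: %v", err)
	}
	return &store
}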

Read user input and query the database

func useRetriaver(store *qdrant.Store, prompt string, topk int) ([]schema.Document, error) {
	// vector search options
	optionsVector := []vectorstores.Option{
		vectorstores.WithScoreThreshold(0.80), // set the score threshold
	}

	// create the retriever
	retriever := vectorstores.ToRetriever(store, topk, optionsVector...)
	// run the search
	docRetrieved, err := retriever.GetRelevantDocuments(context.Background(), prompt)
	if err != nil {
		return nil, fmt.Errorf("failed to retrieve documents: %v", err)
	}

	// return the retrieved documents
	return docRetrieved, nil
}

Create and load large models

func getOllamaQwen() *ollama.LLM {
	// create a new ollama LLM with the model "qwen:1.8b"
	llm, err := ollama.New(
		ollama.WithModel("qwen:1.8b"),
		ollama.WithServerURL(ollamaServer))
	if err != nil {
		logger.Fatal("failed to create ollama model: %v", err)
	}
	return llm
}

Large model processing

Hand the retrieved content to the large language model for processing:

// GetAnswer gets the answer to a prompt
func GetAnswer(ctx context.Context, llm llms.Model, docRetrieved []schema.Document, prompt string) (string, error) {
	// create a new chat message history
	history := memory.NewChatMessageHistory()
	// add the retrieved documents to the history
	for _, doc := range docRetrieved {
		history.AddAIMessage(ctx, doc.PageContent)
	}
	// create a new conversation buffer backed by the history
	conversation := memory.NewConversationBuffer(memory.WithChatHistory(history))

	executor := agents.NewExecutor(
		agents.NewConversationalAgent(llm, nil),
		nil,
		agents.WithMemory(conversation),
	)
	// set chain call options
	options := []chains.ChainCallOption{
		chains.WithTemperature(0.8),
	}
	res, err := chains.Run(ctx, executor, prompt, options...)
	if err != nil {
		return "", err
	}

	return res, nil
}
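To make the flow concrete, here is a hypothetical wiring of the functions above into a single call; the file name, chunk sizes, and topk value are illustrative, and the actual project organizes this as a CLI command (go run main.go getanswer below):

// runGetAnswer is an illustrative end-to-end pipeline: split, store, retrieve, answer
func runGetAnswer(prompt string) {
	ctx := context.Background()
	store := newStore() // the sketch from the storage section above

	// split the knowledge base article and store the chunks in qdrant
	docs, err := TextToChunks("./knowledge.txt", 768, 64)
	if err != nil {
		logger.Fatal("failed to split document: %v", err)
	}
	if err := storeDocs(docs, store); err != nil {
		logger.Fatal("failed to store documents: %v", err)
	}

	// retrieve the chunks most relevant to the question
	docRetrieved, err := useRetriaver(store, prompt, 5)
	if err != nil {
		logger.Fatal("failed to retrieve documents: %v", err)
	}

	// hand the retrieved context plus the question to the LLM
	llm := getOllamaQwen()
	answer, err := GetAnswer(ctx, llm, docRetrieved, prompt)
	if err != nil {
		logger.Fatal("failed to get answer: %v", err)
	}
	fmt.Println(answer)
}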

Run application

go run main.go getanswer

Enter the question you want to ask, and the system prints its answer.

The output may be inaccurate due to insufficient learning material or the small model size; getting better results requires more corpus. Moreover, each parameter in the code (chunk size, chunk overlap, score threshold, temperature, and so on) must be tuned to the content, size, and format of the documents.

The source code of the project can be found at: https://github.com/hantmac/langchaingo-ollama-rag.git

