all Technical posts

Creating a LangChain App with Cognitive Search

This blog post will walk you through creating an AI chat application that uses AI search – also known as cognitive search.

Thomas Van Petegem

27 June 2024

This application will consult documentation before generating a response. This is also known as RAG – retrieval augmented generation. This means that the AI will search for the required information to answer the question before the actual question is posed to the LLM.

Requirements: Deployed Azure LLM, deployed embedding AI, Azure Search Service.

Creating AI Objects

Firstly, create your LLM object and template as seen in this previous blog post, ensuring that the context can be added to your prompt in the same way as the question.

Following this, create a cognitive search function. This involves creating an AI embeddings object, like the previously mentioned LLM object. This embeddings AI will scan through the documents and return the relevant documentation. It looks through all the information stored in your index, such as where the information is kept, and compares it to the user’s question.

Vector Store

Next, establish a vector store. This facilitates a connection with the index where your chunked data is stored. The process of chunking data and uploading it to your blob storage and index will be discussed in a following blog post. Chunking data means that a document is separated into blocks of text. These are then added to an index, which is then used to perform the cognitive search. This step requires an Azure search service. Input the endpoint and key of your search service when creating your vector store. Also, include the index name and your embedding model. These elements link your vector store to your embeddings AI, search service, and data index. Now, you can activate your vector store by conducting a similarity search. Input the question (query parameter), the number of results you want to return (the K parameter), and the type of search you wish to perform (query_type parameter).

Here’s an example of how you can create a function to perform a cognitive search:

	def searchdocs(question):
	embeddings = AzureOpenAIEmbeddings(model=embedding_api_type, azure_endpoint=openai_api_endpoint, api_key=openai_api_key)
	vector_store = AzureSearch(
	azure_search_endpoint=azure_search_endpoint,
	azure_search_key=azure_search_key,
	index_name=index_name,
	embedding_function=embeddings.embed_query,
	)
	searchedDocs = vector_store.similarity_search(
	query=question,
	k=20,
	search_type="hybrid",
	)
	return searchedDocs

view raw search.py hosted with ❤ by GitHub

Cognitive Search and Invoking the Chain

Now you can use this function to perform a cognitive search. Call this function, create your chain, and format the retrieved chunks into a single string using the page_content attribute.

	def format_docs(docs):
	return "\n\n".join(doc.page_content for doc in docs)

	searchedDocs = searchdocs(question)
	llm_chain = LLMChain.from_string(llm=llm, template=template)
	context = format_docs(searchedDocs)

view raw searchsnip.py hosted with ❤ by GitHub

	def format_docs(docs):
	return "\n\n".join(doc.page_content for doc in docs)

	searchedDocs = searchdocs(question)
	llm_chain = LLMChain.from_string(llm=llm, template=template)
	context = format_docs(searchedDocs)

view raw searchsnip.py hosted with ❤ by GitHub

Finally, invoke the chain and input the question and context.

	response = llm_chain.invoke(input={
	"question": question,
	"context": context
	})

view raw callLlm.py hosted with ❤ by GitHub

The process of posing a question, searching through the documents, and generating a response is relatively quick, even with many documents returned from the cognitive search.

Complete Code:

	import os
	from dotenv import load_dotenv
	from langchain_openai import AzureChatOpenAI, AzureOpenAIEmbeddings
	from langchain.chains import LLMChain
	from langchain.vectorstores.azuresearch import AzureSearch

	load_dotenv()
	openai_api_key = os.getenv('OPENAI_API_KEY')
	openai_api_endpoint = os.getenv('OPENAI_API_ENDPOINT')
	embedding_api_type = os.getenv('EMBEDDING_API_TYPE')
	azure_search_key = os.getenv('AZURE_SEARCH_KEY')
	azure_search_endpoint = os.getenv('AZURE_SEARCH_ENDPOINT')
	deployment_name = os.getenv('DEPLOYMENT_NAME')
	model_name = os.getenv('MODEL_NAME')
	deployment_name2 = os.getenv("DEPLOYMENT_NAME2")
	model_name2 = os.getenv("MODEL_NAME2")
	index_name = os.getenv("INDEX_NAME")

	def searchdocs(question):
	embeddings = AzureOpenAIEmbeddings(model=embedding_api_type, azure_endpoint=openai_api_endpoint, api_key=openai_api_key)
	vector_store = AzureSearch(
	azure_search_endpoint=azure_search_endpoint,
	azure_search_key=azure_search_key,
	index_name=index_name,
	embedding_function=embeddings.embed_query,
	)
	searchedDocs = vector_store.similarity_search(
	query=question,
	k=20,
	search_type="hybrid",
	)
	return searchedDocs
	def format_docs(docs):
	return "\n\n".join(doc.page_content for doc in docs)

	llm = AzureChatOpenAI(
	deployment_name=deployment_name2,
	model_name=model_name2,
	azure_endpoint=openai_api_endpoint,
	api_key=openai_api_key,
	max_tokens=1500,
	temperature=0.7,
	)

	template = """
	You are an AI assistant that answers the questions given, use the context if required.
	xxxxxxxxx
	Question to answer: {question}
	xxxxxxxxx
	Context: {context}
	xxxxxxxxx
	Assistant:"""

	question = input("Please input question: ")

	searchedDocs = searchdocs(question)
	llm_chain = LLMChain.from_string(llm=llm, template=template)
	context = format_docs(searchedDocs)
	response = llm_chain.invoke(input={
	"question": question,
	"context": context
	})

	print(response["text"])

view raw post2full.py hosted with ❤ by GitHub

Using Cognitive Search, RAG, helps you improve your apps. You can now ask questions about your own documentation. It will retrieve the desired information using a cognitive search and send the retrieved documentation with the query to the LLM. The resulting answer properly answers the question if the answer can be found in the documents.

Subscribe to our RSS feed

Semantic Kernel: Execute Plans As Part of your Chatbot Flow

In this blog post, we’ll explore how Semantic Kernel simplifies the complex, making the integration of AI not just possible but effortlessly efficient.

The Data Maturity Journey

Watch the video to see how Codit can help transform your data into actionable insights

A Simple Chat App with LangChain

In the final blog post of this series, we will have a look at how to make an AI-driven app in LangChain.

Want to know more?

Contact Francis

Chief Portfolio & Marketing Officer

Click here to get in touch

Hi there,
how can we help?

Got a project in mind?

Connect with us

Brussels Airlines’ Digital Transformation Takes Off

IoT Takes Bühler Group from Field to Fork

Going the Distance with Cloud-Connected Industrial Sensors

Swiss Re leverages Cloud Technology and Data Services for its Digital Risk Intelligence Solutions

Soudal is Digitally Transforming Sales in the Chemical Industry

Creating New Revenue Streams in Logistics by Connecting Data

Brussels Airlines’ Digital Transformation Takes Off

IoT Takes Bühler Group from Field to Fork

Going the Distance with Cloud-Connected Industrial Sensors

Swiss Re leverages Cloud Technology and Data Services for its Digital Risk Intelligence Solutions

Soudal is Digitally Transforming Sales in the Chemical Industry

Creating New Revenue Streams in Logistics by Connecting Data

Creating a LangChain App with Cognitive Search

Creating AI Objects

Vector Store

Cognitive Search and Invoking the Chain

Related articles

Want to know more?

Contact Francis

Hi there,
how can we help?

Let's talk

Let's talk

Thanks, we'll be in touch soon!

Call us

Send blog to my inbox

Thanks, we've sent the link to your inbox

Your download should start shortly!

What can we connect for you?

Brussels Airlines’ Digital Transformation Takes Off

IoT Takes Bühler Group from Field to Fork

Going the Distance with Cloud-Connected Industrial Sensors

Swiss Re leverages Cloud Technology and Data Services for its Digital Risk Intelligence Solutions

Soudal is Digitally Transforming Sales in the Chemical Industry

Creating New Revenue Streams in Logistics by Connecting Data

Creating a LangChain App with Cognitive Search

Creating AI Objects

Vector Store

Cognitive Search and Invoking the Chain

Related articles

Want to know more?

Contact Francis

Hi there,how can we help?

Let's talk

Let's talk

Thanks, we'll be in touch soon!

Call us

Send blog to my inbox

Thanks, we've sent the link to your inbox

Your download should start shortly!

Stay in Touch - Subscribe to Our Newsletter

Great you’re on the list!

What can we connect for you?

Hi there,
how can we help?