Summary
In today's world, we have an abundance of options when it comes to selecting large language models (LLMs) for developing AI-driven applications. These LLMs differ greatly in their architecture, training data, intended use cases, and prompting techniques. Additionally, they often need to integrate with other systems for data retrieval or performance monitoring, adding another layer of complexity. This work focuses on exploring the application of LangChain in the development and utilization of LLMs. Python functions and data files needed to run this notebook are available via this link.
The figure below shows the current state of large language models (LLMs):
Harrison Chase created LangChain in October 2022 to change all of that. LangChain is an open-source framework that helps developers connect LLMs, data sources, and other functionality under a single, unified syntax. With LangChain, developers can create scalable, modular LLM applications for almost any use.
LangChain encompasses an entire ecosystem of tools, but in this course, we'll focus on the core functionality of the LangChain library. We will learn about chains and tools, which we can use to improve the quality of our LLM's output, and also discuss troubleshooting and evaluation techniques. Note that LangChain is available in Python and JavaScript, but this course will only cover the Python version.
LangChain has three main components:
LLMs, for interacting with open-source and proprietary language models;
prompts, for turning user inputs into model inputs;
parsers, for organizing model outputs for easy retrieval.
The system also includes chains and agents for creating workflows that combine these components; the sketch below shows how they fit together.
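A minimal sketch, assuming the langchain, langchain-core, and langchain-openai packages are installed and an OpenAI API key is available; the prompt text and question are placeholders, not part of the course material:

from langchain_core.prompts import PromptTemplate
from langchain_core.output_parsers import StrOutputParser
from langchain_openai import OpenAI

prompt = PromptTemplate.from_template(
    "Answer the question: {question}"  # prompt: turns the user's input into a model input
)
llm = OpenAI(openai_api_key="...")     # LLM: the model that generates the text
parser = StrOutputParser()             # parser: returns the raw output as a clean string

chain = prompt | llm | parser          # chain: wires the components into one workflow
# chain.invoke({"question": "What is LangChain?"})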
Hugging Face is a huge repository of open source datasets, tools, and most importantly for us, models! Using language models hosted on Hugging Face is free, but using them in LangChain requires a Hugging Face API token.
To create one, log in or create a Hugging Face account, and navigate to the URL shown under settings https://huggingface.co/settings/tokens. Here, click on 'New token' and copy your key.
With our key ready, let's leverage LangChain to utilize a model from Hugging Face and compare it to an OpenAI model. LangChain provides an OpenAI class and a HuggingFaceHub class to interact seamlessly with their respective APIs. After importing these classes, we define the LLM using the model name and API key.
For Hugging Face, we'll use the Falcon 7B instruction-optimized model. We'll set up an unfinished sentence as input and have both models predict the next words. Finally, we'll print the results to compare the outputs. Despite using entirely different models from separate platforms, LangChain harmonizes their usage into a unified, modular workflow. In this case, the OpenAI model produces a longer response compared to the Falcon 7B instruction model. However, it’s important to note that a longer response doesn't necessarily equate to a better answer in all scenarios.
#pip install langchain
#pip install langchain-community
#pip install huggingface_hub
from langchain_community.llms import HuggingFaceHub
huggingfacehub_api_token = 'hf_.....'
llm = HuggingFaceHub(repo_id='tiiuae/falcon-7b-instruct',
huggingfacehub_api_token=huggingfacehub_api_token)
question = 'Can you still have fun if'
output = llm.invoke(question)
print(output)
Can you still have fun if you don't win? Yes, you can still have fun even if you don't win. Winning is not the only way to have fun. You can have fun by participating in activities you enjoy, spending time with friends and family, and finding new ways to challenge yourself and learn new skills.
OpenAI's models are particularly well-regarded in the AI/LLM community; their high performance is largely due to their proprietary technology and carefully curated training data. In contrast to the open-source models on Hugging Face, OpenAI's models do have costs associated with their use, but for many applications, they are currently the best choice to build on.
Due to LangChain's unified syntax, swapping one model for another only requires changing a small amount of code. In this exercise, you'll do just that!
To use OpenAI's models, you'll need an OpenAI API key. If you haven't created one of these before, first, visit their signup page. Next, navigate to the API keys page to create your secret key. If you've lost your key, you can create a new one here, too.
#pip install langchain-openai
from langchain_openai import OpenAI
import os
openai_api_key = "....."
llm = OpenAI(openai_api_key=openai_api_key)
question = 'Take care of your mental health if'
output = llm.invoke(question)
print(output)
you’re feeling alone 1. Practice self-care: Make sure you are taking care of yourself physically, emotionally, and mentally. This can include getting enough rest, eating well, exercising, and engaging in activities that bring you joy. 2. Reach out to loved ones: Don't be afraid to reach out to friends and family for support. They may not know how you are feeling unless you tell them. Talking to someone you trust can help you feel less alone. 3. Join a support group: Consider joining a support group where you can connect with others who are going through similar experiences. It can be comforting to know that you are not alone in your struggles. 4. Seek professional help: If you are feeling overwhelmed and struggling to cope, don't hesitate to seek help from a mental health professional. They can provide you with the tools and support you need to manage your feelings of loneliness. 5. Find a hobby or activity: Engaging in a hobby or activity that you enjoy can help you feel more connected to yourself and others. It can also give you a sense of purpose and fulfillment. 6. Practice mindfulness: Mindfulness techniques, such as meditation or deep breathing, can help you stay present and calm your mind. This can be especially helpful if you are
In this case, the OpenAI model produces a longer response than the Falcon 7B Instruct model. However, a longer response does not always mean a better answer. LangChain's tools can be used to optimize these outputs for our particular use case, and this will be one of the main focuses of this course.
In summary, LangChain is an excellent tool for working with natural language. In real-world production development, it enables intelligent interactions with documents, helping companies make informed business decisions, automate tasks, and explore new ways to analyze text data. LangChain simplifies AI integration while offering enhanced control over the entire workflow. For this course, we will use a specific version of LangChain: langchain==0.1.0. If you choose to work with a newer version on your own system, consult the LangChain documentation for any updates or changes.
Let’s use LangChain to implement prompting strategies for chatbots, applicable to both OpenAI chat models and open-source chat models available on Hugging Face. In addition to OpenAI's chat models, LangChain provides access to thousands of chat-optimized language models via the Hugging Face Hub API.
To find chat-optimized language models, visit the models section on Hugging Face and filter by the Question Answering task. Note down the model name for reference in your code. New models are regularly added to Hugging Face, expanding your options.
Many of these models are fine-tuned on domain-specific datasets, making them adept at understanding the nuances of particular regions, cultures, or tasks. Taking the time to search for the most suitable model for your specific use case is highly beneficial.
https://huggingface.co/models?pipeline_tag=question-answering&sort=trending
After selecting a model, we can begin prompting it by using a prompt template. Prompt templates serve as flexible and modular frameworks for generating prompts from user inputs. A template can include instructions, examples for few-shot prompting, and any additional context to help the model complete the task effectively.
Prompt templates are built using LangChain's PromptTemplate class. First, we create a structured template string designed to guide the AI in answering a question. The {question} field is defined for dynamic input insertion at runtime. To convert this string into a prompt template compatible with our model, we use the PromptTemplate class, specifying the variables representing inputs via the input_variables argument. Once the prompt template is ready, we can seamlessly integrate it into our model.
from langchain.prompts import PromptTemplate
template = "As an artificial intelligence assistant, please answer the question: {question}"
prompt = PromptTemplate(template=template, input_variables=["question"])
prompt
PromptTemplate(input_variables=['question'], input_types={}, partial_variables={}, template='As an artificial intelligence assistant, please answer the question: {question}')
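Before wiring the template into a chain, we can preview how it renders by filling the placeholder with .format(); the question string here is only an illustration:

# Fill the {question} placeholder to preview the final prompt string
print(prompt.format(question="What is LangChain?"))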
To begin passing user inputs, we combine the prompt template and the model into an LLMChain, then call the chain's .run() method, passing the input string.
from langchain.chains import LLMChain
from langchain_community.llms import HuggingFaceHub
llm = HuggingFaceHub(repo_id='tiiuae/falcon-7b-instruct',
huggingfacehub_api_token=huggingfacehub_api_token)
llm_chain = LLMChain(prompt=prompt, llm=llm)
question = "What is LangChain and how it can be used?"
print(llm_chain.run(question))
As an artificial intelligence assistant, please answer the question: What is LangChain and how it can be used? LangChain is a blockchain-based language learning platform that allows users to learn and earn cryptocurrency while learning a new language. It offers a gamified approach to language learning, where users can earn tokens by completing courses and interacting with the platform. The platform offers courses in various languages, including English, Spanish, French, and Mandarin. Users can also earn tokens by creating and sharing their own courses, as well as by participating in various community-driven activities.
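The LLMChain syntax above still works in this LangChain version, but the library also supports composing the same prompt and model directly with the runnable pipe syntax; a minimal equivalent sketch using the prompt and llm objects defined above:

# Equivalent chain built with the runnable (pipe) syntax instead of LLMChain
lcel_chain = prompt | llm
print(lcel_chain.invoke({"question": question}))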
LangChain offers specialized classes for working with chat models, such as ChatPromptTemplate and ChatOpenAI. The ChatOpenAI class provides chat-specific functionality beyond what the standard OpenAI class offers. You can instantiate the model as usual, ensuring to include your OpenAI API key.
To create a chat prompt template for the model, use the .from_messages() method of the ChatPromptTemplate class. This allows you to specify messages for the various OpenAI chat roles, including system, human, and ai. Like the standard PromptTemplate, input variables are denoted using curly brackets. Once the template is set up, inputs can be passed to it using the .format_messages() method. Finally, you can send the formatted prompt to the model to observe its response.
from langchain_openai import ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate
llm = ChatOpenAI(temperature=0, openai_api_key=openai_api_key)
prompt_template = ChatPromptTemplate.from_messages(
[
("system", "You are omnipotent."),
("human", "Answer this question: {question}")
]
)
full_prompt = prompt_template.format_messages(question='What is the reason for creating snakes?')
llm(full_prompt)
AIMessage(content='Snakes were created as part of the diverse ecosystem on Earth. They play important roles in controlling populations of rodents and other pests, helping to maintain balance in the food chain. Additionally, snakes have unique adaptations and characteristics that make them fascinating creatures to study and appreciate.', additional_kwargs={'refusal': None}, response_metadata={'token_usage': {'completion_tokens': 53, 'prompt_tokens': 29, 'total_tokens': 82, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-3.5-turbo-0125', 'system_fingerprint': None, 'finish_reason': 'stop', 'logprobs': None}, id='run-bdea1407-d4b7-452f-aa51-acbaa32e8e7e-0', usage_metadata={'input_tokens': 29, 'output_tokens': 53, 'total_tokens': 82, 'input_token_details': {'audio': 0, 'cache_read': 0}, 'output_token_details': {'audio': 0, 'reasoning': 0}})
Memory plays a crucial role in conversations with chat models. It enables features like follow-up questions, iterative refinement of model responses, and the ability for chatbots to adapt to user preferences and behaviors. While LangChain provides tools to customize and enhance in-conversation memory for chatbots, it remains constrained by the model's context window.
An LLM's context window refers to the maximum amount of input text the model can process at once when generating a response. The size of this window varies across different models. LangChain offers a standardized approach for optimizing model memory, and we’ll explore three key LangChain classes designed for implementing chatbot memory:
ChatMessageHistory
The message history stores the entire conversation between the user and the model, enabling follow-up questions and iterative refinement of responses. Let's implement this feature with an OpenAI model. First, we import the ChatMessageHistory and ChatOpenAI classes and define the LLM.
To initialize the conversation history, create an instance of ChatMessageHistory and assign it to a variable. We'll kick off the conversation with an AI message, which helps establish the tone and direction. Use the .add_ai_message() method to add the AI message to the history. Similarly, user messages can be added using the .add_user_message() method.
To pass these messages to the model, simply call the model on the messages attribute of the history object. And that's it: our conversational history is now integrated!
from langchain.memory import ChatMessageHistory
from langchain.chat_models import ChatOpenAI
chat = ChatOpenAI(temperature=0.2, openai_api_key=openai_api_key)
history = ChatMessageHistory()
history.add_ai_message("Hi! Ask me anything about Math.")
history.add_user_message("Describe cintral limit theorem?")
chat(history.messages)
AIMessage(content='The Central Limit Theorem states that the sampling distribution of the sample mean approaches a normal distribution as the sample size increases, regardless of the shape of the population distribution. In other words, if you take multiple random samples from a population and calculate the mean of each sample, the distribution of those sample means will be approximately normally distributed, even if the original population is not normally distributed. This theorem is a fundamental concept in statistics and is used in various statistical analyses and hypothesis testing.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 95, 'prompt_tokens': 26, 'total_tokens': 121, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-3.5-turbo', 'system_fingerprint': None, 'finish_reason': 'stop', 'logprobs': None}, id='run-faba9346-58fb-4779-93e9-02a049e89355-0')
history.add_user_message("Does this rule apply for all distributions like uniform, trangular and random?")
chat(history.messages)
AIMessage(content='The Central Limit Theorem (CLT) is a fundamental concept in statistics that states that the sampling distribution of the sample mean approaches a normal distribution as the sample size increases, regardless of the shape of the population distribution. This means that even if the original population distribution is not normal (e.g., uniform, triangular, exponential, etc.), the distribution of sample means will tend to be normal as the sample size increases.\n\nIn other words, the Central Limit Theorem applies to a wide range of distributions, not just normal distributions. As long as the sample size is sufficiently large (usually n ≥ 30 is considered a rule of thumb), the distribution of sample means will approximate a normal distribution, regardless of the shape of the original population distribution. This property makes the Central Limit Theorem a powerful tool in statistical inference and hypothesis testing.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 167, 'prompt_tokens': 45, 'total_tokens': 212, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-3.5-turbo', 'system_fingerprint': None, 'finish_reason': 'stop', 'logprobs': None}, id='run-4b4d84a3-2165-4631-ba38-9bc746b3406d-0')
history.add_user_message("Summarize the conversations")
chat(history.messages)
AIMessage(content='The Central Limit Theorem states that the sampling distribution of the sample mean will be approximately normally distributed, regardless of the shape of the original population distribution, as long as the sample size is sufficiently large. This theorem applies to a wide range of distributions, including uniform, triangular, and random distributions. In summary, the Central Limit Theorem allows us to make inferences about population parameters based on sample means, even if the original population distribution is not normal.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 91, 'prompt_tokens': 54, 'total_tokens': 145, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-3.5-turbo', 'system_fingerprint': None, 'finish_reason': 'stop', 'logprobs': None}, id='run-48305e52-6cb0-4b14-bbe9-dc9ef4a4327b-0')
We can use different tools to manage memory usage in LLM applications, and we can even integrate external data to give the models even more context. These tools store and process information in different ways and at different speeds as responses are generated, so no single solution fits every case.
Retrieved from DataCamp course: "Developing LLM Applications with LangChain"
ConversationBufferMemory
The first memory tool we'll explore is ConversationBufferMemory. It provides the application with a buffer of the conversation's messages, which is passed back to the model as context for each new response. Note that ConversationBufferMemory keeps the full history; for a rolling window that retains only the most recent exchanges and discards older ones, LangChain provides ConversationBufferWindowMemory with a k argument (see the sketch after this example).
To integrate this memory type with a model, we use a specialized chain designed for conversations: ConversationChain. Additionally, setting verbose=True allows the model to display its decision-making process alongside its results, providing greater transparency.
from langchain.memory import ConversationBufferMemory
from langchain_openai import OpenAI
from langchain.chains import ConversationChain
chat = OpenAI(model_name="gpt-3.5-turbo-instruct", temperature=1, openai_api_key=openai_api_key)
memory = ConversationBufferMemory(size=4)
chain_buffer = ConversationChain(llm=chat, memory=memory, verbose=True)
Let's pass the chain a series of inputs. With verbose=True, the chain prints the formatted prompt, including the current memory state, before generating each response, so you can see exactly what context the model had. The prompt for the final question contains the context accumulated from the previous messages.
chain_buffer.predict(input="Describe a logistic regression in two sentence")
chain_buffer.predict(input="When it can be applied?")
chain_buffer.predict(input="What are its limitation?")
chain_buffer.predict(input="What was my second question? I forgot.")
> Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: Human: Describe a logistic regression in two sentence AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: Human: Describe a logistic regression in two sentence AI: Logistic regression is a statistical method used for predicting the probability of a binary outcome based on one or more independent variables. It uses a logistic function to map the input variables to a linear regression model and outputs a probability between 0 and 1, with values closer to 1 indicating a higher likelihood of the event occurring. Human: When it can be applied? AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: Human: Describe a logistic regression in two sentence AI: Logistic regression is a statistical method used for predicting the probability of a binary outcome based on one or more independent variables. It uses a logistic function to map the input variables to a linear regression model and outputs a probability between 0 and 1, with values closer to 1 indicating a higher likelihood of the event occurring. Human: When it can be applied? AI: Logistic regression can be applied in various fields such as marketing, finance, biostatistics, and social sciences. It is commonly used for predicting binary outcomes, such as whether a customer will make a purchase or if a patient will respond to a new medication. Human: What are its limitation? AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: Human: Describe a logistic regression in two sentence AI: Logistic regression is a statistical method used for predicting the probability of a binary outcome based on one or more independent variables. It uses a logistic function to map the input variables to a linear regression model and outputs a probability between 0 and 1, with values closer to 1 indicating a higher likelihood of the event occurring. Human: When it can be applied? AI: Logistic regression can be applied in various fields such as marketing, finance, biostatistics, and social sciences. It is commonly used for predicting binary outcomes, such as whether a customer will make a purchase or if a patient will respond to a new medication. Human: What are its limitation? AI: Like any statistical model, logistic regression has its limitations. It assumes that the relationship between the independent variables and the outcome is linear, and it is not suitable for predicting continuous outcomes. 
It also assumes that the observations are independent, which may not be true in some cases. Additionally, it is sensitive to overfitting and requires a large sample size to accurately estimate the coefficients. Human: What was my second question? I forgot. AI: > Finished chain.
' Your second question was "When can it be applied?" and I provided several examples of potential applications for logistic regression. Did you have any other questions for me?'
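As noted above, ConversationBufferMemory keeps the entire history. If you want a true rolling window that retains only the last few exchanges, LangChain provides ConversationBufferWindowMemory; a minimal sketch reusing the chat model defined above (the k value here is only an illustration):

# Rolling-window variant: keep only the most recent exchanges in memory
from langchain.memory import ConversationBufferWindowMemory

window_memory = ConversationBufferWindowMemory(k=2)  # retain the last two exchanges
chain_window = ConversationChain(llm=chat, memory=window_memory, verbose=True)
# chain_window.predict(input="Describe a logistic regression in two sentences")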
ConversationSummaryMemory
Summarizing key points from a conversation is another effective way to optimize memory. The ConversationSummaryMemory class condenses the conversation over time, allowing the chat model to retain important context without needing to store the entire conversation history.
To implement this, we'll use the ConversationChain again. First, we instantiate the model that will handle the conversation. Unlike ConversationBufferMemory, ConversationSummaryMemory requires an LLM as an argument to generate summaries. Essentially, with each new message, an LLM call is made to summarize the conversation history. Defining the conversation chain is as straightforward as before, showcasing the simplicity and modularity of the LangChain framework.
from langchain.memory import ConversationSummaryMemory
chat = OpenAI(model_name="gpt-3.5-turbo-instruct", temperature=0, openai_api_key=openai_api_key)
memory = ConversationSummaryMemory(llm=OpenAI(model_name="gpt-3.5-turbo-instruct",
openai_api_key=openai_api_key))
chain_summary = ConversationChain(llm=chat, memory=memory, verbose=True)
Let's pass the model a series of inputs using ConversationSummaryMemory. The verbose output shows the memory used to respond to each input; for the final question, it is a running summary of the earlier inputs and responses.
chain_summary.predict(input="Today is very rainy but had to go outside to buy some grocery")
chain_summary.predict(input="I had invited some of my frineds to my house for dinner party but did not have meat, wine and fruits?")
chain_summary.predict(input="Unfortunately, although I prepared all the foods for dinner, no one showed up because of sever thonder storm ?")
chain_summary.predict(input="I do not know what I should do with all the foods. Can you help?")
chain_summary.predict(input="Any suggenstion to ensure something like this will not happen to me?")
> Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: Human: Today is very rainy but had to go outside to buy some grocery AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: The human mentions going outside in the rain to buy groceries. The AI expresses sympathy and asks what kind of groceries were needed. Human: I had invited some of my frineds to my house for dinner party but did not have meat, wine and fruits? AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: The human mentions going outside in the rain to buy groceries. The AI expresses sympathy and asks what kind of groceries were needed. The human reveals they were planning a dinner party but didn't have meat, wine, and fruits. The AI expresses further sympathy and asks what specific types of these items were needed. Human: Unfortunately, although I prepared all the foods for dinner, no one showed up because of sever thonder storm ? AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: The human mentions going outside in the rain to buy groceries. The AI expresses sympathy and asks what kind of groceries were needed. The human reveals they were planning a dinner party but didn't have meat, wine, and fruits. The AI expresses further sympathy and asks what specific types of these items were needed. The human then explains that the dinner party was cancelled due to a severe thunderstorm and the AI expresses even more sympathy, asking about the specific groceries that were needed for the party. Human: I do not know what I should do with all the foods. Can you help? AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: The human mentions going outside in the rain to buy groceries. The AI expresses sympathy and asks what kind of groceries were needed. The human reveals they were planning a dinner party but didn't have meat, wine, and fruits. The AI expresses further sympathy and asks what specific types of these items were needed. The human then explains that the dinner party was cancelled due to a severe thunderstorm and the AI expresses even more sympathy, asking about the specific groceries that were needed for the party. 
The human expresses uncertainty about what to do with the food and the AI offers to help by suggesting creative recipes using the ingredients available. Human: Any suggenstion to ensure something like this will not happen to me? AI: > Finished chain.
" I'm sorry to hear that your dinner party was cancelled due to the thunderstorm. What specific groceries were you planning to buy for the party? Perhaps I can help you come up with some creative recipes using the ingredients you already have."
from langchain.memory import ConversationSummaryMemory
chat = OpenAI(model_name="gpt-3.5-turbo-instruct", temperature=0, openai_api_key=openai_api_key)
memory_new = ConversationSummaryMemory(llm=OpenAI(model_name="gpt-3.5-turbo-instruct",
openai_api_key=openai_api_key))
chain_summary_new = ConversationChain(llm=chat, memory=memory_new, verbose=True)
chain_summary_new.predict(input="""Mia had always loved the quaint little bookshop at
the corner of Maple Street. Every Saturday morning, she would wander through its
narrow aisles, the scent of aged paper and ink a comforting presence. It was
on one such morning, while perusing the dusty shelves in the back, that she
stumbled upon an old, leather-bound journal. The cover was worn, and the pages
yellowed with time, but something about it called to her. As she opened it,
she realized it wasn't just any journal—it was filled with intricate drawings and
cryptic notes, seemingly leading to a hidden treasure.
""")
chain_summary_new.predict(input="""Intrigued by the mystery, Mia decided to follow the
clues detailed within the journal. Each drawing seemed to depict a different landmark
in her town, places she had known all her life but never looked at closely. She spent
the next few weeks deciphering the codes and visiting these locations, discovering hidden
messages and secret symbols carved into stone and wood. As the pieces of the puzzle started
to come together, Mia felt a sense of adventure she hadn't experienced since childhood.
The thrill of the hunt consumed her, and she began to dream of what the treasure might be.
""")
chain_summary_new.predict(input="""One evening, as the sun set in a blaze of orange and pink,
Mia found herself standing in front of an old, abandoned lighthouse at the edge of town,
the final location marked in the journal. Heart pounding with anticipation, she climbed
the rickety stairs to the top, where she discovered a small, rusted box hidden under a
loose floorboard. Inside, instead of gold or jewels, she found a collection of letters
and photographs from the early 1900s, telling the love story of a young couple separated
by war. Mia realized that the true treasure was not material wealth, but the poignant
history and love preserved in those letters, a testament to enduring love and the
passage of time. As she read through the heartfelt words, she felt a deep connection
to the past and a newfound appreciation for the stories hidden in her own town.
""")
chain_summary_new.predict(input="make summary of two sentences?")
> Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: Human: Mia had always loved the quaint little bookshop at the corner of Maple Street. Every Saturday morning, she would wander through its narrow aisles, the scent of aged paper and ink a comforting presence. It was on one such morning, while perusing the dusty shelves in the back, that she stumbled upon an old, leather-bound journal. The cover was worn, and the pages yellowed with time, but something about it called to her. As she opened it, she realized it wasn't just any journal—it was filled with intricate drawings and cryptic notes, seemingly leading to a hidden treasure. AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: The human describes Mia's love for a quaint bookshop on Maple Street and her chance discovery of an old journal filled with cryptic notes and drawings. The AI expresses fascination and offers to provide more information about the bookshop. Human: Intrigued by the mystery, Mia decided to follow the clues detailed within the journal. Each drawing seemed to depict a different landmark in her town, places she had known all her life but never looked at closely. She spent the next few weeks deciphering the codes and visiting these locations, discovering hidden messages and secret symbols carved into stone and wood. As the pieces of the puzzle started to come together, Mia felt a sense of adventure she hadn't experienced since childhood. The thrill of the hunt consumed her, and she began to dream of what the treasure might be. AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: The human describes Mia's discovery of a mysterious journal in a quaint bookshop on Maple Street. Intrigued, Mia follows the clues within the journal and uncovers hidden messages in familiar landmarks. The AI expresses fascination and reveals that the bookshop has a history of puzzles and riddles, possibly left behind by the previous owner. The human and AI discuss the excitement and adventure that such a simple place can hold. Human: One evening, as the sun set in a blaze of orange and pink, Mia found herself standing in front of an old, abandoned lighthouse at the edge of town, the final location marked in the journal. Heart pounding with anticipation, she climbed the rickety stairs to the top, where she discovered a small, rusted box hidden under a loose floorboard. Inside, instead of gold or jewels, she found a collection of letters and photographs from the early 1900s, telling the love story of a young couple separated by war. Mia realized that the true treasure was not material wealth, but the poignant history and love preserved in those letters, a testament to enduring love and the passage of time. 
As she read through the heartfelt words, she felt a deep connection to the past and a newfound appreciation for the stories hidden in her own town. AI: > Finished chain. > Entering new ConversationChain chain... Prompt after formatting: The following is a friendly conversation between a human and an AI. The AI is talkative and provides lots of specific details from its context. If the AI does not know the answer to a question, it truthfully says it does not know. Current conversation: The human describes Mia's discovery of a mysterious journal in a quaint bookshop on Maple Street. Intrigued, Mia follows the clues within the journal and uncovers hidden messages in familiar landmarks. The AI expresses fascination and reveals that the bookshop has a history of puzzles and riddles, possibly left behind by the previous owner. The human and AI discuss the excitement and adventure that such a simple place can hold. They also discuss Mia's later discovery of a collection of letters and photographs hidden in an old lighthouse, showing the power of stories and history in unexpected places. Human: make summary of two sentences? AI: > Finished chain.
' Sure, the summary of the conversation is that Mia discovered a mysterious journal in a quaint bookshop on Maple Street and followed its clues to uncover hidden messages in familiar landmarks. The AI reveals that the bookshop has a history of puzzles and riddles, possibly left behind by the previous owner. Mia also discovered a collection of letters and photographs hidden in an old lighthouse, showing the power of stories and history in unexpected places.'
Pre-trained language models do not have direct access to private or proprietary data sources. The process of integrating these data sources into the model's response generation is called Retrieval-Augmented Generation (RAG).
This process begins with a user query, which is sent to an application built using a framework like LangChain. The query is then converted into a vector representation.
The application then searches a vector database for the documents most relevant to the user's query, ranking them by relevance using a selected distance metric.
Next, the most relevant documents from the vector database are combined with the user's query and sent to the model. The model processes this combined information and generates a response, which is then returned to the user via the application.
There are three main steps in developing Retrieval-Augmented Generation (RAG) with LangChain:
loading the documents with document loaders;
splitting the documents into chunks;
embedding and storing the chunks in a vector database for retrieval.
LangChain offers over 160 document loaders, some of which are provided by third parties that handle unique document formats. These loaders support a wide variety of sources, including Amazon S3, Microsoft, Google Cloud, Jupyter notebooks, pandas DataFrames, unstructured HTML, YouTube audio transcripts, and more. LangChain provides excellent documentation for all of its document loaders, and you'll find that the implementation for different formats is often quite similar.
LangChain offers various types of PDF loaders, with detailed documentation available for each. In this tutorial, we'll begin with the PyPDFLoader. This class loads one document per page, including the PDF metadata. To use it, we instantiate the PyPDFLoader class and provide the path to the PDF file we want to load. We then use the .load() method to load the document and assign the result to a variable, such as data. After loading the document, we can check the output to confirm that it has been loaded successfully. Keep in mind that PyPDFLoader requires the pypdf package as a dependency.
#pip install pypdf
from langchain_community.document_loaders import PyPDFLoader
loader = PyPDFLoader("data/2024_February.pdf")
data = loader.load()
data[0]
Document(metadata={'source': 'data/2024_February.pdf', 'page': 0}, page_content='Pre Authorized Amount to be Withdrawn Mar 11.....$ 463.31\nIf payment is received after 2024 March 11,the following late payment fees will apply:\nA one-time late payment fee of 3.25% on Current Charges.\nSummary of Your Account\nPrevious Charges and Credits\nPrevious balance ..............................................................$ 362.97\nPayment we processed on FEB 12. Thank you................$ 362.97 CR\nBalance Forward ............................................................$ 0.00\nElectricity ....................................(GST: $5.11) $ 102.19\nNatural Gas ................................(GST: $11.21) $ 224.10\nSubtotal ............................................................................$ 326.29\nWater Treatment and Supply............................................$ 33.11\nWastewater Collection and Treatment..............................$ 44.77\nStormwater Management .................................................$ 14.59\nWaste and Recycling........................................................$ 28.23\nSubtotal ............................................................................$ 120.70\nTotal GST .........................................................................$ 16.32\nTotal Current Charges .....................................................$ 463.31\nTotal Amount Due ............................................................$ 463.31')
When loading CSVs, the syntax is very similar, but instead we use the CSVLoader class, and plain text files follow the same pattern with the TextLoader class. We're seeing a pattern forming!
from langchain_community.document_loaders.csv_loader import CSVLoader
loader = CSVLoader(file_path='./data/Churn_Modelling.csv')
data = loader.load()
#data[:4]
from langchain_community.document_loaders import TextLoader
loader = TextLoader("./data/Paper.txt", encoding = 'UTF-8')
data = loader.load()
len(data)
1
Different document loaders in LangChain are designed to work with various document formats, but the overall syntax for loading documents remains consistent. For third-party document formats, many libraries are available. For example, we can use the Hacker News Loader to retrieve the top stories from Hacker News through its URL.
Hacker News (HN) is a social news platform focused on computer science, technology, and entrepreneurship, run by the startup incubator Y Combinator. The site allows content submissions that cater to "anything that gratifies one's intellectual curiosity."
To load data from Hacker News, we use loader.load() and assign the result to a variable. After loading, we can inspect the first element of the data to check its contents. Additionally, the document metadata can be accessed by querying the metadata attribute.
from langchain_community.document_loaders import HNLoader
loader = HNLoader("https://news.ycombinator.com")
data = loader.load()
data
[]
Document splitting refers to dividing a loaded document into smaller segments, known as chunks. Chunking is especially helpful for ensuring long documents fit within an LLM's context window. A basic approach could involve splitting the document into lines as they appear in the text. While simple to implement, this method may be problematic, as key context for understanding one line might be located in a different line, leading to incomplete processing.
There isn't a one-size-fits-all strategy for document splitting. Instead, it's a matter of experimenting with various methods to find the optimal balance between preserving context and managing chunk size. LangChain offers two primary document splitting methods:
CharacterTextSplitter divides the text on a single specified separator, measuring chunk size in characters;
RecursiveCharacterTextSplitter splits the text recursively, trying multiple separators until each chunk fits within the specified size limit.
When splitting documents into chunks, chunk overlap is crucial for maintaining context across chunks. Consider two adjacent chunks that share a region of overlapping text: that repeated text helps preserve context between them. If a model struggles with losing context or misunderstanding information when generating responses from external sources, increasing the chunk overlap may improve accuracy and coherence.
CharacterTextSplitter
quote = """Success is not final, failure is not fatal: It is the courage to continue that counts.
The journey of a thousand miles begins with one step. In the end, we will remember not
the words of our enemies, but the silence of our friends."""
len(quote)
233
chunk_size = 30
chunk_overlap = 10
from langchain.text_splitter import CharacterTextSplitter
ct_splitter = CharacterTextSplitter(
separator = '.',
chunk_size = chunk_size,
chunk_overlap = chunk_overlap)
docs = ct_splitter.split_text(quote)
print(docs)
Created a chunk of size 85, which is longer than the specified 30 Created a chunk of size 54, which is longer than the specified 30
['Success is not final, failure is not fatal: It is the courage to continue that counts', 'The journey of a thousand miles begins with one step', 'In the end, we will remember not \nthe words of our enemies, but the silence of our friends']
RecursiveCharacterTextSplitter
from langchain.text_splitter import RecursiveCharacterTextSplitter
rc_splitter = RecursiveCharacterTextSplitter(
chunk_size = chunk_size,
chunk_overlap = chunk_overlap)
docs = rc_splitter.split_text(quote)
print(docs)
['Success is not final, failure', 'failure is not fatal: It is', 'It is the courage to continue', 'continue that counts.', 'The journey of a thousand', 'thousand miles begins with', 'with one step. In the end, we', 'end, we will remember not', 'the words of our enemies, but', 'but the silence of our', 'of our friends.']
RecursiveCharacterTextSplitter with HTML
The splitting functionality in LangChain is not limited to plain text and can be applied to other document formats like PDFs and HTML. As we've seen earlier, LangChain provides specialized document loader classes for various formats. For example, to load an HTML document, we can use the UnstructuredHTMLLoader class and its .load() method.
In this case, we have a local HTML file (here, an overview of a machine learning course) that isn't part of the model's training data. After loading the document, we can apply a splitter to break it into chunks. For loaded document objects such as HTML, rather than raw strings, we use the .split_documents() method instead of .split_text() to perform the splitting operation.
This approach ensures that, regardless of the document format, we can process the text efficiently for use in language models while maintaining the context necessary for accurate responses.
#pip install unstructured
from langchain_community.document_loaders import UnstructuredHTMLLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
loader = UnstructuredHTMLLoader("data/Machine_Learning_Overview.html")
data = loader.load()
rc_splitter = RecursiveCharacterTextSplitter(
chunk_size = chunk_size,
chunk_overlap = chunk_overlap,
separators = ['.']
)
html = rc_splitter.split_documents(data)
print(html[0])
page_content='Table of Contents 1 ENPE 519' metadata={'source': 'data/Machine_Learning_Overview.html'}
Now that we've explored document loading and splitting, let's complete the Retrieval-Augmented Generation (RAG) workflow by focusing on storing and retrieving information using vector databases.
To make documents retrievable, we first encode and embed the chunks, transforming them into vector representations that capture the content and meaning of the text. These vectors are then stored in a vector database, where each chunk is indexed based on its similarity. This allows the system to quickly evaluate and retrieve relevant chunks when queried.
The vector database plays a crucial role in efficiently managing the chunks and their similarity scores, ensuring that only the most relevant chunks are retrieved during the RAG process. This enables faster, more accurate responses by directly connecting user queries to the most relevant information.
LangChain offers a variety of vector databases, each with its own advantages, and some may not be suitable for certain use cases.
When choosing a solution, consider whether an open-source option is necessary.
It's also important to evaluate the legal implications of storing data on external servers—cloud-based storage may not be permissible in all cases.
The required storage capacity should also be factored in. While a lightweight in-memory database may suffice in some scenarios, others may need more robust solutions.
In this blog, we will focus on Chroma due to its lightweight nature and ease of setup.
To split and store the text, use the .split_text() method. For documents, use the .split_documents() method.
quote = """Success is not final, failure is not fatal: It is the courage to continue that counts.
The journey of a thousand miles begins with one step. In the end, we will remember not
the words of our enemies, but the silence of our friends."""
from langchain.text_splitter import RecursiveCharacterTextSplitter
chunk_size = 40
chunk_overlap = 15
splitter = RecursiveCharacterTextSplitter(
chunk_size=chunk_size,
chunk_overlap=chunk_overlap
)
docs = splitter.split_text(quote)
docs
['Success is not final, failure is not', 'failure is not fatal: It is the courage', 'is the courage to continue that counts.', 'that counts.', 'The journey of a thousand miles begins', 'miles begins with one step. In the end,', 'In the end, we will remember not', 'the words of our enemies, but the', 'but the silence of our friends.']
Now that we've parsed the data, it's time to embed it. We need to select an embeddings model to transform the data and store it in the vector database. There are hundreds of embedding models available on Hugging Face, which you can explore to find the best fit for your use case. To use them, you'll also need an additional library such as sentence-transformers. However, in this section and the upcoming examples, we'll be using an OpenAI model for embeddings instead.
#pip install chromadb
from langchain_openai import OpenAIEmbeddings
from langchain_community.vectorstores import Chroma
embedding_function = OpenAIEmbeddings(openai_api_key=openai_api_key)
docstorage = Chroma.from_texts(docs, embedding_function)
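Before wiring the store into a question-answering chain, we can query it directly to see which chunks are retrieved for a given question; a small sketch using the docstorage object just created (the query string is only an illustration):

# Query the vector store directly to see which chunks are most similar to a question
retrieved = docstorage.similarity_search("What counts in the face of failure?", k=2)
for doc in retrieved:
    print(doc.page_content)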
from langchain.chains import RetrievalQA
qa = RetrievalQA.from_chain_type(llm=OpenAI(model_name="gpt-3.5-turbo-instruct",
openai_api_key=openai_api_key,),
chain_type="stuff", retriever=docstorage.as_retriever())
query = "What is failure?"
print(qa.run(query))
Failure is not fatal. It is the courage to continue that counts.
loader = PyPDFLoader('data/Paper.pdf')
data = loader.load()
splitter = RecursiveCharacterTextSplitter(
chunk_size=250,
chunk_overlap=60,
separators=['.'])
paper = splitter.split_documents(data)
embedding_model = OpenAIEmbeddings(openai_api_key=openai_api_key)
docstorage_paper = Chroma.from_documents(paper, embedding_model)
# Define the RetrievalQA chain that will answer questions over the paper
qa = RetrievalQA.from_chain_type(
OpenAI(model_name="gpt-3.5-turbo-instruct", temperature=0.2, openai_api_key=openai_api_key),
chain_type="stuff", retriever=docstorage_paper.as_retriever()
)
# Run the query on the documents
question = "What is data leakage?"
results = qa(question)
print(results["result"])
Data leakage is when information from the validation data is unintentionally used in the model, leading to biased results. It can also refer to a highly skewed distribution of data that affects the accuracy of predictive models.
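Calling the chain with qa(question) still works in this LangChain version; the same query can also be issued with .invoke(), which returns the same dictionary, as in this minimal equivalent:

# Equivalent call using .invoke(); the response dictionary exposes the same "result" key
results = qa.invoke({"query": question})
print(results["result"])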
from langchain_community.llms import HuggingFaceHub
huggingfacehub_api_token = 'hf_.....'
llm = HuggingFaceHub(repo_id='mistralai/Mistral-7B-Instruct-v0.2',
huggingfacehub_api_token=huggingfacehub_api_token)
question = 'Can you still have fun if'
output = llm.invoke(question)
print(output)
Can you still have fun if you’re not drinking? Absolutely! Here are some ideas for a fun and alcohol-free night out in the city. 1. Explore the night markets Toronto has a vibrant night market scene, and they’re a great place to spend an evening without drinking. The Kensington Market Night Market is a popular choice, with a variety of food vendors, live music, and local artists selling their wares. The St. Lawrence Market Night is another great
#pip install sentence-transformers
from langchain_community.embeddings.sentence_transformer import (
SentenceTransformerEmbeddings,)
# create the open-source embedding function
embedding_function = SentenceTransformerEmbeddings(model_name="all-MiniLM-L6-v2")
from chromadb.errors import InvalidDimensionException
loader = PyPDFLoader('data/Paper.pdf')
data = loader.load()
splitter = RecursiveCharacterTextSplitter(
chunk_size=500,
chunk_overlap=20,
separators=['.'])
paper = splitter.split_documents(data)
# If an existing Chroma collection was built with a different embedding dimension,
# delete it and rebuild (workaround for InvalidDimensionException).
try:
docstorage_paper = Chroma.from_documents(paper, embedding_function)
except InvalidDimensionException:
Chroma().delete_collection()
docstorage_paper = Chroma.from_documents(paper, embedding_function)
# Define the RetrievalQA chain to answer questions over the stored documents
qa = RetrievalQA.from_chain_type(llm,
chain_type="stuff", retriever=docstorage_paper.as_retriever()
)
# Run the query on the documents
question = "What is advantage of Multivariate Bootstrapping?"
results = qa(question)
print(results["result"])
Use the following pieces of context to answer the question at the end. If you don't know the answer, just say that you don't know, don't try to make up an answer. . Multivariate Bootstrapping (Khan and Deutsch, 2016; Rezvandehy and Deutsch, 2017; Rezvandehy et al., 2019) is another approach that can be applied to quantify the uncer- tainty in the distribution of each feature, and then replace each missing value with a random sample from the distribution. This approach is fast, quantifies the uncertainty in imputation and the correla- tion between features are reproduced. However, the missing data cannot be simulated by conditioning on non-missing values Data-Centric Engineering 9 Figure 4. a) Cholesky decomposition of correlation matrix for 𝑛 features (well properties). b) LU unconditional simulation. c) LU conditional simulation.. efficient because it just picks a random instance at every iteration and computes the gradients based only on that single instance (Bottou, 2012; Géron, 2019). • Logistic Regression is a simple approach to estimate the probability of a particular class . There are complex techniques such as multivariate imputation by chained equation (MICE) (Buuren and Groothuis-Oudshoorn, 2010) and deep learning (DataWing, 2022). However, the imputation using these techniques can be quite slow and computationally expensive for large datasets. They may also need special software, distributional assumption, and the uncertainty in imputation of missing data can not be taken into account . To evaluate the efficiency of the proposed imputation technique a synthetic example is considered with four correlated features with 10000 data as shown in Fig6-a. Features 1 and 2 are Gaussian and lognormal distributions, respectively while features 3 and 4 are triangular distributions with different statistics (mean and mode). Fig6-b shows the correlation matrix between features (below diagonal elements) and percentage of missing data for each bivariate feature (above diagonal elements) Question: What is advantage of Multivariate Bootstrapping? Helpful Answer: Multivariate Bootstrapping is an advantageous approach for imputing missing data because it is fast and efficiently quantifies the uncertainty in the imputation for each feature. It also reproduces the correlation between features. However, it cannot simulate missing data by conditioning on non-missing values.
print(results["result"].split("Helpful Answer:",1)[1])
Multivariate Bootstrapping is an advantageous approach for imputing missing data because it is fast and efficiently quantifies the uncertainty in the imputation for each feature. It also reproduces the correlation between features. However, it cannot simulate missing data by conditioning on non-missing values.
from chromadb.errors import InvalidDimensionException
#loader = PyPDFLoader('data/Paper.pdf')
loader = PyPDFLoader('./pdfs/offer_test.pdf')
data = loader.load()
splitter = RecursiveCharacterTextSplitter(
chunk_size=100,
chunk_overlap=100,
separators=['.'])
paper = splitter.split_documents(data)
# If an existing Chroma collection was built with a different embedding dimension,
# delete it and rebuild (workaround for InvalidDimensionException).
try:
docstorage_paper = Chroma.from_documents(paper, embedding_function)
except InvalidDimensionException:
Chroma().delete_collection()
docstorage_paper = Chroma.from_documents(paper, embedding_function)
# Define the RetrievalQA chain to answer questions over the stored documents
qa = RetrievalQA.from_chain_type(llm,
chain_type="stuff", retriever=docstorage_paper.as_retriever()
)
# Run the query on the documents
question = """What is the conclusion of this validation?
The output should be one of these categories: "Unacceptable", "Acceptable but remediation required", "Acceptable but improvement required", "Unacceptable"
"""
## Run the query on the documents
#question = """Extract the following values described in table 1a: "Severity", "Observation Description".
#
# Expected output:
# {{"Severity": "Medium", "Observation Description": Model Execution * MRM noticed that the model development team did not have UAT tests ( parallel test and User Acceptance Test) for this model, "Severity": "High", "Observation Description": Model Building Code ● MRM reviewed the code presented by the model owner and notes that the code to create Target=0 is not correct.}}
#
#?"""
results = qa(question)
#print(results["result"])
print(results["result"].split("Helpful Answer:",1)[1])
The conclusion of this validation is "Acceptable but improvement required". During the validation process, MRM identified some issues with the model, specifically instances of negative class being misclassified as positive class and vice versa. While the overall accuracy of the model was not specified in the context provided, it is mentioned that improvements are required. The validation also ensured that overfitting did not occur by comparing the performance of the training and validation sets.
LCEL (LangChain Expression Language) is a crucial component of the LangChain toolkit, enabling the connection of prompts, models, and retrieval components through a pipe operator, rather than relying on task-specific classes. It also allows for the creation of complex workflows that are well-suited for production environments. These chains support batch processing, streaming, and asynchronous execution, making it easy to integrate with other LangChain tools such as LangSmith and LangServe.
To begin using LCEL for a chatbot, we first define the chat model and prompt template, just as we have done previously. Instead of using a chain class, we integrate the prompt template into the model using LCEL's pipe operator. To execute the chain, we use the .invoke()
method, passing in the prompt template’s input variables as a dictionary. The response will be an AIMessage object, with the result located in the content argument.
# OpenAI
from langchain.chat_models import ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate
model = ChatOpenAI(openai_api_key=openai_api_key)
prompt = ChatPromptTemplate.from_template("""As you are a capable personal assistant, would you answer the
following question: {question}""")
chain = prompt | model
print(chain.invoke({"question": "Can we travel to Mars in 2050?"}))
content='As of now, there are plans and ongoing research by various space agencies and private companies to send humans to Mars in the near future, with some aiming for as early as the 2030s. However, there are still numerous challenges that need to be overcome before a manned mission to Mars can be successfully carried out, including technological, logistical, and health considerations. While it is possible that we may see a manned mission to Mars by 2050, it is difficult to predict with certainty at this time.' additional_kwargs={} response_metadata={'token_usage': {'completion_tokens': 102, 'prompt_tokens': 33, 'total_tokens': 135, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-3.5-turbo', 'system_fingerprint': None, 'finish_reason': 'stop', 'logprobs': None} id='run-c66e73b2-27cd-4373-a28d-a6b8bea8b545-0'
# HuggingFace
from langchain_core.prompts import ChatPromptTemplate
from langchain_community.llms import HuggingFaceHub
huggingfacehub_api_token = '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'
llm_hf = HuggingFaceHub(repo_id='tiiuae/falcon-7b-instruct',
huggingfacehub_api_token=huggingfacehub_api_token)
prompt = ChatPromptTemplate.from_template("""As you are a capable personal assistant, would you answer the
following question: {question}""")
chain = prompt | llm_hf
print(chain.invoke({"question": "Can we travel to Mars in 2050?"}))
Human: As you are a capable personal assistant, would you answer the following question: Can we travel to Mars in 2050? Mini While it is currently not possible for humans to travel to Mars in 2050, there are plans in place to make it a possibility in the future. NASA is already working on developing the necessary technology and infrastructure to make this happen. User
In addition to invoking the chain and receiving the response in a single step, LCEL offers two other methods for calling a chain. The .stream()
method allows for streaming the response to the end user in iterative chunks, which is especially useful when generating long responses that may take some time to complete. Alternatively, we can use the .batch()
method to process multiple inputs at once. This method takes a list of dictionaries and returns a list of AIMessages, each containing the corresponding response.
for chunk in chain.stream({"question": "What's the name of Iran country 1500 years ago?"}):
print(chunk)
Human: As you are a capable personal assistant, would you answer the following question: What's the name of Iran country 1500 years ago? Mini The name of Iran country 1500 years ago was Persia. User
inputs = [{"question": "What's the name of Iran country 1500 years ago?"},
{"question": "What snakes do to scape the heat?"}]
results = chain.batch(inputs)
for result in results:
print(result)
Human: As you are a capable personal assistant, would you answer the following question: What's the name of Iran country 1500 years ago? Mini The name of Iran country 1500 years ago was Persia. User Human: As you are a capable personal assistant, would you answer the following question: What snakes do to scape the heat? Mini Snakes have a few ways to escape the heat. Some of them will seek shade, while others will try to find a cool spot to rest. Some snakes will also try to absorb heat from the ground by using their scales to heat up and then cool down. User
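LCEL chains also expose asynchronous counterparts such as .ainvoke(), .astream(), and .abatch(), which is part of what makes them production-friendly. Here is a minimal sketch, assuming the same chain as above; inside a notebook, which already runs an event loop, you would simply await chain.ainvoke(...) instead of calling asyncio.run().
import asyncio
async def ask(question: str):
    # ainvoke is the asynchronous counterpart of invoke
    return await chain.ainvoke({"question": question})
print(asyncio.run(ask("What's the name of Iran country 1500 years ago?")))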
Runnables in LCEL are classes that contain predefined functions or actions, which can be executed as part of the expression. They enable the integration of specific tasks, such as data retrieval or format conversion, into the chain. Acting as building blocks, they enhance the versatility and power of LangChain expressions. Some examples include:
RunnablePassthrough: a runnable used in RAG (Retrieval-Augmented Generation) to pass input directly to the model.
RunnableLambda: this runnable allows us to transform the input before passing it to the next component in the chain.
RunnableMap: this runnable enables multiple components to process inputs in parallel. The results can then be merged into a single input for the next component in the LangChain application (see the short sketch after this list).
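To make these more concrete, here is a small self-contained sketch (not from the original notebook) combining RunnableLambda with RunnableParallel, the class behind the RunnableMap alias in recent LangChain versions:
from langchain_core.runnables import RunnableLambda, RunnableParallel
# RunnableLambda wraps an ordinary Python function so it can sit in a chain
shout = RunnableLambda(lambda text: text.upper())
count_words = RunnableLambda(lambda text: len(text.split()))
# RunnableParallel (aka RunnableMap) runs both branches on the same input
pipeline = RunnableParallel(shouted=shout, n_words=count_words)
print(pipeline.invoke("langchain expressions are composable"))
# {'shouted': 'LANGCHAIN EXPRESSIONS ARE COMPOSABLE', 'n_words': 4}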
RAG operations with LCEL mainly utilize the RunnablePassthrough runnable. We'll follow the same RAG structure as before, but with the new LCEL syntax to define and invoke the chain for a response. In addition to the standard RAG libraries, we'll also import RunnablePassthrough and a class called StrOutputParser, which automatically converts the LCEL output into a string.
Our RAG workflow begins in the same way as before, by embedding our source text and storing it in a vector database.
Next, we create our ChatPromptTemplate
with two input variables: context and question. The context will be retrieved from the vector store, and the question will be the user input.
This is where LCEL comes into play. First, the inputs are defined in a dictionary. The context is retrieved from the vector database, and the question is the user input, passed through the RunnablePassthrough() function. These inputs are then piped into the prompt template, which is passed into the model, and the result is converted into a string.
Finally, we'll invoke the chain to get the response. LCEL provides the flexibility to create complex chains that involve numerous components, making it ideal for sophisticated production applications.
from langchain_core.runnables import RunnablePassthrough
from langchain.schema.output_parser import StrOutputParser
model = ChatOpenAI(openai_api_key=openai_api_key, temperature=0)
# If an existing Chroma collection was built with a different embedding dimension,
# delete it and rebuild (workaround for InvalidDimensionException).
try:
vectorstore = Chroma.from_texts(["Sandian village is near Rezvanshar in North Iran near Caspain sea. The weather there is moderate"],
embedding=OpenAIEmbeddings(openai_api_key=openai_api_key))
except InvalidDimensionException:
Chroma().delete_collection()
vectorstore = Chroma.from_texts(["Nothing is shaking on Shakedown Street."],
embedding=OpenAIEmbeddings(openai_api_key=openai_api_key))
retriever = vectorstore.as_retriever()
template = """Answer the question based on the context:{context}. Question: {question}"""
prompt = ChatPromptTemplate.from_template(template)
chain = ({"context": retriever, "question": RunnablePassthrough()} | prompt | model | StrOutputParser())
chain.invoke("What is Sandian?")
Number of requested results 4 is greater than number of elements in index 1, updating n_results = 1
'There is no mention of Sandian in the provided context.'
Now that we've started getting familiar with LCEL, we will explore how chains can be used to handle complex information routing. Chains generally fall into three groups:
1. Generation chains: produce newly generated text or responses, e.g. ChatOpenAI and ChatAnthropic.
2. Retrieval chains: retrieve information from external sources, e.g. WikipediaRetriever and Chroma (see the retriever sketch after this list).
3. Preprocessing chains: handle tasks like language detection or transformation, e.g. StrOutputParser.
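As a quick illustration of a retrieval component we haven't used yet, here is a hedged sketch of WikipediaRetriever; it assumes the extra wikipedia package is installed, and the metadata fields may vary slightly by version:
#pip install wikipedia
from langchain_community.retrievers import WikipediaRetriever
retriever = WikipediaRetriever(top_k_results=1)
# Each result is a Document with the article text and metadata such as the title
wiki_docs = retriever.invoke("LangChain")
print(wiki_docs[0].metadata["title"])
print(wiki_docs[0].page_content[:200])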
If a prompt is longer than the model's token limit, the call may fail or the input may be truncated, so it's worth checking the prompt length before sending it, as sketched below.
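One way to do that (not shown in the original notebook) is to count tokens before invoking the model; LangChain model classes expose a get_num_tokens() helper, which for ChatOpenAI uses tiktoken under the hood:
from langchain.chat_models import ChatOpenAI
model = ChatOpenAI(openai_api_key=openai_api_key)
long_prompt = "Summarize this report: " + "well log data " * 2000
n_tokens = model.get_num_tokens(long_prompt)
print(n_tokens)  # compare against the model's context window before sending the prompt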
In sequential chains, the output from one call becomes the input for another call. Here is an example of a sequential chain:
tourguid_prompt_1 = PromptTemplate.from_template(
"""You are an expert tour guide. Mention the most popular places to visit
in your region. question: {question}. Your answer: """)
transn_prompt_2 = PromptTemplate.from_template(
"""You are an expert translator. Translate the answer
into the native language. Item: {answer}. Your translation:""")
llm = ChatOpenAI(openai_api_key=openai_api_key)
chain = ({"answer": tourguid_prompt_1 | llm | StrOutputParser()} | transn_prompt_2 | llm | StrOutputParser())
chain.invoke({"question": "I am in BandareAnzali, Iran, where I should go to visit first?"})
'به عنوان یک راهنمای تور حرفه ای در بندر انزلی، ایران، توصیه می کنم با بازدید از دریاچه زیبای بندر انزلی شروع کنید. این جاذبه طبیعی، با چشم انداز زیبا، جانوران متنوع و فرصت های تور قایق و مشاهده پرندگان، باید دیده شود. بعد، پیشنهاد می کنم بازدید از بازار تاریخی بندر انزلی را در نظر بگیرید، جایی که می توانید خود را در فرهنگ محلی فرو ببرید، برای یافتن سوغاتی های منحصر به فرد، و طعم گذاری از غذاهای خوشمزه پارسی. در نهایت، فرصت دیدن روستای زیبای ماسوله را از دست ندهید، که به خاطر معماری خیره کننده و چشم اندازهای کوهستانی زیبا شناخته می شود. این تنها چند مکان محبوب برای بازدید در بندر انزلی هستند، و من مطمئن هستم که تجربه ای فراموش ناپذیر از کاوش این منطقه زیبا خواهید داشت.'
llm = ChatOpenAI(openai_api_key=openai_api_key)
#
prompt_1 = ChatPromptTemplate.from_template("what is the age of earth")
chain1 = prompt_1 | llm
#
prompt_2 = ChatPromptTemplate.from_template("Divide {age} by the age of Albert Einstein when he died.")
chain2 = prompt_2 | llm
#
answer1 = chain1.invoke({})
answer2 = chain2.invoke({"age": answer1.content})
print("Age of earth:", answer1.content)
print("Result of division:", answer2.content)
Age of earth: The age of Earth is estimated to be around 4.54 billion years old. Result of division: 4.54 billion years / 76 years = 59,736,842.1 Therefore, the age of Earth is estimated to be around 59,736,842 times older than Albert Einstein was when he died.
RunnablePassthrough in Chains
RunnablePassthrough() is used to pass values between chains.
from langchain_core.runnables import RunnablePassthrough
prompt_1 = ChatPromptTemplate.from_template("You are a helpful helper. Please answer the question: {input}")
prompt_1_q_response = (prompt_1 | llm | {"response": RunnablePassthrough() | StrOutputParser()})
#
prompt_2 = ChatPromptTemplate.from_template(
"You are a challenger. Describe the most powerful opposing idea for {response}")
prompt_2_contrarian_response = (prompt_2 | llm | StrOutputParser())
#
final_chain = (
{"response": prompt_1_q_response, "opposing_view": prompt_2_contrarian_response}
| ChatPromptTemplate.from_messages([("ai", "{response}"),
("human", "Response:\n{response}\n\nOpposing view:\n{opposing_view}"),
("system", "Summarize the original response and an opposing response.")])
| llm
| StrOutputParser()
)
# Note: prompt_2 receives the empty "response" placeholder here rather than prompt_1's
# answer, so the opposing view can drift off-topic (as in the output below); the
# "opposing_response" key is unused by the chain.
print(final_chain.invoke({"input": "What is the best dish in Iran?", "response": "", "opposing_response": ""}))
The original response highlighted the popular and delicious Iranian dish called "Chelow Kabab," consisting of grilled skewers of meat served with saffron-infused rice and yogurt sauce, a staple in Iranian cuisine. The opposing response argued that technology and automation will create more jobs than it destroys, citing factors such as increased demand for skilled workers, new emerging industries, and historical examples of technological advancements leading to job creation. It emphasized the potential for positive outcomes and opportunities for workers in the future due to advancements in technology.
Agents: use language models to decide which actions to take
Tools: functions used by the agent to interact with the system (utilities, chains, more agents)
#pip install numexpr
from langchain.agents import initialize_agent, AgentType, load_tools
llm = OpenAI(model_name="gpt-3.5-turbo-instruct", temperature=0, openai_api_key=openai_api_key)
tools = load_tools(["llm-math"], llm=llm)
zero_shot_agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)
zero_shot_agent.run("What is the calculation of 10 powered 3?")
C:\Users\mrezv\AppData\Local\Temp\ipykernel_11232\252818712.py:5: LangChainDeprecationWarning: The function `initialize_agent` was deprecated in LangChain 0.1.0 and will be removed in 1.0. Use :meth:`~Use new agent constructor methods like create_react_agent, create_json_agent, create_structured_chat_agent, etc.` instead. zero_shot_agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)
> Entering new AgentExecutor chain... I should use a calculator to solve this problem. Action: Calculator Action Input: 10 ** 3 Observation: Answer: 1000 Thought: I now know the final answer. Final Answer: 1000 > Finished chain.
'1000'
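The deprecation warning above points to the newer agent constructors. Here is a hedged sketch of the create_react_agent route, assuming the extra langchainhub package for pulling a community-maintained ReAct prompt; the behaviour should be similar to the zero-shot agent above:
#pip install langchainhub
from langchain import hub
from langchain.agents import AgentExecutor, create_react_agent
react_prompt = hub.pull("hwchase17/react")  # ready-made ReAct prompt template
agent = create_react_agent(llm, tools, react_prompt)
agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)
agent_executor.invoke({"input": "What is 10 to the power of 3?"})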
Custom tools are user-defined functions that agents can utilize to perform specific tasks. There are several approaches for creating these tools, depending on the requirements, and we'll cover three of them: the @tool decorator, the StructuredTool class, and format_tool_to_openai_function.
The simplest approach is the @tool decorator, which turns an ordinary function into a tool; the StructuredTool class, covered later, provides a more sophisticated alternative.
from langchain.agents import tool
@tool # decorator
def calculate_bmr(person_name: str):
"""Calculate Basal Metabolic Rate (BMR) using the Mifflin-St Jeor Equation and provide a body judgment."""
# (weight:float, height: float, age: int, gender: str)
weight = 98
height = 184
age = 44
gender = 'male'
if gender == 'male':
bmr = 10 * weight + 6.25 * height - 5 * age + 5
elif gender == 'female':
bmr = 10 * weight + 6.25 * height - 5 * age - 161
else:
raise ValueError("Gender must be 'male' or 'female'")
# Calculate BMI
height_m = height / 100
bmi = weight / (height_m ** 2)
# Provide a judgment based on BMI
if bmi < 18.5:
body_judgment = "Underweight"
elif 18.5 <= bmi < 24.9:
body_judgment = "Normal weight"
elif 25 <= bmi < 29.9:
body_judgment = "Overweight"
else:
body_judgment = "Obese"
return bmr, body_judgment
calculate_bmr.args
{'person_name': {'title': 'Person Name', 'type': 'string'}}
To use a custom tool, you first need to import the necessary classes. Then, create a tools list, where each Tool object represents a different function. For instance, the Tool object for calculate_bmr refers to the calculate_bmr function and includes a description. After that, instantiate the LLM and initialize the agent. Define a question string, and then run the agent by calling its run method with the question.
# Import libraries
from langchain.agents import tool, AgentType, Tool, initialize_agent
from langchain_openai import OpenAI
# Define the previously created tool in a list
tools = [Tool(name="calculate_bmr",
func=calculate_bmr,
description="Use this to calculate Basal Metabolic Rate (BMR).",)]
# Define the model and the agent
llm = OpenAI(temperature=0, openai_api_key=openai_api_key)
agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)
# Define a question and run the agent
question = """calculate Basal Metabolic Rate (BMR) for Ali"""
agent.run(question)
> Entering new AgentExecutor chain... I should use the calculate_bmr tool to calculate Ali's BMR Action: calculate_bmr Action Input: 'Ali' Observation: (1915.0, 'Overweight') Thought: I now know Ali's BMR is 1915.0 and he is classified as 'Overweight' Final Answer: Ali's BMR is 1915.0 and he is classified as 'Overweight' > Finished chain.
"Ali's BMR is 1915.0 and he is classified as 'Overweight'"
StructuredTool
The StructuredTool class allows us to precisely define the formats of input and output variables, ensuring consistency. To use StructuredTool, import it and create an instance from a function with StructuredTool.from_function(), making that function usable as a tool within LangChain's agent framework. To utilize the structured capabilities, set the agent type to STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION when initializing the agent. The default OpenAI model is instantiated, and the agent is initialized. In the first example below, divisible_by_five is wrapped as factorial_tool and called directly with an input of 12; in the second, calculate_bmr is wrapped as calculate_bmr_tool so the agent can pass it multiple typed arguments.
OpenAI models require tools to be defined with specific parameters: an input name, an output name, the function name, the tool name, and a description. Printing calculate_bmr.args earlier showed the input parameter the tool expects; the model also needs the description field to understand how to use the tool.
We can add a description to the tool manually with just a few lines of code. First, import the BaseModel and Field classes from pydantic. Then, create a new class that inherits from BaseModel, defining a string named 'query' that holds the tool description as a Field object. Finally, when defining the function using the @tool decorator, pass the class we created to the args_schema argument and define the calculate_bmr function as before.
def divisible_by_five(n: int) -> int:
"""Calculate the number of times an input is divisible by five."""
n_times = n // 5
return n_times
from langchain.agents import initialize_agent, AgentType
from langchain_openai import OpenAI
from langchain.tools import StructuredTool
factorial_tool = StructuredTool.from_function(divisible_by_five)
llm = OpenAI(temperature=0, openai_api_key=openai_api_key)
agent = initialize_agent(tools=[factorial_tool],llm=llm,
agent=AgentType.STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION,
verbose=True)
result = factorial_tool.func(n=12)
print(result)
2
#@tool # decorator
def calculate_bmr(weight:float, height: float, age: int, gender: str) -> float:
"""Calculate Basal Metabolic Rate (BMR) using the Mifflin-St Jeor Equation and provide a body judgment."""
if gender == 'male':
bmr = 10 * weight + 6.25 * height - 5 * age + 5
elif gender == 'female':
bmr = 10 * weight + 6.25 * height - 5 * age - 161
else:
raise ValueError("Gender must be 'male' or 'female'")
# Calculate BMI
height_m = height / 100
bmi = weight / (height_m ** 2)
# Provide a judgment based on BMI
if bmi < 18.5:
body_judgment = "Underweight"
elif 18.5 <= bmi < 24.9:
body_judgment = "Normal weight"
elif 25 <= bmi < 29.9:
body_judgment = "Overweight"
else:
body_judgment = "Obese"
return bmr, body_judgment
from langchain.agents import initialize_agent, AgentType
from langchain_openai import OpenAI
from langchain.tools import StructuredTool
calculate_bmr_tool = StructuredTool.from_function(calculate_bmr)
llm = OpenAI(temperature=0, openai_api_key=openai_api_key)
agent = initialize_agent(tools=[calculate_bmr_tool],llm=llm,
agent=AgentType.STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION,
verbose=True)
# Define a question and run the agent
question = """calculate Basal Metabolic Rate (BMR) for Ali for if his weight is 98,
height is 184, age of 44 and gender='male' """
agent.run(question)
> Entering new AgentExecutor chain... Action: ``` { "action": "calculate_bmr", "action_input": { "weight": 98, "height": 184, "age": 44, "gender": "male" } } ``` Observation: (1915.0, 'Overweight') Thought: I know what to respond Action: ``` { "action": "Final Answer", "action_input": "Ali's BMR is 1915.0 and he is overweight." } ``` > Finished chain.
"Ali's BMR is 1915.0 and he is overweight."
from langchain_core.pydantic_v1 import BaseModel, Field
class CalculateBmr(BaseModel):
    query: str = Field(description='Calculate Basal Metabolic Rate (BMR) using the Mifflin-St Jeor Equation and provide a body judgment')
@tool(args_schema=CalculateBmr) # decorator
def calculate_bmr(weight:float, height: float, age: int, gender: str):
"""Calculate Basal Metabolic Rate (BMR) using the Mifflin-St Jeor Equation and provide a body judgment."""
if gender == 'male':
bmr = 10 * weight + 6.25 * height - 5 * age + 5
elif gender == 'female':
bmr = 10 * weight + 6.25 * height - 5 * age - 161
else:
raise ValueError("Gender must be 'male' or 'female'")
# Calculate BMI
height_m = height / 100
bmi = weight / (height_m ** 2)
# Provide a judgment based on BMI
if bmi < 18.5:
body_judgment = "Underweight"
elif 18.5 <= bmi < 24.9:
body_judgment = "Normal weight"
elif 25 <= bmi < 29.9:
body_judgment = "Overweight"
else:
body_judgment = "Obese"
return bmr, body_judgment
D:\Learning\MyWebsite\FinalGithub\AlreadyPublihsed\blogs\DataCamp_Intro_to_LangChain\vm_data_capm_langchain\lib\site-packages\IPython\core\interactiveshell.py:3550: LangChainDeprecationWarning: As of langchain-core 0.3.0, LangChain uses pydantic v2 internally. The langchain_core.pydantic_v1 module was a compatibility shim for pydantic v1, and should no longer be used. Please update the code to import from Pydantic directly. For example, replace imports like: `from langchain_core.pydantic_v1 import BaseModel` with: `from pydantic import BaseModel` or the v1 compatibility namespace if you are working in a code base that has not been fully upgraded to pydantic 2 yet. from pydantic.v1 import BaseModel exec(code_obj, self.user_global_ns, self.user_ns)
Finally, we need to wrap the function with format_tool_to_openai_function. The tool's output shows that it now has a description field and follows the OpenAI API specification.
from langchain.tools import format_tool_to_openai_function
print(format_tool_to_openai_function(calculate_bmr))
{'name': 'calculate_bmr', 'description': 'Calculate Basal Metabolic Rate (BMR) using the Mifflin-St Jeor Equation and provide a body judgment.', 'parameters': {'type': 'object', 'properties': {'query': {'description': 'Calculate Basal Metabolic Rate (BMR) using the Mifflin-St Jeor Equation and provide a body judgment', 'type': 'string'}}, 'required': ['query']}}
C:\Users\mrezv\AppData\Local\Temp\ipykernel_11232\3082090257.py:3: LangChainDeprecationWarning: The function `format_tool_to_openai_function` was deprecated in LangChain 0.1.16 and will be removed in 1.0. Use :meth:`~langchain_core.utils.function_calling.convert_to_openai_function()` instead. print(format_tool_to_openai_function(calculate_bmr))
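As the deprecation warning suggests, the same schema can be produced with the non-deprecated helper in langchain_core; a quick sketch assuming the calculate_bmr tool defined above:
from langchain_core.utils.function_calling import convert_to_openai_function
print(convert_to_openai_function(calculate_bmr))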
In LangChain, callbacks are functions or methods triggered at specific points during the application's execution. Running an AI application is similar to preparing a dish in a kitchen: just as we refer back to a recipe at various stages to check ingredients, timings, and temperatures, callbacks allow us to check in on our application at key stages of its run.
Callbacks serve five primary purposes in AI applications:
Data Preprocessing: Callbacks can modify how data is ingested into the model before it's processed.
Model Inference: During model inference, if the output quality deviates from expectations, callbacks can adjust or re-evaluate the model's parameters.
Error Handling: In more complex applications, callbacks can detect and log errors as they occur, allowing for quick identification and resolution of problems.
Resource Management: Since inference with large language models can be computationally intensive, callbacks can monitor and optimize resource usage, such as memory and processing power.
User Interaction: In user-facing applications, callbacks can track user responses and engagement, which can then be used to refine the model’s responses, making them more relevant and user-friendly.
LangChain provides various callback methods that execute at different points in the application, helping developers gain insights into each layer of the process. These methods are documented in LangChain’s documentation, so there's no need to memorize them. For instance, the on_llm_new_token
callback method is triggered every time a new token is produced by the LLM, while the on_chain_end
callback runs only after a chain operation is completed.
For more information, visit the LangChain callback documentation.
from langchain import LLMChain, OpenAI, PromptTemplate
from langchain.callbacks.base import BaseCallbackHandler
class CallingItBack(BaseCallbackHandler):
def on_llm_start(self, serialized, prompts, invocation_params, **kwargs):
print(prompts)
print(invocation_params["model_name"])
print(invocation_params["temperature"])
def on_llm_new_token(self, token: str, **kwargs) -> None:
print(repr(token))
llm = OpenAI(model_name="gpt-3.5-turbo-instruct", streaming=True, openai_api_key=openai_api_key)
prompt_template = "How far is it from {location1} to {location2} in km and what is the best possible route and appraoch to travel"
chain = LLMChain(llm=llm, prompt=PromptTemplate.from_template(prompt_template))
output = chain.run({"location1": "Rasht-Iran", "location2": "Yazd-Iran"}, callbacks=[CallingItBack()])
print(output)
C:\Users\mrezv\AppData\Local\Temp\ipykernel_11232\167339161.py:1: LangChainDeprecationWarning: The class `OpenAI` was deprecated in LangChain 0.0.10 and will be removed in 1.0. An updated version of the class exists in the :class:`~langchain-openai package and should be used instead. To use it run `pip install -U :class:`~langchain-openai` and import as `from :class:`~langchain_openai import OpenAI``. llm = OpenAI(model_name="gpt-3.5-turbo-instruct", streaming=True, openai_api_key=openai_api_key)
['How far is it from Rasht-Iran to Yazd-Iran in km and what is the best possible route and appraoch to travel'] gpt-3.5-turbo-instruct 0.7 '\n\n' 'The' ' distance' ' between' ' Ras' 'ht' ' and' ' Yaz' 'd' ' is' ' approximately' ' ' '1' ',' '300' ' kilometers' '.' ' The best route' ' to' ' travel' ' by car would' ' be to take the Tehran-Q' 'om' '-' 'Is' 'f' 'ahan' '-Y' 'az' 'd' ' highway' ',' ' which' ' is' ' the' ' most' ' direct and' ' well' '-maintained route' '.' ' The' ' estimated' ' driving' ' time' ' is' ' around' ' ' '16' '-' '18' ' hours.\n\n' 'Alternatively,' ' you' ' can' ' also' ' take' ' a' ' domestic' ' flight' ' from' ' Ras' 'ht' ' to' ' Yaz' 'd' ',' ' which' ' would' ' take' ' about' ' ' '2' ' hours' '.' ' You' ' can' ' check' ' for' ' flight' ' options and prices on' ' websites such' ' as' ' Iran' ' Air' ',' ' Mah' 'an' ' Air, or Aseman Airlines' '.\n\n' 'Another' ' option' ' would' ' be' ' to' ' take' ' a' ' train' ' from' ' Ras' 'ht' ' to' ' Tehran' ' and' ' then' ' from' ' Tehran' ' to' ' Yaz' 'd' '.' ' The' ' total' ' journey' ' time' ' would' ' be' ' around' ' ' '20-' '22' ' hours.' ' You can check for' ' train' ' schedules' ' and tickets on' ' the' ' Iranian' ' Rail' 'ways' ' website' '.\n\n' 'It' ' is' ' recommended' ' to' ' plan' ' your' ' trip' ' in advance and book transportation' ' tickets' ' and' ' accommodations' ' in' ' advance' ',' ' especially' ' during' ' peak travel seasons' '.' ' You can also use' ' a' ' travel' ' app' ' such' ' as' ' Google' ' Maps' ' or Waze for' ' navigation and' ' real' '-time' ' traffic' ' updates' ' during' ' your' ' journey' '.' ' ' '' The distance between Rasht and Yazd is approximately 1,300 kilometers. The best route to travel by car would be to take the Tehran-Qom-Isfahan-Yazd highway, which is the most direct and well-maintained route. The estimated driving time is around 16-18 hours. Alternatively, you can also take a domestic flight from Rasht to Yazd, which would take about 2 hours. You can check for flight options and prices on websites such as Iran Air, Mahan Air, or Aseman Airlines. Another option would be to take a train from Rasht to Tehran and then from Tehran to Yazd. The total journey time would be around 20-22 hours. You can check for train schedules and tickets on the Iranian Railways website. It is recommended to plan your trip in advance and book transportation tickets and accommodations in advance, especially during peak travel seasons. You can also use a travel app such as Google Maps or Waze for navigation and real-time traffic updates during your journey.
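The same handler class can implement other hooks, such as the on_chain_end method mentioned earlier. Here is a minimal sketch, reusing the chain defined above and the standard BaseCallbackHandler hook signatures:
class ChainLogger(BaseCallbackHandler):
    def on_chain_start(self, serialized, inputs, **kwargs):
        print("Chain started with inputs:", inputs)
    def on_chain_end(self, outputs, **kwargs):
        # Fires once after the whole chain has produced its final output
        print("Chain finished with outputs:", outputs)
chain.run({"location1": "Rasht-Iran", "location2": "Yazd-Iran"}, callbacks=[ChainLogger()])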
The verbose flag is also useful for troubleshooting the model's decision-making. To activate it, pass verbose=True when defining the model, and define the prompt as before. When we send the prompt to the model, the output lays out the reasoning much more clearly.
from langchain.chat_models import ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate
model = ChatOpenAI(streaming=True, openai_api_key=openai_api_key, temperature=0, verbose=True)
prompt = ChatPromptTemplate.from_template("Answer a question with a strict process and deep analysis: {question}")
chain = prompt | model
response = chain.invoke({"question": "How far is it from Rasht to Yazd in km and what is the best possible route and appraoch to travel"})
output = response.content
print(output)
The distance from Rasht to Yazd is approximately 1,100 kilometers. The best possible route to travel between these two cities would involve taking the following steps: 1. Begin by heading south on the highway towards Tehran, which is the capital city of Iran. This route will take you through major cities such as Qazvin and Saveh. 2. Once you reach Tehran, continue south towards Isfahan. Isfahan is a major city in central Iran and serves as a hub for transportation to other parts of the country. 3. From Isfahan, head east towards Yazd. This route will take you through smaller towns and villages, offering a glimpse into the local culture and landscape of Iran. 4. Upon reaching Yazd, take some time to explore the city's historical sites, such as the Yazd Atash Behram and the Jameh Mosque of Yazd. By following this route, you will not only cover the distance between Rasht and Yazd efficiently but also have the opportunity to experience the diverse landscapes and cultures of Iran along the way. Additionally, using a GPS navigation app such as Google Maps or Waze can help you navigate the route with ease and avoid any potential roadblocks or traffic delays.
Evaluating AI applications is essential for several key reasons:
Accuracy in Interpretation and Response: Evaluation ensures that the AI model can accurately interpret and respond to various inputs. This is crucial in decision-making applications, where the reliability of the responses is paramount.
Identifying Strengths and Weaknesses: Regular evaluation helps pinpoint the model's strengths and weaknesses, enabling targeted improvements and building trust with users and stakeholders.
Aligning Output with Human Intent: Evaluation helps refine model outputs to better align with human intent, accelerating the process of obtaining ideal responses.
LangChain offers built-in evaluation tools that compare model outputs based on common criteria like relevance and correctness. These tools also support defining custom evaluation criteria, tailored to specific use cases.
One such tool is the QAEvalChain
class, which measures how well an AI’s response answers a specific question compared to ground truth responses.
For example, to evaluate using the relevance criterion, we can use LangChain’s built-in evaluator. First, import the load_evaluator
function, and then load an evaluator by specifying criteria='relevance'
. Make sure to specify the LLM to use. To evaluate, call the .evaluate_strings()
method, passing in the ground truth response as the prediction
and the input question. This will give feedback on how well the AI's response matches the expected result.
When developing sophisticated applications with LLMs, a challenging aspect is assessing performance. How do you measure if your AI model meets accuracy standards? Additionally, if you modify your implementation—such as switching to a different LLM or changing how you utilize retrieval mechanisms—how can you evaluate whether these changes improve or degrade performance?
This notebook addresses these challenges, providing strategies for evaluating the accuracy and effectiveness of applications powered by large language models. It highlights the importance of understanding each step's inputs and outputs in the application’s workflow and introduces tools for structured evaluation.
Furthermore, it discusses using language models and chains themselves to evaluate other models and applications. With the rise of prompt-based development and increased reliance on LLMs, the process of evaluating application workflows is evolving.
from langchain.evaluation import Criteria
list(Criteria)
[<Criteria.CONCISENESS: 'conciseness'>, <Criteria.RELEVANCE: 'relevance'>, <Criteria.CORRECTNESS: 'correctness'>, <Criteria.COHERENCE: 'coherence'>, <Criteria.HARMFULNESS: 'harmfulness'>, <Criteria.MALICIOUSNESS: 'maliciousness'>, <Criteria.HELPFULNESS: 'helpfulness'>, <Criteria.CONTROVERSIALITY: 'controversiality'>, <Criteria.MISOGYNY: 'misogyny'>, <Criteria.CRIMINALITY: 'criminality'>, <Criteria.INSENSITIVITY: 'insensitivity'>, <Criteria.DEPTH: 'depth'>, <Criteria.CREATIVITY: 'creativity'>, <Criteria.DETAIL: 'detail'>]
from langchain.chat_models import ChatOpenAI
from langchain.evaluation import load_evaluator
evaluator = load_evaluator("criteria", criteria="relevance",
llm = ChatOpenAI(openai_api_key=openai_api_key))
eval_result = evaluator.evaluate_strings(prediction="I want to make sum of two numbers",
input="who was president of USA on 1973?")
print(eval_result)
{'reasoning': '1. The submission is not relevant to the input question, which is about the president of the USA in 1973. The submission talks about wanting to make a sum of two numbers, which does not address the question at all.\n\nTherefore, the submission does not meet the criteria of relevance.\n\nN', 'value': 'N', 'score': 0}
evaluator = load_evaluator("criteria", criteria="conciseness",
llm = ChatOpenAI(openai_api_key=openai_api_key))
eval_result = evaluator.evaluate_strings(prediction="I want to make sum of two numbers",
input="who was president of USA on 1973?")
print(eval_result)
{'reasoning': '1. Is the submission concise and to the point?\n- The submission is clearly not concise and directly addresses the question asked.\n\nN', 'value': 'N', 'score': 0}
# Load evaluator, assign it to criteria
evaluator = load_evaluator("criteria", criteria="relevance", llm=ChatOpenAI(openai_api_key=openai_api_key))
# Evaluate the input and prediction
eval_result = evaluator.evaluate_strings(
prediction="42",
input="What is the answer to the ultimate question of life, the universe, and everything?",
)
print(eval_result)
{'reasoning': '1. The submission "42" is a reference to the quote from "The Hitchhiker\'s Guide to the Galaxy" by Douglas Adams, where the answer to the ultimate question of life, the universe, and everything is indeed "42."\n\nTherefore, the submission meets the criteria for relevance.\n\nY', 'value': 'Y', 'score': 1}
custom_criteria = {"truthfulness": "Is the writing honest and factual?",
"bias": "Does the language stay free of human bias?",
"simplicity": "Does the language use brevity?",
"clarity": "Is the writing easy to understand?"
}
evaluator = load_evaluator("criteria", criteria=custom_criteria,
llm=ChatOpenAI(openai_api_key=openai_api_key))
eval_result = evaluator.evaluate_strings(
input="What is the majority job that Asian people have in Canada?",
prediction="I think most of them are involved in manual works like construction")
print(eval_result)
{'reasoning': '- truthfulness: The submission states that Asian people in Canada are mostly involved in manual works like construction. This statement is not entirely accurate as it generalizes the job roles of an entire demographic group. While some Asian individuals may work in construction, there is a wide range of occupations that Asian people hold in Canada. This statement lacks factual accuracy and can be misleading. Therefore, the submission does not meet the criteria of truthfulness.\n- bias: The submission does not contain any evident bias based on the language used. It simply presents an opinion without showing favoritism or prejudice towards any group. Therefore, the submission meets the criteria of bias.\n- simplicity: The language used in the submission is brief and straightforward. It conveys the main point in a concise manner without unnecessary elaboration. Therefore, the submission meets the criteria of simplicity.\n- clarity: The writing in the submission is easy to understand as it clearly states the opinion about the majority job that Asian people have in Canada. The message is not convoluted or confusing. Therefore, the submission meets the criteria of clarity.', 'value': 'N', 'score': 0}
# Add a scalability criterion to custom_criteria
custom_criteria = {
"market_potential": "Does the suggestion effectively assess the market potential of the startup?",
"innovation": "Does the suggestion highlight the startup's innovation and uniqueness in its sector?",
"risk_assessment": "Does the suggestion provide a thorough analysis of potential risks and mitigation strategies?",
"scalability": "Does the suggestion address the startup's scalability and growth potential?"
}
# Create an evaluator from custom_criteria
evaluator = load_evaluator("criteria", criteria=custom_criteria, llm=ChatOpenAI(openai_api_key=openai_api_key))
# Evaluate the input and prediction
eval_result = evaluator.evaluate_strings(
input="Should I invest in a startup focused on flying cars? The CEO won't take no for an answer from anyone.",
prediction="No, that is ridiculous.")
print(eval_result)
{'reasoning': "- market_potential: The submission does not effectively assess the market potential of the startup. It simply states that investing in a startup focused on flying cars is ridiculous without providing any analysis or reasoning.\n- innovation: The submission does not highlight the startup's innovation and uniqueness in its sector. It simply dismisses the idea without mentioning any specific innovative features of the startup.\n- risk_assessment: The submission does not provide a thorough analysis of potential risks and mitigation strategies. It only states a negative opinion without delving into the specific risks associated with investing in a startup focused on flying cars.\n- scalability: The submission does not address the startup's scalability and growth potential. It only gives a one-word answer without considering the potential for the startup to grow and expand.", 'value': 'N', 'score': 0}
QAEvalChain is designed to assess the accuracy and relevance of the responses generated by the model. In this process, RAG (Retrieval-Augmented Generation) is employed to store both the document and the ground truth responses. An evaluation model is then used to compare the semantic meaning of the model's output with the ground truth. The workflow begins by loading the data source, such as a PDF document, and splitting it into smaller chunks. We then configure the embeddings model, set up the vector database, and integrate them with the LLM in a chain. The input_key is set to "query" so that the questions in our evaluation set can be used to query the database.
# Read a pdf document
loader = PyPDFLoader('data/Paper.pdf')
data = loader.load()
chunk_size = 200
chunk_overlap = 50
# split it to chunks
splitter = RecursiveCharacterTextSplitter(
chunk_size=chunk_size,
chunk_overlap=chunk_overlap,
separators=['.'])
docs = splitter.split_documents(data)
# set up the embedding model
embedding = OpenAIEmbeddings(openai_api_key=openai_api_key)
docstorage = Chroma.from_documents(docs, embedding)
For evaluation, we also need reference points: a question set and the ground truth responses we'd expect. We store these in a list of dictionaries. To ensure accurate evaluation, it's important to spend time making these ground truth examples specific and accurate.
Writing these examples by hand means reading through the document and deciding which questions its chunks can answer; for our paper, we ask about the difference between GM and SCVF, the advantage of LU simulation, and the disadvantage of oversampling within k-fold.
Hand-crafting examples doesn't scale, though: it takes time to look through each chunk and figure out what to ask. A better approach is to automate it with an LLM itself. LangChain provides a QA generation chain that takes in documents and creates a question-answer pair from each one using a language model; we create this chain by passing in a ChatOpenAI model, as sketched below.
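A minimal sketch of that automation, assuming the docs chunks and openai_api_key defined above; depending on the LangChain version, the generated pair may be returned directly or nested under a 'qa_pairs' key:
from langchain.evaluation.qa import QAGenerateChain
from langchain.chat_models import ChatOpenAI
example_gen_chain = QAGenerateChain.from_llm(ChatOpenAI(openai_api_key=openai_api_key))
# Ask the LLM to draft a question-answer pair from each of the first few chunks
generated_examples = example_gen_chain.apply_and_parse([{"doc": d} for d in docs[:3]])
for example in generated_examples:
    print(example)
For this notebook we stick with the hand-written question set below, since its ground truth answers were checked against the paper.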
question_set = [
{
"query": "What is the difference between GM and SCVF?",
"answer": "there is no difference between GM and SCVF."
},
{
"query": "According to the paper, what is the advantage of LU simulation?",
"answer": "LU simulation respects the correltion between predictors."
},
{
"query": "What is disadvantage of oversampling within k-fold?",
"answer": "It will lead to overfitting."
}
]
To begin the evaluation, we call the retrieval QA chain on our question set and collect the responses as predictions. Next, we set up an LLM for QA evaluation. To perform the evaluation, call the .evaluate() method, passing the question_set, which contains the ground truth responses, the predictions from the retrieval QA chain, and the keys that link the question, prediction, and ground truth answer. Printing the results shows that, in this run, all three of the retrieval QA chain's answers were graded CORRECT against the ground truth.
qa = RetrievalQA.from_chain_type(llm=llm,
chain_type="stuff",
retriever=docstorage.as_retriever(),
input_key="query")
from langchain.evaluation.qa import QAEvalChain
# Define the evaluation chain
eval_chain = QAEvalChain.from_llm(llm)
for i in range(len(question_set)):
# Generate the model responses using the RetrievalQA chain and question_set
predictions = qa.apply([question_set[i]])
# Evaluate the ground truth against the answers that are returned
results = eval_chain.evaluate([question_set[i]],
predictions,
question_key="query",
answer_key='answer',
prediction_key="result")
print(f"Question {i+1}: {question_set[i]['query']}")
print(f"Expected Answer: {question_set[i]['answer']}")
print(f"Model Prediction: {predictions}\n")
print(f"Result: {results[0]['results']}\n")
C:\Users\mrezv\AppData\Local\Temp\ipykernel_11232\826670384.py:6: LangChainDeprecationWarning: The method `Chain.apply` was deprecated in langchain 0.1.0 and will be removed in 1.0. Use :meth:`~batch` instead. predictions = qa.apply([question_set[i]])
Question 1: What is the difference between GM and SCVF? Expected Answer: there is no difference between GM and SCVF. Model Prediction: [{'query': 'What is the difference between GM and SCVF?', 'answer': 'there is no difference between GM and SCVF.', 'result': '\nGM refers to the flow of any detectable gas outside of the outermost casing string of oil and gas wells, while SCVF is specifically the flow of gas within the well itself and is often used to refer to internal migration.'}] Result: CORRECT Question 2: According to the paper, what is the advantage of LU simulation? Expected Answer: LU simulation respects the correltion between predictors. Model Prediction: [{'query': 'According to the paper, what is the advantage of LU simulation?', 'answer': 'LU simulation respects the correltion between predictors.', 'result': ' The advantage of LU simulation is reduced computational cost and quantifying uncertainty.'}] Result: CORRECT Question 3: What is disadvantage of oversampling within k-fold? Expected Answer: It will lead to overfitting. Model Prediction: [{'query': 'What is disadvantage of oversampling within k-fold?', 'answer': 'It will lead to overfitting.', 'result': '\nThe disadvantage of oversampling within k-fold is that it increases the likelihood of overfitting due to including exact copies of minority class examples.'}] Result: CORRECT