pinecone-io / examples

Jupyter Notebooks to help you get hands-on with Pinecone vector databases

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Error with Langchain demo notebook

Tylersuard opened this issue · comments

commented

Is this a new bug?

  • I believe this is a new bug
  • I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

When running this notebook: https://colab.research.google.com/github/pinecone-io/examples/blob/master/generation/langchain/handbook/05-langchain-retrieval-augmentation.ipynb

from datasets import load_dataset
causes this error:


AttributeError Traceback (most recent call last)

in <cell line: 1>()
----> 1 from datasets import load_dataset
2
3 data = load_dataset("wikipedia", "20220301.simple", split='train[:10000]')
4 data

8 frames

/usr/local/lib/python3.10/dist-packages/multiprocess/dummy/init.py in
85 #
86
---> 87 class Condition(threading._Condition):
88 # XXX
89 if sys.version_info < (3, 0):

AttributeError: module 'threading' has no attribute '_Condition'

Expected Behavior

It should import load_dataset

Steps To Reproduce

Open this notebook and run all:
https://colab.research.google.com/github/pinecone-io/examples/blob/master/generation/langchain/handbook/05-langchain-retrieval-augmentation.ipynb

Relevant log output

No response

Environment

Colab notebook

Additional Context

No response

The following steps solve the problem for me:
!pip install --upgrade datasets multiprocess

!pip install --upgrade protobuf apache-beam

you may need to restart the runtime after each of these executions

Thanks @tytung2020 .
And !pip install -qU "pinecone-client[grpc]"==2.2.1 didn't give me issues like xx2.22.
And...changed this

index = pinecone.GRPCIndex(index_name)

index.describe_index_stats()

To:

import time

index = pinecone.GRPCIndex(index_name)
# wait a moment for the index to be fully initialized
time.sleep(1)

index.describe_index_stats()

I'm simply replacing some code snippets in this notebook for what's working here

This Poetry setup has worked for me using Python 3.9 :

pyproject.toml

[tool.poetry]
name = "aiexplore"
version = "0.1.0"
description = "Exploring OpenAI"
readme = "README.md"

[tool.poetry.dependencies]
python = "~3.9.17"
openai = "^0.27.7"
dill = "~0.3.1.1"
multiprocess = "<0.70.9"
datasets = "2.12.0"
protobuf = "3.19.3"
apache-beam = "~2.46.0"
pinecone-client = { version = "2.2.1", extras = ["grpc"] }
mwparserfromhell = "^0.6.4"
langchain = "^0.0.162"
tiktoken = "0.4.0"

[tool.poetry.group.dev.dependencies]
ipykernel = "^6.25.0"
pipdeptree = "^2.12.0"

[build-system]
requires = ["poetry-core"]
build-backend = "poetry.core.masonry.api"

Then:

poetry install