activeloopai / deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Home Page:https://activeloop.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

[BUG] ds.visualize not working in jupyter notebook for local dataset

journey-wang opened this issue · comments

Severity

P1 - Urgent, but non-breaking

Current Behavior

Hello everyone, I tried ds.visualize with dataset like 'hub://activeloop/animal10n-train', it worked in jupyter notebook. But with local dataset like './animal10n-train', ds.visualized showed nothing.

Steps to Reproduce

`import deeplake

it worked with remote dataset

dataset_path = 'hub://activeloop/animal10n-train'
ds = deeplake.load(dataset_path) # Returns a Deep Lake Dataset but does not download data locally
ds.summary()
ds.visualize()
`

image

`

copy to local

deeplake.copy('hub://activeloop/animal10n-train', './animal10n-train', num_workers=10)

it not worked

dataset_path = './animal10n-train'
ds = deeplake.load(dataset_path) # Returns a Deep Lake Dataset but does not download data locally
ds.summary()
ds.visualize()
`

image

Expected/Desired Behavior

ds.visualize worked with local dataset

Python Version

python3.10

OS

Ubuntu 18.04

IDE

Jupyter

Packages

deeplake 3.7.1

Additional Context

No response

Possible Solution

No response

Are you willing to submit a PR?

  • I'm willing to submit a PR (Thank you!)

hi @journey-wang ! thanks for reporting this. We will investigate and get back to you if this is an expected behavior.

hi @journey-wang is your environment is clean ubuntu or you use WSL ?

hi @journey-wang is your environment is clean ubuntu or you use WSL ?

Hi @zapredelom I ran it in the docker environment, like docker run -it --name ubuntu --privileged --net host [image name]

@journey-wang but what is the operating system of your host machine ? is it ubuntu , mac or windows wsl ?

@zapredelom one more report with a more detailed info on os - #2663 (comment). Feel free to ask more questions!

commented

Hello, I also encountered the same problem. Now the dataset is stored locally, and then I create an iframe using the URL below, but no image is displayed.

image

Our public network port 38885 is open, and I can get response from Internet using the link below.
curl -H "Range: bytes=100-20000" http://ip:38885/7a1d36b8-d48d-470e-bce7-51a3b3462711/dataset_meta.json

image

When I check the request call chain from chrome, I find that some calls failed.

image

image

Is there a way to display the local dataset in the local Jupyter notebook?

Python Version
python3.10

OS
Ubuntu 18.04

IDE
Jupyter

Packages
deeplake 3.8.3

Hi there @fan-chao. Thanks for in detail message. We've addressed it here #2663 (comment).

Please note that local dataset visualization in notebook may be considered for inclusion in the list of enterprise features. Would you be interested in it in that case?

commented

Hi there @fan-chao. Thanks for in detail message. We've addressed it here #2663 (comment).

Please note that local dataset visualization in notebook may be considered for inclusion in the list of enterprise features. Would you be interested in it in that case?

If visualizing local datasets in a notebook can enhance our work efficiency and help us better understand the data, I would be very interested.

Happy to talk a bit more about your use case to see how we can be helpful! Do you mind booking a quick introductory call via cal.com/db?