unzip error when using glove embeddings from torchtext
lasgel opened this issue Β· comments
π Bug
Describe the bug A clear and concise description of what the bug is.
When using GloVe from torchtext.vocab an error occurs saying that
zipfile.BadZipFile: File is not a zip file
To Reproduce Steps to reproduce the behavior:
- Go to a directory where torchtext has not been used (meaning that there is no .vector_cache)
- from torchtext.vocab import GloVe
- glove = GloVe(name='6B', dim=50) or use any other valid combination of name and dim
- See error
Expected behavior The server at Stanford university from where it is downloaded is down till 3rd of July, so instead of trying to unzip a zip file where just a 404-page is stored (and is no zip archive either) one would expect to get a message that the download could not be completed
Environment
torchtext
Please copy and paste the output from our
environment collection script (or
fill out the checklist below manually).
You can get the script and run it with:
wget https://raw.githubusercontent.com/pytorch/pytorch/master/torch/utils/collect_env.py
# For security purposes, please check the contents of collect_env.py before running it.
python collect_env.py
python -c "import torchtext; print(\"torchtext version is \", torchtext.__version__)"
- PyTorch Version (e.g., 2.0.1):
- OS: Linux
- How you installed PyTorch: pip
- Python version: 3.11
Additional context Add any other context about the problem here.