BAAI-WuDao / WuDaoMM

WuDaoMM: this is a data project

Hello, the image links in Flower.json and Landscape.json appear to be broken

trouble-maker007 opened this issue

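I am downloading the WuDaoMM dataset with img2dataset and found that a large number of image links in Flower.json, like the ones below, cannot be downloaded. Have these links expired?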

    {
        "name": "5d532c4a9cd24fbf1653ed3486c99244.jpg",
        "tag": "花卉",
        "url": "http://img5.iplant.cn/image2/b/1871135.jpg",
        "captions": "秘鲁天轮柱属"
    },
    {
        "name": "dc12ee1d49998f2f58ed4b6932a8ce90.jpg",
        "tag": "花卉",
        "url": "http://img6.iplant.cn/image2/b/1871166.jpg",
        "captions": "重瓣榆叶梅"
    },
    {
        "name": "0e656f0b4a40fd118c113aaef7416a3f.jpg",
        "tag": "花卉",
        "url": "http://img7.iplant.cn/image2/b/1871167.jpg",
        "captions": "重瓣榆叶梅"
    },
    {
        "name": "e370ac0b924499158c00733f75d1d239.jpg",
        "tag": "花卉",
        "url": "http://img0.iplant.cn/image2/b/1871170.jpg",
        "captions": "复羽叶栾树"
    },
    {
        "name": "77ab8dc175fb64c9701e917be1584fa5.jpg",
        "tag": "花卉",
        "url": "http://img1.iplant.cn/image2/b/1871183.jpg",
        "captions": "柳叶香彩雀"
    },
    {
        "name": "cdcf62fbb41757a3b8cf7e08e813d235.jpg",
        "tag": "花卉",
        "url": "http://img4.iplant.cn/image2/b/1871184.jpg",
        "captions": "柳叶香彩雀"
    },
    {
        "name": "03410634b2f8890aa2db0b3022f6192e.jpg",
        "tag": "花卉",
        "url": "http://img5.iplant.cn/image2/b/1871185.jpg",
        "captions": "柳叶香彩雀"
    },
    {
        "name": "8741369a97a1fbef8d36ee1c64972066.jpg",
        "tag": "花卉",
        "url": "http://img6.iplant.cn/image2/b/1871186.jpg",
        "captions": "柳叶香彩雀"
    }

Some URLs in Landscape.json return 404 and cannot be accessed:

['http://img.99118.com/630/201711/024380011A1C3976.jpg',
 'http://img.99118.com/630/201711/0243800277FB6158.jpg',
 'http://img.99118.com/630/201711/02438003ED3FFA68.jpg',
 'http://img.99118.com/630/201711/0243800441BDF0F1.jpg',
 'http://img.99118.com/630/201711/02438005F3C1D2F0.jpg']

Other URLs in Landscape.json still time out after repeated download attempts and cannot be downloaded:

['http://img.99118.com/630/201906/027010099460C760.jpg',
 'http://img.99118.com/630/201912/02749004AC628639.jpg',
 'http://img.99118.com/630/201910/027330060799DC6B.jpg',
 'http://img.99118.com/630/201906/026960016722002A.jpg',
 'http://img.99118.com/630/201911/02745017A2A57148.jpg']
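
For anyone who wants to reproduce the split between 404s and timeouts, here is a minimal sketch using only the Python standard library. It is not part of img2dataset or the WuDaoMM tooling. The names `check_url` and `summarize` are hypothetical helpers, the 10-second timeout is an arbitrary choice, and it assumes the image hosts answer `HEAD` requests, which may not hold for every server.

```python
import socket
import urllib.error
import urllib.request
from collections import Counter


def check_url(url, timeout=10.0, opener=urllib.request.urlopen):
    """Classify one URL as 'ok', 'http_<code>', 'timeout', or 'error'.

    `opener` is injectable so the function can be exercised without
    network access; by default it is urllib.request.urlopen.
    """
    # HEAD avoids downloading the image body (assumption: host supports it).
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener(request, timeout=timeout) as response:
            return "ok" if response.status == 200 else f"http_{response.status}"
    except urllib.error.HTTPError as exc:
        # The server answered with an error status, e.g. 404.
        return f"http_{exc.code}"
    except (socket.timeout, TimeoutError):
        return "timeout"
    except urllib.error.URLError as exc:
        # Timeouts during connect are often wrapped in URLError.
        if isinstance(exc.reason, (socket.timeout, TimeoutError)):
            return "timeout"
        return "error"
    except OSError:
        return "error"


def summarize(urls, check=check_url):
    """Count how many URLs fall into each status class."""
    return Counter(check(url) for url in urls)
```

Calling `summarize(...)` on the `url` values from Flower.json or Landscape.json would give a rough count per failure type, which may help tell permanently removed images apart from a slow or rate-limited host.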

@BAAI-WuDao