Использовать названия из Wikidata для wikipedia-затегированных объектов
d1g opened this issue · comments
Раз в OSM не обозначается old_name, alt_name, то хоть при геокодинге должно учитываться:
http://www.openstreetmap.org/relation/337422
- Питер
- Петербург
- СПб
- etc
It would be great to have this feature!
To have that data in gazetteer out, I have to have local dump of wikidata, If I'll query wikidata duiring data processing it will take forever to generate data. So I'll probabbly implement that, if it's possible to get wikidata subset.
to have local dump of wikidata
a SPARQL query to fetch labels
44791 Results in 452 ms thanks to WDQS
@d1g, cool, few more things:
- How to get language code for label?
- How to get type-codes? wikidata as I think could be binded not only to localities but also to streets and boundaries.
As an option, I can get a full list of wikidata identifiers during first pass and make a batch query to download all the wikidata in few calls to API, and join that data later, but it's easier and faster to have that data downloaded before gazetteer run.
- simply add a
(LANG(?l) as ?lang)
after?l
- I'm not exactly sure what codes do you mean. Could you please give an example for SPB or other item?
Here is the code for cities:
?item wdt:P31 wd:Q515; # cities
How could i get something like:
?item wdt:P31 wd:Q515; # cities
or ?item wdt:P31 wd:Q123; # states
or ?item wdt:P31 wd:Q1234; # streets
- for street http://tools.wmflabs.org/sqid/#/view?id=Q79007 ex https://www.wikidata.org/wiki/Q152244
- data about states isn't uniform in OSM-like sense: https://www.wikidata.org/wiki/Q835714 https://www.wikidata.org/wiki/Q35657 and so on
Heh, that's what I've been afraid of. Are there any analogs of rdbs joins and subqueries or recursive queries in wikidata?
- it is possble to nest selects, there are many existing examples here
- sparql-vs-sql
I prefer to load data using SPARQL, but perform really complex precessing using regular tools e.g. Python
@kiselev-dv, it is possible to fetch any division using Q10864048 item: 3623 Results in 6046 ms