number of database queries

Question

number of database queries

clime opened this issue 12 years ago · comments

There is a significant performance hit for making SELECT for each cell in gridCluster and kmeansCluster. Have you thought about reducing it to just one (or a few) queries? I am not completely sure that it is possible but I feel it should be and it would improve performance greatly (especially if you have lots of cells). Have you thought about it? I am looking for a way to do it but I would like to hear from you first what you think.

Thomas Uher · Answer 1 · Sun Apr 21 2013 17:12:09 GMT+0800 (China Standard Time)

For the kmeans method, this should be possible and is an interesting idea. One would have to calculate the number of visible cells and then get the number of clusters with k*cellcount. After that, only one SELECT would be needed, targeting the current (grid)bounds instead of each grid cell. Furthermore, this would reduce the amount of times the distance cluster has to be run. I will give that a try. Thank you for this input.

For the gridCluster I currently don't know how the amount of SELECT could be reduced, but that does not mean it is not possible. If you (or anyone else) knows a solution it would be highly appreciated.

Thomas Uher · Answer 2 · Thu Jun 05 2014 01:14:58 GMT+0800 (China Standard Time)

I might have found a way querying the database only once by using a grid calculated by a postgis function:
http://gis.stackexchange.com/questions/16374/how-to-create-a-regular-polygon-grid-in-postgis
Hopefully I will find the time to test this.

Thomas Uher · Answer 3 · Thu Jul 10 2014 00:01:31 GMT+0800 (China Standard Time)

query amount reduced using temporary tables

Michal Novotný · Answer 4 · Wed Jul 16 2014 12:01:29 GMT+0800 (China Standard Time)

Good job. I can't test because i am on travels but good job.