Questions tagged [gis]

Geographic information systems (GIS) provide facilities for storing, managing and analyzing geographic data which can be used for statistical purposes.

Geographic information systems (GIS) provide facilities for storing, managing and analyzing geographic data. Since GIS is not a unified system, the quality of data depends on how they were collected with GPS and aerial photography being the most reliable sources of data. GIS can relate any type of data to certain locations given that latitude and longitude (sometimes also elevation) are provided. This makes it an attractive system for those working with spatial statistical models and data visualization with maps. An example of the use of GIS is the relation of crime and unemployment rates across districts of a city.

The software package Stata has extensive routines for spatial data analysis and data visualization as do R and Matlab. The documentations are vast though the following links may provide a starting point for further reference:
Spatial Data Analysis in Stata - an Overview for Stata
Spatial Statistics Software and Spatial Data for Matlab
R Spatial Tips for R

46 questions
23
votes
2 answers

What statistical model or algorithm could be used to solve the John Snow Cholera problem?

I'm interested in learning how to develop a geographic approximation of some kind of epicenter based on the data from the John Snow Cholera outbreak. What statistical modeling could be used to solve such a problem without prior knowledge of where…
cylondude
  • 333
  • 1
  • 6
7
votes
2 answers

How to capture competitive spatial interactions between multiple stores and customers

I am estimating sales with data on customer and store locations and attributes using a Huff-style model, where sales decay with drive time and increase with attractiveness of the store. One hypothetical instance of the problem is illustrated below.…
dimitriy
  • 31,081
  • 5
  • 63
  • 138
6
votes
1 answer

Testing for non-random overlap of polygons

I have two polygon shapefiles and I want to see to what extent the observed amount of overlap is due to chance. I'm thinking of some kind of permutation test, but not sure of the best way to proceed. One idea is randomly relocating the polygons in…
Ben
  • 729
  • 9
  • 22
5
votes
1 answer

Can Moran's I results be compared against each other in different geographies?

Background I have 50 service areas, each service area has a number of points associated with. In order to compare the areas to each other, one idea was to describe the degree to which its service area's set of points were clustered or dispersed. The…
adin
  • 215
  • 1
  • 8
5
votes
0 answers

Help with designing the analysis of a radio telemetry study

I was wondering if there are any spatial ecologists out there who can provide some insight on the trouble I'm having with my Masters research. I am looking at the association a bat species to high density aggregations of a flowering plant. The bats…
Georgia
  • 51
  • 1
4
votes
2 answers

How to create training data for CNN using remote sensing imagery

Before I start with the issue I would like to touch base with some background information. I had been working with Random Forest for classification of Remote Sensing data, here the classification was based on the pixel value of the remotely sensed…
4
votes
3 answers

How do I calculate the proportion of smaller squares covered by a larger circle?

I would like to determine the proportion covered by the circle (say r = 2800m) of each cell (say 1000 x 1000 m). The circle is centered around the center of the focal cell of a rectangular neighborhood. I provide a rough visual estimation in the…
4
votes
2 answers

The right way to use Machine Learning to predict latitude and longitude

There are some simple ML techniques that can be used to easily predict latitude/longitude co-ordinates, such as predicting the latitude and longitude separately using two different models. However, I get the sense that this is a simple hack that…
bnsmith
  • 95
  • 1
  • 9
3
votes
2 answers

How to predict demand from historical "continuous" event data (date, lat, lon)?

I am attempting to predict demand for our service, both quantity but maybe more important, location (hotspots). I am by no means an experienced statistician, so I need some help :) I have all the historic data for our service, date, latitude and…
Zebs
  • 251
  • 2
  • 4
3
votes
1 answer

Clustering spatio-temporal data?

I have data in the form of timestamp,lat,long which is gps data for users. I'm new to data mining and want to understand how can I start clustering these data to understand more about it. Should I like build a matrix of one trajectory v/s the other…
gizgok
  • 569
  • 5
  • 9
3
votes
3 answers

95% confidence ellipse of Lenth's maximum likelihood estimation

I'm attempting to write a simple mobile app to help colleagues triangulate the signal of transmitter-fitted animals in the field. I've been using Russell Lenth's 1981 method of Maximum Likelihood Estimation (PDF: On Finding the Source of a…
petroica
  • 33
  • 5
3
votes
0 answers

Performing spatial logistic regression in R

I am trying to perform a logistic regression with the following code Y ~ x1+x2+x3,data=data, family=binomial(link="logit"). However on inspection of both the outcome and predictors i noticed that they are characterized by spatial…
Paulo
  • 31
  • 5
2
votes
2 answers

How to create a GIS basemap in R?

I am an expert GIS user moving towards R more and more. I have been using R for some basic regressions and such, but I would like to begin to use and manipulate GIS data in R. How can I create a basemap graphic similar to the one in this…
mikeLdub
  • 121
  • 3
2
votes
1 answer

Clustering spatial data based on location and values

I'm looking for a way, preferably in R, to create a cluster of point data (specifically, the centroids of UK postcodes), where each cluster comes as close as possible to containing a certain number of people (between 100 and 200). The data I have is…
Tom
2
votes
0 answers

Rate Smoothing in Spatial Data -

I asked this question on Reddit a few days ago but didn't get any responses. Working on some spatial epidemiology research a while back, that has resurfaced, I was told that I should smooth the incidence rates to account for areas with small…
Ross Wardrup
  • 165
  • 1
  • 4
1
2 3 4