Questions tagged [geography]
23 questions
11
votes
1 answer
What is the proper way of calculating the kernel density estimate from geographical coordinates?
I have to calculate the 2d kernel density estimate (kde) from a list of latitude and longitude coordinates. But one degree in latitude is not the same distance as one degree in longitude, this means that the individual kernels would be oval,…

Aaron de Windt
- 211
- 1
- 4
3
votes
0 answers
A reference request for a population statistic---experienced population density?
We all know that countries (for example) have population densities, but suppose we want to know what is the actual experienced population density (that is, if, say, in Sweden 95% of the population live in Stockholm, and a couple of hardy Lapps live…

Igor Rivin
- 287
- 1
- 3
- 11
3
votes
0 answers
How do I compare two geographic datasets with differing subdivisions?
Suppose I'm trying to figure out the relationship between home-ownership and wine drinking.
My information on where home-owners live comes from the census, and it divides a given geographical area up into zones like this:
This (made up) data is…

dain
- 365
- 1
- 10
3
votes
1 answer
Partial correlations among three distance matrices
I am trying to test the hypothesis that related species should deviate more in niche space the more they overlap in geographic space. In other words, related species either forage similarly and don't show a lot of geographic overlap, or if they…

forlooper
- 63
- 6
2
votes
1 answer
Years as continuous variable
Can I use "years" as a continuous variable ("years" as calendar years from 1984 to 2014) to see if NDVI (normalized difference vegetation index), of the same area at the same time (summer), has changed positively or negatively over the years, for…

PDS
- 21
- 2
2
votes
3 answers
Testing Hypothesis with Time series and Location Data
I have Data on Prices of house. Along with these variables.
1) Location
i)Latitude-Longitude
ii)City and State
2) Attribute of house.
i) No. of bedrooms and bathrooms (could be taken as proxy for size?)
3)Year Built and prices of house on…

A-dude
- 450
- 4
- 11
2
votes
0 answers
Density on Worldmap based on Relative Frequency
I'm trying to plot a 2d density on a worldmap. The problem is most of my points are around the same coordinates. My current map looks like this:
Code for map:
map = get_map(location = c(lon=43, lat=36), zoom = 6)
ggmap(map) +…

Nico
- 21
- 2
2
votes
1 answer
Correlation Between Locations
I have two data sets both matrices of latitudes and longitudes.
$A = [(X,Y),(X_2,Y_2)]$
$B = [(X_3,Y_3),(X_4,Y_4),(X_5,Y_5)]$
They are of different sizes
I want to calculate by how much each of the pairs of (x,y) coordinates in A correlate with…

rmaspero
- 121
- 3
2
votes
0 answers
Input geolocation Data in Neural Network
I am working on a Neural Network that should predict certain transport-tariffs based on start and end region of the transport.
The region can take different forms, from a whole country to a range of zip codes or just from one zip code to another.…

Paul
- 21
- 3
2
votes
1 answer
What is the right way to incorporate geographic data into a prediction?
Illustrative Problem
I have a number of geographically dispersed stores where I would like to understand what factors predict revenue. Each store has certain properties (sq ft, age, employees, etc.). I also have data for the area surrounding the…

Chris
- 155
- 1
- 6
2
votes
1 answer
ML model selection for prediction of latitude and longitude
I am doing a project in which my aim is to predict the likely locations of a set of latitude/longitude points based on a couple of variables. Since I've never done any ML on locational data, which models would you suggest to try out first?
I am…

slazien
- 103
- 1
- 12
2
votes
2 answers
Predicting lat/long from binary features
I have a number of observations that occur around my city (a small area), and several of them have latitude and longitude. I have been looking into predicting the latitude/longitude of the observations that are not geographically tagged, but I am…

neelshiv
- 314
- 3
- 12
1
vote
0 answers
How to assess geographical representativity of a sub-area?
My geographical zone $A$ is subdivided in $k$ different types of areas: $A_1 + A_2 + \dots{} + A_k = A$. These have been measured on a map with neglictible uncertainty: i.e. for any point on the map, it is unambiguous whether the point lies on type…

iago-lito
- 143
- 7
1
vote
0 answers
Clustering/pairing users based on multiple lat/lon values?
I have a data set that contains entities and lat/lon values for each one, potentially multiple lat/lon values.
I believe that some of these entities may actually be the same person, although I can't tell this from the data. Is there a method that…

neelshiv
- 314
- 3
- 12
1
vote
3 answers
Averaging predictions from two different models
First, a short introduction. I am predicting latitude/longitude using Random Forests and XGBoost based on several environmental variables and custom features such as cluster IDs (there are obvious spatial clusters in data). Based on validation set,…

slazien
- 103
- 1
- 12